Roots2

WANCA

(Beta version*)

Wanca (from Proto Uralic *wanča ‘root’) is a collection of links to web pages written in various Uralic languages. The pages have been found using the automated system developed in the SUKI project. At the moment, Wanca consists of 57 491 links to 919 sites containing pages written in 36 of the smaller Uralic languages.

You can browse the link collection through the languages and language groups or by the list of top-domains. See instructions for how to use Wanca. Native speakers and scholars are invited to create a personal account and help us verify the language labels given to the pages by our automatic language identifier. The verified links will be used for new crawls and to further improve our language identifier. After signing up you can apply for expert rights by sending us a message through our contact form.

In the SUKI project we are promoting the smaller Uralic languages and we do not intentionally collect links to pages written in Hungarian, Finnish or Estonian. Wanca is the result of the Language Programme of the Kone Foundation for small Finno-Ugric languages.

* The site is still under construction. When initially identifying the language of the links we found, we did not want to miss any potential texts written in small Uralic languages. Therefore, the site contains many links that turned out not to contain the target languages. We are re-identifying the links with a stricter filter, but with automatical language identification it is not possible to identify the language of all web pages correctly so we need the help of those who know these languages. We also wanted to get feedback on both the site and the links and, therefore, opened the site for public use already at this stage. A list of the recent updates made to Wanca can be found here.