The Ruthenian UD treebank includes texts written in the territories of modern Belarus, Lithuania, Ukraine, and Poland ca. 1300-1700. A sample of legal and nonfiction texts is drawn from the Ruthenian Corpus.
The Ruthenian UD treebank includes texts written in "prosta mova" ("ruska mova", Old Belarusian, Old Ukrainian), a Western descendant of Old East Slavic spoken in the territories of modern Belarus, Lithuania, Ukraine, and Poland. A sample of legal and nonfiction texts written ca. 1380-1650 is drawn from the Ruthenian Corpus, a historical corpus currently being compiled by an independent research partnership.
We are grateful to Maria Ermolova, Vladimir Shatin, Natalia Iordani, Oksana Nika, Andrey Yakuboy, Sofia Chernousenko, and Maxim Eremeev for their fruitful collaboration, efforts in text preparation, comments and discussions.
-
2024-11-15 v2.14
- Texts in Old Ukrainian added to test, dev, and train.
- Texts of the Lithuanian Metrica, Book of Inscriptions, Vol. 3 (1440-1498), added to train and dev.
-
2023-11-15 v2.13
- Texts of Polotsk letters added to dev and train; lemmas & grammar & syntax corrected.
-
2023-05-15 v2.12
- Texts of Polotsk letters added; lemmas added; grammar & syntax corrected.
-
2022-11-15 v2.11
- Initial release in Universal Dependencies.
=== Machine-readable metadata (DO NOT REMOVE!) ================================
Data available since: UD v2.11
License: CC BY-SA 4.0
Includes text: yes
Genre: legal nonfiction
Lemmas: manual native
UPOS: manual native
XPOS: not available
Features: manual native
Relations: manual native
Contributors: Lyashevskaya, Olga; Sitchinava, Dmitri; Shvedova, Maria
Contributing: elsewhere
Contact: olesar@yandex.ru
===============================================================================