The development of a morphosyntactic analyzer of Tibetan texts aims to create a consistent formal grammatical description (a formal grammar) of the Tibetan language, covering all grammatical levels of the language system, from morphosyntax (the syntax of morphemes) to the syntax of complex sentences and supra-phrasal entities. A new version of the markup is being created to reflect both immediate-constituent structures and dependency structures. The formal grammar reflects all classes of models and their grammatical properties. In addition to the formal grammar module, token vocabularies are created, along with a central module for Tibetan grammatical categories, their possible values, and the restrictions on their combination.
Translated title of the contribution: COMPUTER MORPHOSYNTACTIC ANALYSIS OF THE NON SEGMENTED TEXT (BASED ON THE MATERIAL OF THE CORPUS OF TIBETAN GRAMMAR TREATISES)
Original language: Russian
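Title of host publication: Структурная и прикладная лингвистика. Межвузовский сборник (Structural and Applied Linguistics. Interuniversity Collection)
Subtitle of host publication: Выпуск 12. К 60-летию отделения прикладной, компьютерной и математической лингвистики СПбГУ (Issue 12. On the 60th anniversary of the Department of Applied, Computational and Mathematical Linguistics of St. Petersburg State University)
Place of Publication: СПб. (St. Petersburg)
Publisher: Издательство Санкт-Петербургского университета (St. Petersburg University Press)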
Pages: 69-80
State: Published - 2019

Publication series

Name: Структурная и прикладная лингвистика (Structural and Applied Linguistics)
Publisher: Изд-во С.-Петерб. ун-та (St. Petersburg University Press)
ISSN (Print): 0202-2400

Research areas

• Word segmentation, treebanks, formal grammars, parsing, morphosyntax, Tibetan grammar

ID: 41768965