The article presents the experience of developing computer ontology as one of the tools for Tibetan idioms processing. A computer ontology that contains a consistent specification of meanings of lexical units with different relations between them represents a model of lexical semantics and both syntactic and semantic valencies, reflecting the Tibetan linguistic picture of the world. The article presents an attempt to classify Tibetan idioms, including compounds, which are idiomatized clips of syntactic groups that have frozen inner syntactic relations and are often characterized by omission of grammatical morphemes; and the application of this classification for idioms processing in computer ontology. The article also proposes methods of using computer ontology for avoiding idioms processing ambiguity.

Original languageEnglish
Title of host publicationText, Speech, and Dialogue - 21st International Conference, TSD 2018, Proceedings
EditorsPetr Sojka, Aleš Horák, Ivan Kopecek, Karel Pala
PublisherSpringer Nature
Pages76-83
Number of pages8
ISBN (Print)9783030007935
DOIs
StatePublished - 10 Sep 2018
Event21st International Conference on Text, Speech, and Dialogue, TSD 2018 - Brno, Czech Republic
Duration: 11 Sep 201814 Sep 2018

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11107 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference21st International Conference on Text, Speech, and Dialogue, TSD 2018
Country/TerritoryCzech Republic
CityBrno
Period11/09/1814/09/18

    Research areas

  • Compounds, Computer ontology, Corpus linguistics, Idioms, Immediate constituents, Natural language processing, Tibetan corpus, Tibetan language

    Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

ID: 35769642