The paper is devoted to the construction of the Russian thesaurus on Corpus Linguistics. The linguistic resource involved in research is the Russian corpus on Corpus Linguistics developed in St.-Petersburg State University together with the Institute of Linguistic Studies and different vocabularies. The semi-automatic terminology extraction is performed with the help of the linguistic and statistical tools which allow generation of the lists of single-word and multi-word terms provided with frequency data and lexical-syntactic patterns. The lexical-syntactic patterns are used in the analysis of the contexts which contain the definitions of the terms, expose the relationships between the terms, provide their synonyms, translation equivalents, etc.
Original languageRussian
Title of host publicationИнформационные технологии и письменное наследие: El'Manuscript-10
Subtitle of host publicationМатериалы Международной научной конференции
Place of PublicationУфа
PublisherЭлектронное издательство "Вагант"
Pages95-98
ISBN (Print)978-5-66678-639-8
StatePublished - 2010
EventИнформационные технологии и письменное наследие: El'Manuscript-10 - Уфа; Ижевск, Russian Federation
Duration: 28 Oct 201031 Oct 2010

Conference

ConferenceИнформационные технологии и письменное наследие: El'Manuscript-10
Country/TerritoryRussian Federation
CityУфа; Ижевск
Period28/10/1031/10/10

ID: 4633160