The paper briefly describes the following databases: "Online Sound Archives from St. Petersburg Collections", "Regional Variants of the Russian Speech", and "Multimedia Dictionaries of the minor Languages of Russia", the principle feature of which is the built-in support for scientific, practical and cultural researches. Though these databases are addressed to researchers engaged mainly in Spoken Language Processing and because of that their main object is Sound, proposed database ideology and general approach to text/speech data representation and access may be further used for elaboration of various language resources containing text, audio and video data. Such approach requests for special representation of the database material. Thus, all text and sound files should be accompanied by information on their multi-level segmentation, which should allow the user to extract and analyze any segment of text or speech. Each significant segment of the database should be perceived as a potential object of investigation and should be supplied by tables of descriptive parameters, mirroring its various characteristics. The list of these parameters for all potential objects is open for further possible extension.

Original languageEnglish
StatePublished - 2000
Event2nd International Conference on Language Resources and Evaluation, LREC 2000 - Athens, Greece
Duration: 31 May 20002 Jun 2000

Conference

Conference2nd International Conference on Language Resources and Evaluation, LREC 2000
Country/TerritoryGreece
CityAthens
Period31/05/002/06/00

    Scopus subject areas

  • Linguistics and Language
  • Library and Information Sciences
  • Education
  • Language and Linguistics

ID: 88462733