• Анатолий Владимирович Венцов
  • Юлия Олеговна Нигматулина
  • Ольга Васильевна Раева
  • Елена Игоревна Риехакайнен
  • Наталия Арсеньевна Слепокурова
A corpus of spoken Russian is used to create a database of “broken” discourse units. The database includes over 700 elements, which are either semantic-syntactic units broken by pauses or their fragments. Every unit of the database is to be provided with a sound file, an orthographic transcription and the description of its melodic contour. The principles of the description are discussed in the paper.
Translated title of the contributionFROM A SPEECH CORPUS TO A DATABASE OF "BROKEN" DISCOURSE UNITS
Original languageRussian
Title of host publicationКорпусная лингвистика – 2015
Subtitle of host publicationТруды международной конференции
Place of PublicationСПб
PublisherИздательство Санкт-Петербургского университета
Pages154-161
ISBN (Print)9785846514980
StatePublished - 2015
EventМеждународная конференция "Корпусная лингвистика - 2015" - Санкт-Петербург, Russian Federation
Duration: 22 Jun 201526 Jun 2015

Conference

ConferenceМеждународная конференция "Корпусная лингвистика - 2015"
Country/TerritoryRussian Federation
CityСанкт-Петербург
Period22/06/1526/06/15

ID: 11422444