End-to-End Speech Recognition in Russian › SPbU Researchers Portal

Standard

End-to-End Speech Recognition in Russian. / Markovnikov, Nikita; Kipyatkova, Irina; Lyakso, Elena.

Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings. ed. / Rodmonga Potapova; Oliver Jokisch; Alexey Karpov. Springer Nature, 2018. p. 377-386 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11096 LNAI).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review

Harvard

Markovnikov, N, Kipyatkova, I & Lyakso, E 2018, End-to-End Speech Recognition in Russian. in R Potapova, O Jokisch & A Karpov (eds), Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), vol. 11096 LNAI, Springer Nature, pp. 377-386, 20th International Conference on Speech and Computer, Leipzig, Germany, 18/09/18. https://doi.org/10.1007/978-3-319-99579-3_40

APA

Markovnikov, N., Kipyatkova, I., & Lyakso, E. (2018). End-to-End Speech Recognition in Russian. In R. Potapova, O. Jokisch, & A. Karpov (Eds.), Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings (pp. 377-386). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11096 LNAI). Springer Nature. https://doi.org/10.1007/978-3-319-99579-3_40

Vancouver

Markovnikov N, Kipyatkova I, Lyakso E. End-to-End Speech Recognition in Russian. In Potapova R, Jokisch O, Karpov A, editors, Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings. Springer Nature. 2018. p. 377-386. (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)). https://doi.org/10.1007/978-3-319-99579-3_40

Author

Markovnikov, Nikita ; Kipyatkova, Irina ; Lyakso, Elena. / End-to-End Speech Recognition in Russian. Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings. editor / Rodmonga Potapova ; Oliver Jokisch ; Alexey Karpov. Springer Nature, 2018. pp. 377-386 (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)).

BibTeX

@inproceedings{e641e52feb6d4b1d84f987bc7a77065b,

title = "End-to-End Speech Recognition in Russian",

abstract = "End-to-end speech recognition systems incorporating deep neural networks{\^A} (DNNs) have achieved good results. We propose applying CTC{\^A} (Connectionist Temporal Classification) models and attention-based encoder-decoder in automatic recognition of the Russian continuous speech. We used different neural network models such Long short-term memory{\^A} (LSTM), bidirectional LSTM and Residual Networks to provide experiments. We got recognition accuracy a bit worse than hybrid models but our models can work without large language model and they showed better performance in terms of average decoding speed that can be helpful in real systems. Experiments are performed with extra-large vocabulary (more than 150K words) of Russian speech.",

keywords = "Deep learning, End-to-end models, Russian speech, Speech recognition",

author = "Nikita Markovnikov and Irina Kipyatkova and Elena Lyakso",

year = "2018",

month = sep,

day = "1",

doi = "10.1007/978-3-319-99579-3_40",

language = "English",

isbn = "9783319995786",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer Nature",

pages = "377--386",

editor = "Rodmonga Potapova and Oliver Jokisch and Alexey Karpov",

booktitle = "Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings",

address = "Germany",

note = "20th International Conference on Speech and Computer, SPECOM 2018 ; Conference date: 18-09-2018 Through 22-09-2018",

}

RIS

TY - GEN

T1 - End-to-End Speech Recognition in Russian

AU - Markovnikov, Nikita

AU - Kipyatkova, Irina

AU - Lyakso, Elena

PY - 2018/9/1

Y1 - 2018/9/1

N2 - End-to-end speech recognition systems incorporating deep neural networksÂ (DNNs) have achieved good results. We propose applying CTCÂ (Connectionist Temporal Classification) models and attention-based encoder-decoder in automatic recognition of the Russian continuous speech. We used different neural network models such Long short-term memoryÂ (LSTM), bidirectional LSTM and Residual Networks to provide experiments. We got recognition accuracy a bit worse than hybrid models but our models can work without large language model and they showed better performance in terms of average decoding speed that can be helpful in real systems. Experiments are performed with extra-large vocabulary (more than 150K words) of Russian speech.

AB - End-to-end speech recognition systems incorporating deep neural networksÂ (DNNs) have achieved good results. We propose applying CTCÂ (Connectionist Temporal Classification) models and attention-based encoder-decoder in automatic recognition of the Russian continuous speech. We used different neural network models such Long short-term memoryÂ (LSTM), bidirectional LSTM and Residual Networks to provide experiments. We got recognition accuracy a bit worse than hybrid models but our models can work without large language model and they showed better performance in terms of average decoding speed that can be helpful in real systems. Experiments are performed with extra-large vocabulary (more than 150K words) of Russian speech.

KW - Deep learning

KW - End-to-end models

KW - Russian speech

KW - Speech recognition

UR - http://www.scopus.com/inward/record.url?scp=85053774772&partnerID=8YFLogxK

U2 - 10.1007/978-3-319-99579-3_40

DO - 10.1007/978-3-319-99579-3_40

M3 - Conference contribution

AN - SCOPUS:85053774772

SN - 9783319995786

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 377

EP - 386

BT - Speech and Computer - 20th International Conference, SPECOM 2018, Proceedings

A2 - Potapova, Rodmonga

A2 - Jokisch, Oliver

A2 - Karpov, Alexey

PB - Springer Nature

T2 - 20th International Conference on Speech and Computer

Y2 - 18 September 2018 through 22 September 2018

ER -

ID: 36521378