Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
F0 contour prediction for the Kazakh language. / Kaliyev, Arman; Matveev, Yuri N.; Lyakso, Elena E.; Rybin, Sergey V.
ICEMIS '19: Proceedings of the 5th International Conference on Engineering and MIS. Association for Computing Machinery, 2019. 5 (ACM International Conference Proceeding Series).
TY - GEN
T1 - F0 contour prediction for the Kazakh language
AU - Kaliyev, Arman
AU - Matveev, Yuri N.
AU - Lyakso, Elena E.
AU - Rybin, Sergey V.
PY - 2019/6/6
Y1 - 2019/6/6
N2 - The article presents work on predicting fundamental frequency (F0) values for the Kazakh language. The fundamental frequency plays one of the most important roles in the perception of speech, and at the same time modelling continuous F0 is one of the most difficult tasks in the development of intonational speech synthesis systems. The main and obvious difficulty is that a person is able to say the same sentence with different intonations and with different tones. In this work, we used deep neural networks for accurate, high-quality prediction of F0 values that are as close as possible to natural-sounding Kazakh speech.
AB - The article presents work on predicting fundamental frequency (F0) values for the Kazakh language. The fundamental frequency plays one of the most important roles in the perception of speech, and at the same time modelling continuous F0 is one of the most difficult tasks in the development of intonational speech synthesis systems. The main and obvious difficulty is that a person is able to say the same sentence with different intonations and with different tones. In this work, we used deep neural networks for accurate, high-quality prediction of F0 values that are as close as possible to natural-sounding Kazakh speech.
KW - DNN
KW - Fundamental frequency
KW - Informants with atypical development
KW - Intonation
KW - Kazakh language
KW - LSTM
KW - Speech synthesis
UR - http://www.scopus.com/inward/record.url?scp=85069165325&partnerID=8YFLogxK
UR - http://www.mendeley.com/research/f0-contour-prediction-kazakh-language
U2 - 10.1145/3330431.3330436
DO - 10.1145/3330431.3330436
M3 - Conference contribution
AN - SCOPUS:85069165325
SN - 9781450372121
T3 - ACM International Conference Proceeding Series
BT - ICEMIS '19
PB - Association for Computing Machinery
T2 - 5th International Conference on Engineering and MIS, ICEMIS 2019
Y2 - 6 June 2019 through 8 June 2019
ER -
ID: 46097990