Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
Distributed representation of melodic contours. / Кочаров, Даниил Александрович; Меньшикова, Алла Павловна.
Proceedings of Speech Prosody 2018. Vol. 2018-June 2018. p. 167-171 (Proceedings of the International Conference on Speech Prosody).Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
}
TY - GEN
T1 - Distributed representation of melodic contours
AU - Кочаров, Даниил Александрович
AU - Меньшикова, Алла Павловна
PY - 2018
Y1 - 2018
N2 - We introduce a new computational model for melodic contours—melody embeddings. It is based on the approach of distributional semantics where embeddings represent units as continuous vectors in a multi-dimensional space based on hypothesis that units with similar meaning are used in similar contexts. This paradigm is applied to melodic contours and their segments. Melodic contours are represented by vectors of the same dimensionality independent on their length and shape. We successfully evaluated the ability of the proposed model to measure the distance between melodic contours. The results of applying the model for a task of prominent words detection have not showed the improvement over traditional prosodic features. Nevertheless we assume the model to be very promising. The possible applications for the proposed unsupervised prosodic model include processing of speech of underresourced languages, modelling prosodic variability for textto-speech synthesis, recognition and classification of prosodic events by means of deep-learning algorithms.
AB - We introduce a new computational model for melodic contours—melody embeddings. It is based on the approach of distributional semantics where embeddings represent units as continuous vectors in a multi-dimensional space based on hypothesis that units with similar meaning are used in similar contexts. This paradigm is applied to melodic contours and their segments. Melodic contours are represented by vectors of the same dimensionality independent on their length and shape. We successfully evaluated the ability of the proposed model to measure the distance between melodic contours. The results of applying the model for a task of prominent words detection have not showed the improvement over traditional prosodic features. Nevertheless we assume the model to be very promising. The possible applications for the proposed unsupervised prosodic model include processing of speech of underresourced languages, modelling prosodic variability for textto-speech synthesis, recognition and classification of prosodic events by means of deep-learning algorithms.
KW - Distributed representations
KW - Embeddings
KW - Melody
KW - Prosody
KW - Unsupervised clustering
UR - http://www.scopus.com/inward/record.url?scp=85050220264&partnerID=8YFLogxK
U2 - 10.21437/SpeechProsody.2018-34
DO - 10.21437/SpeechProsody.2018-34
M3 - Conference contribution
VL - 2018-June
T3 - Proceedings of the International Conference on Speech Prosody
SP - 167
EP - 171
BT - Proceedings of Speech Prosody 2018
ER -
ID: 27388055