Distributed representation of melodic contours

Standard

Distributed representation of melodic contours. / Кочаров, Даниил Александрович; Меньшикова, Алла Павловна.

Proceedings of Speech Prosody 2018. Vol. 2018-June 2018. p. 167-171 (Proceedings of the International Conference on Speech Prosody).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review

Harvard

Кочаров, ДА & Меньшикова, АП 2018, Distributed representation of melodic contours. in Proceedings of Speech Prosody 2018. vol. 2018-June, Proceedings of the International Conference on Speech Prosody, pp. 167-171. https://doi.org/10.21437/SpeechProsody.2018-34

APA

Кочаров, Д. А., & Меньшикова, А. П. (2018). Distributed representation of melodic contours. In Proceedings of Speech Prosody 2018 (Vol. 2018-June, pp. 167-171). (Proceedings of the International Conference on Speech Prosody). https://doi.org/10.21437/SpeechProsody.2018-34

Vancouver

Кочаров ДА, Меньшикова АП. Distributed representation of melodic contours. In Proceedings of Speech Prosody 2018. Vol. 2018-June. 2018. p. 167-171. (Proceedings of the International Conference on Speech Prosody). https://doi.org/10.21437/SpeechProsody.2018-34

Author

Кочаров, Даниил Александрович ; Меньшикова, Алла Павловна. / Distributed representation of melodic contours. Proceedings of Speech Prosody 2018. Vol. 2018-June 2018. pp. 167-171 (Proceedings of the International Conference on Speech Prosody).

BibTeX

@inproceedings{713b4cb3a7c940439b92e0f2fdde9394,

title = "Distributed representation of melodic contours",

abstract = "We introduce a new computational model for melodic contours—melody embeddings. It is based on the approach of distributional semantics where embeddings represent units as continuous vectors in a multi-dimensional space based on hypothesis that units with similar meaning are used in similar contexts. This paradigm is applied to melodic contours and their segments. Melodic contours are represented by vectors of the same dimensionality independent on their length and shape. We successfully evaluated the ability of the proposed model to measure the distance between melodic contours. The results of applying the model for a task of prominent words detection have not showed the improvement over traditional prosodic features. Nevertheless we assume the model to be very promising. The possible applications for the proposed unsupervised prosodic model include processing of speech of underresourced languages, modelling prosodic variability for textto-speech synthesis, recognition and classification of prosodic events by means of deep-learning algorithms.",

keywords = "Distributed representations, Embeddings, Melody, Prosody, Unsupervised clustering",

author = "Кочаров, {Даниил Александрович} and Меньшикова, {Алла Павловна}",

year = "2018",

doi = "10.21437/SpeechProsody.2018-34",

language = "English",

volume = "2018-June",

series = "Proceedings of the International Conference on Speech Prosody",

pages = "167--171",

booktitle = "Proceedings of Speech Prosody 2018",

}

RIS

TY - GEN

T1 - Distributed representation of melodic contours

AU - Кочаров, Даниил Александрович

AU - Меньшикова, Алла Павловна

PY - 2018

Y1 - 2018

N2 - We introduce a new computational model for melodic contours—melody embeddings. It is based on the approach of distributional semantics where embeddings represent units as continuous vectors in a multi-dimensional space based on hypothesis that units with similar meaning are used in similar contexts. This paradigm is applied to melodic contours and their segments. Melodic contours are represented by vectors of the same dimensionality independent on their length and shape. We successfully evaluated the ability of the proposed model to measure the distance between melodic contours. The results of applying the model for a task of prominent words detection have not showed the improvement over traditional prosodic features. Nevertheless we assume the model to be very promising. The possible applications for the proposed unsupervised prosodic model include processing of speech of underresourced languages, modelling prosodic variability for textto-speech synthesis, recognition and classification of prosodic events by means of deep-learning algorithms.

AB - We introduce a new computational model for melodic contours—melody embeddings. It is based on the approach of distributional semantics where embeddings represent units as continuous vectors in a multi-dimensional space based on hypothesis that units with similar meaning are used in similar contexts. This paradigm is applied to melodic contours and their segments. Melodic contours are represented by vectors of the same dimensionality independent on their length and shape. We successfully evaluated the ability of the proposed model to measure the distance between melodic contours. The results of applying the model for a task of prominent words detection have not showed the improvement over traditional prosodic features. Nevertheless we assume the model to be very promising. The possible applications for the proposed unsupervised prosodic model include processing of speech of underresourced languages, modelling prosodic variability for textto-speech synthesis, recognition and classification of prosodic events by means of deep-learning algorithms.

KW - Distributed representations

KW - Embeddings

KW - Melody

KW - Prosody

KW - Unsupervised clustering

UR - http://www.scopus.com/inward/record.url?scp=85050220264&partnerID=8YFLogxK

U2 - 10.21437/SpeechProsody.2018-34

DO - 10.21437/SpeechProsody.2018-34

M3 - Conference contribution

VL - 2018-June

T3 - Proceedings of the International Conference on Speech Prosody

SP - 167

EP - 171

BT - Proceedings of Speech Prosody 2018

ER -

ID: 27388055