Standard

Transfer Learning Based Convolution Neural Net for Authentication and Classification of Emotions from Natural and Stimulated Speech Signals. / Kumar, Mukula; Katyal, Nipuna; Ruban, Nersissona; Lyakso, Elena; Mekala A., Mary; Noel, Joseph Raj, Alex ; Richard G, Maarc.

In: Journal of Intelligent and Fuzzy Systems, Vol. 41, No. 1, 11.08.2021, p. 2013-2024.

Research output: Contribution to journalArticlepeer-review

Harvard

Kumar, M, Katyal, N, Ruban, N, Lyakso, E, Mekala A., M, Noel, JRA & Richard G, M 2021, 'Transfer Learning Based Convolution Neural Net for Authentication and Classification of Emotions from Natural and Stimulated Speech Signals', Journal of Intelligent and Fuzzy Systems, vol. 41, no. 1, pp. 2013-2024. https://doi.org/10.3233/jifs-210711

APA

Kumar, M., Katyal, N., Ruban, N., Lyakso, E., Mekala A., M., Noel, J. R. A., & Richard G, M. (2021). Transfer Learning Based Convolution Neural Net for Authentication and Classification of Emotions from Natural and Stimulated Speech Signals. Journal of Intelligent and Fuzzy Systems, 41(1), 2013-2024. https://doi.org/10.3233/jifs-210711

Vancouver

Kumar M, Katyal N, Ruban N, Lyakso E, Mekala A. M, Noel JRA et al. Transfer Learning Based Convolution Neural Net for Authentication and Classification of Emotions from Natural and Stimulated Speech Signals. Journal of Intelligent and Fuzzy Systems. 2021 Aug 11;41(1):2013-2024. https://doi.org/10.3233/jifs-210711

Author

Kumar, Mukula ; Katyal, Nipuna ; Ruban, Nersissona ; Lyakso, Elena ; Mekala A., Mary ; Noel, Joseph Raj, Alex ; Richard G, Maarc. / Transfer Learning Based Convolution Neural Net for Authentication and Classification of Emotions from Natural and Stimulated Speech Signals. In: Journal of Intelligent and Fuzzy Systems. 2021 ; Vol. 41, No. 1. pp. 2013-2024.

BibTeX

@article{5c2aa01a33d9449f8d564b97616d5464,
title = "Transfer Learning Based Convolution Neural Net for Authentication and Classification of Emotions from Natural and Stimulated Speech Signals",
abstract = "Over the years the need for differentiating various emotions from oral communication plays an important role in emotion based studies. There have been different algorithms to classify the kinds of emotion. Although there is no measure of fidelity of the emotion under consideration, which is primarily due to the reason that most of the readily available datasets that are annotated are produced by actors and not generated in real-world scenarios. Therefore, the predicted emotion lacks an important aspect called authenticity, which is whether an emotion is actual or stimulated. In this research work, we have developed a transfer learning and style transfer based hybrid convolutional neural network algorithm to classify the emotion as well as the fidelity of the emotion. The model is trained on features extracted from a dataset that contains stimulated as well as actual utterances. We have compared the developed algorithm with conventional machine learning and deep learning techniques by few metrics like accuracy, Precision, Recall and F1 score. The developed model performs much better than the conventional machine learning and deep learning models. The research aims to dive deeper into human emotion and make a model that understands it like humans do with precision, recall, F1 score values of 0.994, 0.996, 0.995 for speech authenticity and 0.992, 0.989, 0.99 for speech emotion classification respectively.",
keywords = "Deep learning, linear prediction cepstral coefficients (LPCC), mel frequency cepstral coefficients (MFCC), speech emotion recognition, speech fidelity classification, RECOGNITION, FREQUENCY",
author = "Mukula Kumar and Nipuna Katyal and Nersissona Ruban and Elena Lyakso and {Mekala A.}, Mary and Noel, {Joseph Raj, Alex} and {Richard G}, Maarc",
year = "2021",
month = aug,
day = "11",
doi = "10.3233/jifs-210711",
language = "English",
volume = "41",
pages = "2013--2024",
journal = "Journal of Intelligent and Fuzzy Systems",
issn = "1064-1246",
publisher = "IOS Press",
number = "1",

}

RIS

TY - JOUR

T1 - Transfer Learning Based Convolution Neural Net for Authentication and Classification of Emotions from Natural and Stimulated Speech Signals

AU - Kumar, Mukula

AU - Katyal, Nipuna

AU - Ruban, Nersissona

AU - Lyakso, Elena

AU - Mekala A., Mary

AU - Noel, Joseph Raj, Alex

AU - Richard G, Maarc

PY - 2021/8/11

Y1 - 2021/8/11

N2 - Over the years the need for differentiating various emotions from oral communication plays an important role in emotion based studies. There have been different algorithms to classify the kinds of emotion. Although there is no measure of fidelity of the emotion under consideration, which is primarily due to the reason that most of the readily available datasets that are annotated are produced by actors and not generated in real-world scenarios. Therefore, the predicted emotion lacks an important aspect called authenticity, which is whether an emotion is actual or stimulated. In this research work, we have developed a transfer learning and style transfer based hybrid convolutional neural network algorithm to classify the emotion as well as the fidelity of the emotion. The model is trained on features extracted from a dataset that contains stimulated as well as actual utterances. We have compared the developed algorithm with conventional machine learning and deep learning techniques by few metrics like accuracy, Precision, Recall and F1 score. The developed model performs much better than the conventional machine learning and deep learning models. The research aims to dive deeper into human emotion and make a model that understands it like humans do with precision, recall, F1 score values of 0.994, 0.996, 0.995 for speech authenticity and 0.992, 0.989, 0.99 for speech emotion classification respectively.

AB - Over the years the need for differentiating various emotions from oral communication plays an important role in emotion based studies. There have been different algorithms to classify the kinds of emotion. Although there is no measure of fidelity of the emotion under consideration, which is primarily due to the reason that most of the readily available datasets that are annotated are produced by actors and not generated in real-world scenarios. Therefore, the predicted emotion lacks an important aspect called authenticity, which is whether an emotion is actual or stimulated. In this research work, we have developed a transfer learning and style transfer based hybrid convolutional neural network algorithm to classify the emotion as well as the fidelity of the emotion. The model is trained on features extracted from a dataset that contains stimulated as well as actual utterances. We have compared the developed algorithm with conventional machine learning and deep learning techniques by few metrics like accuracy, Precision, Recall and F1 score. The developed model performs much better than the conventional machine learning and deep learning models. The research aims to dive deeper into human emotion and make a model that understands it like humans do with precision, recall, F1 score values of 0.994, 0.996, 0.995 for speech authenticity and 0.992, 0.989, 0.99 for speech emotion classification respectively.

KW - Deep learning

KW - linear prediction cepstral coefficients (LPCC)

KW - mel frequency cepstral coefficients (MFCC)

KW - speech emotion recognition

KW - speech fidelity classification

KW - RECOGNITION

KW - FREQUENCY

UR - http://www.scopus.com/inward/record.url?scp=85113260578&partnerID=8YFLogxK

UR - https://www.mendeley.com/catalogue/776f5d4c-3bd8-3475-9472-cb6252564d87/

U2 - 10.3233/jifs-210711

DO - 10.3233/jifs-210711

M3 - Article

VL - 41

SP - 2013

EP - 2024

JO - Journal of Intelligent and Fuzzy Systems

JF - Journal of Intelligent and Fuzzy Systems

SN - 1064-1246

IS - 1

ER -

ID: 84853053