Linguistic based emotion analysis using softmax over time attention mechanism

Standard

Linguistic based emotion analysis using softmax over time attention mechanism. / Roshan, Megha; Rawat, Mukul; Aryan, Karan ; Lyakso, Elena; Mekala A., Mary; Ruban, Nersissona.

In: PLoS ONE, Vol. 19, No. 4, e0301336, 16.04.2024.

Research output: Contribution to journal › Article › peer-review

Author

Roshan, Megha ; Rawat, Mukul ; Aryan, Karan ; Lyakso, Elena ; Mekala A., Mary ; Ruban, Nersissona. / Linguistic based emotion analysis using softmax over time attention mechanism. In: PLoS ONE. 2024 ; Vol. 19, No. 4.

BibTeX

@article{150a3d07d51e4b1e9ceccfb38b803d26,

title = "Linguistic based emotion analysis using softmax over time attention mechanism",

abstract = "Recognizing the real emotion of humans is considered the most essential task for any customer feedback or medical applications. There are many methods available to recognize the type of emotion from speech signal by extracting frequency, pitch, and other dominant features. These features are used to train various models to auto-detect various human emotions. We cannot completely rely on the features of speech signals to detect the emotion, for instance, a customer is angry but still, he is speaking at a low voice (frequency components) which will eventually lead to wrong predictions. Even a video-based emotion detection system can be fooled by false facial expressions for various emotions. To rectify this issue, we need to make a parallel model that will train on textual data and make predictions based on the words present in the text. The model will then classify the type of emotions using more comprehensive information, thus making it a more robust model. To address this issue, we have tested four text-based classification models to classify the emotions of a customer. We examined the text-based models and compared their results which showed that the modified Encoder decoder model with attention mechanism trained on textual data achieved an accuracy of 93.5%. This research highlights the pressing need for more robust emotion recognition systems and underscores the potential of transfer models with attention mechanisms to significantly improve feedback management processes and the medical applications.",

keywords = "emotion analysis, Male, Humans, Emotions, Speech, Voice, Linguistics, Recognition, Psychology",

author = "Megha Roshan and Mukul Rawat and Karan Aryan and Elena Lyakso and {Mekala A.}, Mary and Nersissona Ruban",

year = "2024",

month = apr,

day = "16",

doi = "10.1371/journal.pone.0301336",

language = "English",

volume = "19",

journal = "PLoS ONE",

issn = "1932-6203",

publisher = "Public Library of Science",

number = "4",

}

RIS

TY - JOUR

T1 - Linguistic based emotion analysis using softmax over time attention mechanism

AU - Roshan, Megha

AU - Rawat, Mukul

AU - Aryan, Karan

AU - Lyakso, Elena

AU - Mekala A., Mary

AU - Ruban, Nersissona

PY - 2024/4/16

Y1 - 2024/4/16

N2 - Recognizing the real emotion of humans is considered the most essential task for any customer feedback or medical applications. There are many methods available to recognize the type of emotion from speech signal by extracting frequency, pitch, and other dominant features. These features are used to train various models to auto-detect various human emotions. We cannot completely rely on the features of speech signals to detect the emotion, for instance, a customer is angry but still, he is speaking at a low voice (frequency components) which will eventually lead to wrong predictions. Even a video-based emotion detection system can be fooled by false facial expressions for various emotions. To rectify this issue, we need to make a parallel model that will train on textual data and make predictions based on the words present in the text. The model will then classify the type of emotions using more comprehensive information, thus making it a more robust model. To address this issue, we have tested four text-based classification models to classify the emotions of a customer. We examined the text-based models and compared their results which showed that the modified Encoder decoder model with attention mechanism trained on textual data achieved an accuracy of 93.5%. This research highlights the pressing need for more robust emotion recognition systems and underscores the potential of transfer models with attention mechanisms to significantly improve feedback management processes and the medical applications.

AB - Recognizing the real emotion of humans is considered the most essential task for any customer feedback or medical applications. There are many methods available to recognize the type of emotion from speech signal by extracting frequency, pitch, and other dominant features. These features are used to train various models to auto-detect various human emotions. We cannot completely rely on the features of speech signals to detect the emotion, for instance, a customer is angry but still, he is speaking at a low voice (frequency components) which will eventually lead to wrong predictions. Even a video-based emotion detection system can be fooled by false facial expressions for various emotions. To rectify this issue, we need to make a parallel model that will train on textual data and make predictions based on the words present in the text. The model will then classify the type of emotions using more comprehensive information, thus making it a more robust model. To address this issue, we have tested four text-based classification models to classify the emotions of a customer. We examined the text-based models and compared their results which showed that the modified Encoder decoder model with attention mechanism trained on textual data achieved an accuracy of 93.5%. This research highlights the pressing need for more robust emotion recognition systems and underscores the potential of transfer models with attention mechanisms to significantly improve feedback management processes and the medical applications.

KW - emotion analysis

KW - Male

KW - Humans

KW - Emotions

KW - Speech

KW - Voice

KW - Linguistics

KW - Recognition, Psychology

UR - https://www.mendeley.com/catalogue/5499e226-c864-3197-bc64-538188af057c/

U2 - 10.1371/journal.pone.0301336

DO - 10.1371/journal.pone.0301336

M3 - Article

C2 - 38625932

VL - 19

JO - PLoS ONE

JF - PLoS ONE

SN - 1932-6203

IS - 4

M1 - e0301336

ER -

ID: 119154629