The paper presents the results of perceptual experiments (by humans) and automatic recognition of the emotional states of children with Down syndrome (DS) by video, audio and text modalities. The participants of the study were 35 children with DS aged 5–16 years, and 30 adults – the participants of the perceptual experiment. Automatic analysis of facial expression by video was performed using FaceReader software runs on the Microsoft Azure cloud platform and convolutional neural network. Automatic recognition of the emotional states of children by speech was carried out using a recurrent neural network. Specifically for this project, we did not apply any additional transfer learning or fine-tuning as our goal was to investigate how the generic models perform for children with DS. The results of perceptual experiments showed that adults recognize the emotional states of children with DS by video better than by audio. Automatic classification of children’s emotional states by facial expression revealed better results for joy and neutral states than for sadness and anger; by audio the best results were shown for the neutral state, by the texts of children’s speech - for joy, the state of sadness was not recognized automatically. The study revealed the possibility of using the available software for classifying the neutral state and the state of joy, i.e. states with neutral and positive valence, and the need to develop an approach to determine the state of sadness and anger.
Original languageEnglish
Title of host publicationSpeech and Computer - 24th International Conference, SPECOM 2022, Proceedings
Subtitle of host publication24th International Conference, SPECOM 2022, Gurugram, India, November 14–16, 2022, Proceedings
EditorsS.R. Mahadeva Prasanna, Alexey Karpov, K. Samudravijaya, Shyam S. Agrawal
PublisherSpringer Nature
Pages438-450
Number of pages13
ISBN (Electronic)978-3-031-20980-2
ISBN (Print)978-3-031-20979-6
DOIs
StatePublished - Nov 2022
Event24th International Conference on Speech and Computer SPECOM 2022 - Gurugram, India
Duration: 14 Nov 202216 Nov 2022
https://www.specom.co.in

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13721 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference24th International Conference on Speech and Computer SPECOM 2022
Abbreviated titleSPECOM 2022
Country/TerritoryIndia
CityGurugram
Period14/11/2216/11/22
Internet address

    Research areas

  • emotional state, Perceptual and automatic recognition, Child with down syndrome, Video, Audio, Text modalities, Emotional state

    Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

ID: 100304343