• Alexander Alenin
  • Anton Okhotnikov
  • Rostislav Makarov
  • Nikita Torgashov
  • Ilya Shigabeev
  • Konstantin Simonchik
This paper describes ID R&D team submission to the text- independent task of the Short-duration Speaker Verification (SdSV) Challenge 2021. The top performed system is a fu- sion of 9 Convolutional Neural Networks based on the ResNet architecture. Experiments’ results of optimal NN architecture search are shown. We also present and investigate the subnet- work approach to solve the auxiliary tasks such as gender or language detection. Verification scores refinement step using quality measurements of a trial pair allowed to further mini- mize the target metrics. A comparative analysis of all systems used in the fusion has been provided on the VoxCeleb-1 test set, SdSV-2021 development and evaluation sets. The final submis- sion achieves 0.69% EER and 0.0319 minDCF on the challenge evaluation set.
Original languageEnglish
Pages2297-2301
Number of pages5
DOIs
StatePublished - 30 Aug 2021
EventInterspeech 2021 - Брно, Czech Republic
Duration: 30 Aug 20213 Sep 2021
https://www.interspeech2021.org/

Conference

ConferenceInterspeech 2021
Abbreviated titleInterspeech 2021
Country/TerritoryCzech Republic
CityБрно
Period30/08/213/09/21
Internet address

    Research areas

  • Speaker recognition, Speaker verification, cross- lingual speaker verification, SdSV Challenge 2021, Cross-lingual speaker verification

    Scopus subject areas

  • Artificial Intelligence
  • Signal Processing
  • Computer Science(all)
  • Software
  • Language and Linguistics
  • Human-Computer Interaction
  • Modelling and Simulation

ID: 86369686