DOI

We propose a novel computationally efficient real-time microphone array speech enhancement postfilter with a small delay that takes into account features of speech signal and recognition algorithms. The algorithm is efficient for small microphone arrays. The filter is based on applying a binary classification model to the Log Short-Term Spectral Amplitude (Log-STSA). The proposed algorithm allows substantial improvement of recognition accuracy with minor increase in complexity compared to Wiener post-filter and lower complexity compared to existing voice model based approaches. Objective tests using dual microphone array, ETSI binaural noise database, TIDIGITS database, and CMU Sphinx 4 speech recognizer demonstrate overall 41% Error Rate reduction for SNR from 15 dB to 0 dB. Subjective evaluation also demonstrates substantial noise reduction and intelligibility improvement without musical noise artifacts common for Wiener and Spectral Subtraction based methods. Testing with SiSEC10 four microphone linear equispaced array database shows that recognition accuracy is improved with increased base and/or number of microphones in array.
Язык оригиналаанглийский
Название основной публикацииSpeech and Computer (SPECOM 2017)
ИздательSpringer Nature
Страницы525-534
Число страниц10
DOI
СостояниеОпубликовано - 1 янв 2017
Событие19th International Conference on Speech and Computer - Hatfield, Великобритания
Продолжительность: 11 сен 201715 сен 2017

Серия публикаций

НазваниеLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ИздательSpringer Nature
Том10458 LNAI
ISSN (печатное издание)0302-9743

конференция

конференция19th International Conference on Speech and Computer
Сокращенное названиеSPECOM 2017
Страна/TерриторияВеликобритания
ГородHatfield
Период11/09/1715/09/17

ID: 152227169