Voice Activity Detector (VAD) based on long-term phonetic features › Научные исследования в СПбГУ

Standard

Voice Activity Detector (VAD) based on long-term phonetic features. / Barabanov, Andrey ; Kocharov, Daniil; Salishev, Sergey; Skrelin, Pavel; Moiseev, Mikhail.

Proceedings of the 7th Tutorial and Research Workshop on Experimental Linguistics ExLing 2016. Athens : National and Kapodistrian University of Athens, 2016. стр. 33-36.

Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование

Harvard

Barabanov, A , Kocharov, D, Salishev, S, Skrelin, P & Moiseev, M 2016, Voice Activity Detector (VAD) based on long-term phonetic features. в Proceedings of the 7th Tutorial and Research Workshop on Experimental Linguistics ExLing 2016. National and Kapodistrian University of Athens, Athens, стр. 33-36, 7th Tutorial and Research Workshop on Experimental Linguistics, Санкт-Петербург, Российская Федерация, 27/06/16. <https://exlingsociety.com/proceedings/exling-2016.html>

APA

Barabanov, A., Kocharov, D., Salishev, S., Skrelin, P., & Moiseev, M. (2016). Voice Activity Detector (VAD) based on long-term phonetic features. в Proceedings of the 7th Tutorial and Research Workshop on Experimental Linguistics ExLing 2016 (стр. 33-36). National and Kapodistrian University of Athens. https://exlingsociety.com/proceedings/exling-2016.html

Vancouver

Barabanov A , Kocharov D, Salishev S, Skrelin P, Moiseev M. Voice Activity Detector (VAD) based on long-term phonetic features. в Proceedings of the 7th Tutorial and Research Workshop on Experimental Linguistics ExLing 2016. Athens: National and Kapodistrian University of Athens. 2016. стр. 33-36

Author

Barabanov, Andrey ; Kocharov, Daniil ; Salishev, Sergey ; Skrelin, Pavel ; Moiseev, Mikhail. / Voice Activity Detector (VAD) based on long-term phonetic features. Proceedings of the 7th Tutorial and Research Workshop on Experimental Linguistics ExLing 2016. Athens : National and Kapodistrian University of Athens, 2016. стр. 33-36

BibTeX

@inproceedings{80d0f396d5454a27afcc81e3de9c62af,

title = "Voice Activity Detector (VAD) based on long-term phonetic features",

abstract = "We propose a VAD using long-term 200 ms Mel frequency band statistics, auditory masking, and pre-trained two level decision tree ensemble based classifier, which allows capturing syllable level structure of speech and discriminating it from com-mon noises. Proposed algorithm demonstrates almost 100% acceptance of clear voice for English, Chinese, Russian, and Polish speech and 100% rejection of sta-tionary noises independently of loudness",

keywords = "Voice Activity Detector, classification, decision tree ensemble, auditory masking, phonetic features",

author = "Andrey Barabanov and Daniil Kocharov and Sergey Salishev and Pavel Skrelin and Mikhail Moiseev",

year = "2016",

language = "English",

isbn = "2529-1092; 978-960-466-161-9",

pages = "33--36",

booktitle = "Proceedings of the 7th Tutorial and Research Workshop on Experimental Linguistics ExLing 2016",

publisher = "National and Kapodistrian University of Athens",

address = "Greece",

note = "7th Tutorial and Research Workshop on Experimental Linguistics : ExLing 2016, ExLing 2016 ; Conference date: 27-06-2016 Through 02-07-2016",

url = "https://exlingsociety.com/past-exlings/exling2016/home-2016.html",

}

RIS

TY - GEN

T1 - Voice Activity Detector (VAD) based on long-term phonetic features

AU - Barabanov, Andrey

AU - Kocharov, Daniil

AU - Salishev, Sergey

AU - Skrelin, Pavel

AU - Moiseev, Mikhail

N1 - Conference code: 7

PY - 2016

Y1 - 2016

N2 - We propose a VAD using long-term 200 ms Mel frequency band statistics, auditory masking, and pre-trained two level decision tree ensemble based classifier, which allows capturing syllable level structure of speech and discriminating it from com-mon noises. Proposed algorithm demonstrates almost 100% acceptance of clear voice for English, Chinese, Russian, and Polish speech and 100% rejection of sta-tionary noises independently of loudness

AB - We propose a VAD using long-term 200 ms Mel frequency band statistics, auditory masking, and pre-trained two level decision tree ensemble based classifier, which allows capturing syllable level structure of speech and discriminating it from com-mon noises. Proposed algorithm demonstrates almost 100% acceptance of clear voice for English, Chinese, Russian, and Polish speech and 100% rejection of sta-tionary noises independently of loudness

KW - Voice Activity Detector

KW - classification

KW - decision tree ensemble

KW - auditory masking

KW - phonetic features

M3 - Conference contribution

SN - 2529-1092; 978-960-466-161-9

SP - 33

EP - 36

BT - Proceedings of the 7th Tutorial and Research Workshop on Experimental Linguistics ExLing 2016

PB - National and Kapodistrian University of Athens

CY - Athens

T2 - 7th Tutorial and Research Workshop on Experimental Linguistics

Y2 - 27 June 2016 through 2 July 2016

ER -

ID: 7595383