RANSAC-based Training Data Selection for Speaker State Recognition

Bozkurt E., Erzin E., Erdem Ç., Erdem A. T.

12th Annual Conference of the International Speech Communication Association 2011 (INTERSPEECH 2011), Florence, Italy, 27 - 31 August 2011, pp. 3300-3301

Publication Type: Conference Paper / Full-Text Paper
City of Publication: Florence
Country of Publication: Italy
Pages: pp. 3300-3301
Affiliated with Marmara University: No

Abstract

We present a Random Sampling Consensus (RANSAC) based training approach for the problem of speaker state recognition from spontaneous speech. Our system is trained and tested with the INTERSPEECH 2011 Speaker State Challenge corpora, which include the Intoxication and the Sleepiness Sub-challenges, where each sub-challenge defines a two-class classification task. We perform RANSAC-based training data selection coupled with Support Vector Machine (SVM) based classification to prune possible outliers in the training data. Our experimental evaluations indicate that RANSAC-based training data selection provides unweighted average (UA) recall rates of 66.32% and 65.38% on the development and test sets of the Sleepiness Sub-challenge, respectively, and a slight improvement in performance on the Intoxication Sub-challenge.
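The abstract does not spell out the selection procedure, so the following is only a minimal sketch of how a RANSAC-style training data selection loop and the UA recall metric might look. Everything in it is an assumption made for illustration: the function names (`ua_recall`, `fit_centroids`, `predict`, `ransac_select`), the parameters (`n_iter`, `per_class`, `seed`), the inlier rule (a training sample counts as an inlier when the model fitted on a random subset classifies it correctly), and the toy data. To keep the example dependency-free, a nearest-centroid classifier stands in for the SVM used in the paper.

```python
import random
from collections import defaultdict


def ua_recall(y_true, y_pred):
    """Unweighted average recall: the mean of the per-class recalls."""
    hits, totals = defaultdict(int), defaultdict(int)
    for t, p in zip(y_true, y_pred):
        totals[t] += 1
        if t == p:
            hits[t] += 1
    return sum(hits[c] / totals[c] for c in totals) / len(totals)


def fit_centroids(X, y):
    """Stand-in classifier (NOT an SVM): one mean feature vector per class."""
    sums, counts = {}, defaultdict(int)
    for x, label in zip(X, y):
        if label not in sums:
            sums[label] = [0.0] * len(x)
        sums[label] = [s + v for s, v in zip(sums[label], x)]
        counts[label] += 1
    return {c: [s / counts[c] for s in sums[c]] for c in sums}


def predict(centroids, X):
    """Assign each sample to the class with the nearest centroid."""
    def dist2(a, b):
        return sum((u - v) ** 2 for u, v in zip(a, b))
    return [min(centroids, key=lambda c: dist2(centroids[c], x)) for x in X]


def ransac_select(X, y, n_iter=50, per_class=1, seed=0):
    """Return indices of the largest consensus (inlier) set found.

    Hypothetical procedure: fit on a small random subset, count the
    training samples the fitted model classifies correctly, and keep
    the largest such set over n_iter trials.
    """
    rng = random.Random(seed)
    classes = sorted(set(y))
    by_class = {c: [i for i, lab in enumerate(y) if lab == c] for c in classes}
    best = []
    for _ in range(n_iter):
        # Stratified random sample so every class is represented.
        sample = []
        for c in classes:
            sample += rng.sample(by_class[c], min(per_class, len(by_class[c])))
        model = fit_centroids([X[i] for i in sample], [y[i] for i in sample])
        pred = predict(model, X)
        inliers = [i for i in range(len(X)) if pred[i] == y[i]]
        if len(inliers) > len(best):
            best = inliers
    return best


# Toy 1-D data: two clusters; the last two labels are deliberately wrong.
X = [[0.0], [0.2], [0.4], [5.0], [5.2], [5.4], [5.2], [0.2]]
y = [0, 0, 0, 1, 1, 1, 0, 1]

kept = ransac_select(X, y)
final_model = fit_centroids([X[i] for i in kept], [y[i] for i in kept])
print("kept indices:", kept)
print("UA recall on kept data:",
      ua_recall([y[i] for i in kept], predict(final_model, [X[i] for i in kept])))
```

On this toy set the two mislabeled samples (indices 6 and 7) should fall outside the consensus set, and the final model is trained only on the remaining six. In a setting closer to the paper, the stand-in classifier would be replaced by an SVM and the UA recall would be measured on held-out development data rather than on the selected training data.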