"Non-speech" sounds classification for people with hearing disabilities

H. Lozano*, I. Hernaez, E. Navas, F. J. González, I. Idigoras

*Autor correspondiente de este trabajo

Producción científica: Capítulo del libro/informe/acta de congresoContribución a la conferenciarevisión exhaustiva

3 Citas (Scopus)

Resumen

People with hearing disabilities experience the problems that stem from not being able to detect or identify sounds on a daily basis. Studying the techniques and algorithms which enable this task to be performed automatically may lead to significant technological progress which will offer huge benefits to deaf people. With the objective of developing an application which is capable of detecting and classifying the different sounds that may emerge in the home, a study is being carried out which shows the most important parameters for processing impulsive sounds such as door bells, alarm clocks, a baby crying which obtain high accuracy ratios and give the classifier high reliability. To date, an initial prototype has been developed which implements a GMM (Gaussian Mixture Model) classifier which is based on the Gaussian probability distribution for sound event prediction. In order to check the classifier's accuracy, typical speech recognition parameters have been used, such as MFCC (Mel frequency cepstral coefficient), as well as parameters used to recognise musical instruments and background sounds: Spectral Centroid, Roll-Off Point and ZCR (16 parameters in total). By varying a series of factors (number of parameters, the sounds used to train the classifier...) the GMM's behaviour has been analysed obtaining results with over 90% accuracy in frames and up to 100% accuracy using the sound average, identifying doors, telephones and alarm clocks.

Idioma originalInglés
Título de la publicación alojadaChallenges for Assistive Technology. AAATE 07
EditoresGorka Eizmendi, Jose Miguel Azkoitia, Gerald Craddock
Páginas276-280
Número de páginas5
EstadoPublicada - 2007

Serie de la publicación

NombreAssistive Technology Research Series
Volumen20
ISSN (versión impresa)1383-813X
ISSN (versión digital)1879-8071

Huella

Profundice en los temas de investigación de '"Non-speech" sounds classification for people with hearing disabilities'. En conjunto forman una huella única.

Citar esto