Investigación

ÁREAS DE CONOCIMIENTO

 

GRUPOS DE INVESTIGACIóN

 

MEMORIAS DE INVESTIGACIóN

 

PERSONAL DE INVESTIGACIóN

 

SEMINARIOS DE INVESTIGACIóN

 

DOCTORES EGRESADOS

 

CONVOCATORIAS

 

Escuela Politécnica Superior

Universidad Autónoma de Madrid


Google Research Award

Google Research Award to Joaquin Gonzalez-Rodriguez from ATVS-UAM.
His project proposal entitled "Exploiting prior knowledge for robust recognition and indexing of audio information sources" has been funded by Google Research with 40.000 US$ for a two year research period starting april 2009.

Title: Exploiting prior knowledge for robust recognition and indexing of audio information sources Research Abstract and Goals

Current audio indexing applications and speech, speaker and language recognition technologies rely mostly on the information available in the input signal. As a result, all those techniques are highly influenced by the unknown conditions of the test recording, as session mismatch (channel, type of speech, emotions ...), noisy and/or reverberant recordings, or data scarcity. However, recent developments on factor analysis techniques allow to introduce very strong priors, estimated from huge amounts of data, drastically reducing the amount of free parameters to be estimated to a very limited subspace. Prior subspaces of both speaker and session variability between different recordings have been successfully exploited in last NIST 2008 Speaker Recognition Evaluation, but other applications fully fitted to the Google environment, as speaker diarisation and audio indexing or adaptation to mismatched conditions can be extraordinarily benefited from different sources of prior knowledge. The goals of the research proposal are to obtain significant improvements relative to present performance introducing different sources of prior knowledge in two sample applications, namely multi-environment speaker diarisation (broadcast programs, conversational telephone speech, meetings transcription ...) and speaker recognition from severely mismatched channels with scarce test data. Extensions to other information sources in audio indexing, and the use of factor analysis in speech recognition will also be addressed.

Versión imprimible Webmaster.eps@uam.es
Personal Página Principal Búsquedas Mapa Web Localización English Version Última actualización: Jueves, 30/Abril/2009