Processing of reverberant speech for time-delay estimation

TitleProcessing of reverberant speech for time-delay estimation
Publication TypeJournal Articles
Year of Publication2005
AuthorsYegnanarayana B, Prasanna SRM, Duraiswami R, Zotkin DN
JournalIEEE Transactions on Speech and Audio Processing
Volume13
Issue6
Pagination1110 - 1118
Date Published2005/11//
ISBN Number1063-6676
KeywordsAcoustic noise, acoustic signal processing, array signal processing, data mining, Degradation, delay estimation, Feature extraction, Hilbert envelope, localization algorithm, microphone arrays, microphone location, Microphones, Phase estimation, reverberation, short-time spectral information, Signal processing, source features, source information excitation, speech enhancement, Speech processing, speech production mechanism, speech signal, time-delay, time-delay estimation
Abstract

In this paper, we present a method of extracting the time-delay between speech signals collected at two microphone locations. Time-delay estimation from microphone outputs is the first step for many sound localization algorithms, and also for enhancement of speech. For time-delay estimation, speech signals are normally processed using short-time spectral information (either magnitude or phase or both). The spectral features are affected by degradations in speech caused by noise and reverberation. Features corresponding to the excitation source of the speech production mechanism are robust to such degradations. We show that these source features can be extracted reliably from the speech signal. The time-delay estimate can be obtained using the features extracted even from short segments (50-100 ms) of speech from a pair of microphones. The proposed method for time-delay estimation is found to perform better than the generalized cross-correlation (GCC) approach. A method for enhancement of speech is also proposed using the knowledge of the time-delay and the information of the excitation source.

DOI10.1109/TSA.2005.853005