[HOME] [EDUCATION] [PUBLICATIONS] [RESEARCH] [SOFTWARE] [INTERNSHIPS] [PROJECTS]
TIME DELAY ESTIMATION USING EXCITATION SOURCE INFORMATION IN SPEECH
Speaker Localization using excitation source information in speech Vikas C. Raykar, B.Yegnanarayana, S. R. Mahadeva Prasanna, and Ramani Duraiswami, IEEE Transactions on Speech and Audio Processing, Volume 13, Issue 5, Part 2, pp. 751-761, Sep. 2005.
We propose a novel method to estimate the time-delay between the signals received by a pair of microphones in a noisy reverberant room, using the excitation source information in speech. The time-delay is computed by locating the peak in the cross-correlation of the Hilbert envelope of the Linear Prediction Residuals. Results show that our method gives better performance than the GCC-PHAT, GCC-ML and Brandstein's pitch based methods.
[ Matlab code ] [ Source localization demo ] [ Demo Setup] [Face Detection Demo] [ Poster ]
Tracking a moving speaker using excitation source information Vikas C. Raykar, Ramani Duraiswami, B.Yegnanarayana, and S. R. Mahadeva Prasanna, In Proceedings of the 8th Eur. Conf. Speech Communication Technology (Eurospeech 2003), Geneva, September 2003, pp. 69-72.