TY - CONF T1 - An audio-video front-end for multimedia applications T2 - 2000 IEEE International Conference on Systems, Man, and Cybernetics Y1 - 2000 A1 - Zotkin,Dmitry N A1 - Duraiswami, Ramani A1 - Davis, Larry S. A1 - Haritaoglu,I. KW - Acoustic noise KW - acoustical source location KW - Application software KW - audio cues KW - audio-video front-end KW - CAMERAS KW - Computer vision KW - Microphones KW - multimedia applications KW - multimedia systems KW - multimodal sensor fusion system KW - multimodal user interfaces KW - Position measurement KW - REAL TIME KW - Real time systems KW - real-time systems KW - sensor fusion KW - sound KW - Speech recognition KW - User interfaces KW - video cameras KW - video gaming KW - video-based person tracking KW - Videoconference KW - videoconferencing KW - Virtual reality KW - visual cues KW - Working environment noise AB - Applications such as video gaming, virtual reality, multimodal user interfaces and videoconferencing, require systems that can locate and track persons in a room through a combination of visual and audio cues, enhance the sound that they produce, and perform identification. We describe the development of a particular multimodal sensor fusion system that is portable, runs in real time and achieves these objectives. The system employs novel algorithms for acoustical source location, video-based person tracking and overall system control, which are also described JA - 2000 IEEE International Conference on Systems, Man, and Cybernetics PB - IEEE VL - 2 SN - 0-7803-6583-6 M3 - 10.1109/ICSMC.2000.885945 ER -