Dmitry Zotkin

Adjunct Associate Professor
4218 Iribe Center
(301) 405-1049
Education: 
Ph.D., University of Maryland (Computer Science)
Biography: 

Dmitry N. Zotkin is an adjunct associate professor in UMIACS and a member of the Perceptual Interfaces and Reality Laboratory, the Center for Automation Research (CfAR), and the Computer Vision Laboratory.

Zotkin is working with audio and acoustic signal processing. His main research interests are spatial audio capture and reproduction.

Zotkin also works in related areas, such as microphone arrays, auditory scene analysis, and fast numerical methods for the acoustic wave equation.

He is an author/co-author for two book chapters, 12 journal papers and more than 40 referred conference publications. Zotkin was the main author of a 2006 paper describing a novel fast personalization/customization method for a personal 3-D audio system. UMD has obtained a patent on the relevant technology and has licensed it to companies aimed at widespread use of personalized spatial audio at the consumer level.

Zotkin is a regular reviewer for several audio-related IEEE Transactions and for the Journal of the Acoustical Society of America. He has served on the program committee or as a reviewer for many of the major conferences in his research area. He is also a member of the Acoustical Society of America.

Zotkin received a doctorate in computer science from the University of Maryland in 2002.

Go here to view Zotkin's academic publications.

Publications

2011


Srinivasan BV, Zotkin DN, Duraiswami R.  2011.  A partial least squares framework for speaker recognition. Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on.
:5276-5279.

Srinivasan BV, Garcia-Romero D, Zotkin DN, Duraiswami R.  2011.  Kernel partial least squares for speaker recognition. Twelfth Annual Conference of the International Speech Communication Association.

2010


Zotkin DN, Duraiswami R.  2010.  Signal Processing for Audio HCI. Handbook of Signal Processing Systems.
:243-265.

O'donovan A, Duraiswami R, Zotkin DN, Gumerov NA.  2010.  Audio visual scene analysis using spherical arrays and cameras.. The Journal of the Acoustical Society of America. 127(3):1979-1979.

Vasan Srinivasan B, Duraiswami R, Zotkin DN.  2010.  Kernelized Rényi distance for speaker recognition. Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on.
:4506-4509.

O'Donovan AE, Duraiswami R, Zotkin DN.  2010.  Automatic matched filter recovery via the audio camera. Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on.
:2826-2829.

2009


Zotkin DN, Duraiswami R, Gumerov NA.  2009.  Regularized HRTF fitting using spherical harmonics. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2009. WASPAA '09.
:257-260.

Zotkin DN, Duraiswami R.  2009.  Plane-wave decomposition of a sound scene using a cylindrical microphone array. Acoustics, Speech and Signal Processing, 2009. ICASSP 2009. IEEE International Conference on.
:85-88.

O'donovan A, Duraiswami R, Gumerov NA, Zotkin DN.  2009.  Imaging room acoustics with the audio camera.. The Journal of the Acoustical Society of America. 125(4):2544-2544.

2008


O'Donovan A, Duraiswami R, Zotkin DN.  2008.  Imaging concert hall acoustics using visual and audio cameras. IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008.
:5284-5287.

Zotkin DN, Duraiswami R, Gumerov NA.  2008.  Sound field decomposition using spherical microphone arrays. IEEE International Conference on Acoustics, Speech and Signal Processing, 2008. ICASSP 2008.
:277-280.

2007


Zotkin DN, Duraiswami R, Gumerov NA.  2007.  Efficient Conversion of X.Y Surround Sound Content to Binaural Head-Tracked Form for HRTF-Enabled Playback. IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. 1:I-21-I-24-I-21-I-24.

Zotkin DN, Raykar VC, Duraiswami R, Davis LS.  2007.  Multimodal Tracking for Smart Videoconferencing and Video Surveillance. Computer Vision and Pattern Recognition, 2007. CVPR '07. IEEE Conference on.
:1-2.

Duraiswami R, Zotkin DN, Gumerov NA.  2007.  Fast Evaluation of the Room Transfer Function Using Multipole Expansion. Audio, Speech, and Language Processing, IEEE Transactions on. 15(2):565-576.

Gumerov NA, Duraiswami R, Zotkin DN.  2007.  Fast Multipole Accelerated Boundary Elements for Numerical Computation of the Head Related Transfer Function. IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. 1:I-165-I-168-I-165-I-168.

2006


Zotkin DN, Duraiswami R, Grassi E, Gumerov NA.  2006.  Fast head-related transfer function measurement via reciprocity. The Journal of the Acoustical Society of America. 120(4):2202-2215.

Duraiswami R, Zotkin DN, O'donovan A.  2006.  Capture and rendering of spatial sound over headphones. The Journal of the Acoustical Society of America. 120(5):3094-3094.

Duraiswami R, Li Z, Zotkin DN, Grassi E.  2006.  Spherical and hemispherical microphone arrays for capture and analysis of sound fields. The Journal of the Acoustical Society of America. 120(5):3225-3225.

Yerukhimovich A, Duraiswami R, Gumerov NA, Zotkin DN.  2006.  Frequency Independent Flexible Spherical Beamforming Via Rbf Fitting. 2006 IEEE International Conference on Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 5:V-V-V-V.

2005


Duraiswami R, Li Z, Zotkin DN, Grassi E, Gumerov NA.  2005.  Plane-wave decomposition analysis for spherical microphone arrays. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2005.
:150-153.

Yegnanarayana B, Prasanna SRM, Duraiswami R, Zotkin DN.  2005.  Processing of reverberant speech for time-delay estimation. IEEE Transactions on Speech and Audio Processing. 13(6):1110-1118.

Zotkin DN, Chi T, Shamma SA, Duraiswami R.  2005.  Neuromimetic sound representation for percept detection and manipulation. EURASIP Journal on Applied Signal Processing. 9:1350-1350.

2004


Duraiswami R, Zotkin DN, Gumerov NA.  2004.  INTERPOLATION AND RANGE EXTRAPOLATION OF HEAD RELATED TRANSFER FUNCTIONS. IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS SPEECH AND SIGNAL PROCESSING. 4

2003


Zotkin DN, Shamma SA, Ru P, Duraiswami R, Davis LS.  2003.  Pitch and timbre manipulations using cortical representation of sound. Multimedia and Expo, IEEE International Conference on. 3:381-384.

Mohan A, Duraiswami R, Zotkin DN, DeMenthon D, Davis LS.  2003.  Using computer vision to generate customized spatial audio. Multimedia and Expo, IEEE International Conference on. 3:57-60.

Zotkin DN, Hwang J, Duraiswami R, Davis LS.  2003.  HRTF personalization using anthropometric measurements. Applications of Signal Processing to Audio and Acoustics, 2003 IEEE Workshop on..
:157-160.

Zotkin DN, Shamma SA, Ru P, Duraiswami R, Davis LS.  2003.  AUDIO-P2. 1: PITCH AND TIMBRE MANIPULATIONS USING CORTICAL REPRESENTATION OF SOUND. IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS SPEECH AND SIGNAL PROCESSING. 5

2002


Zotkin DN, Duraiswami R, Davis LS, Mohan A, Raykar V.  2002.  Virtual audio system customization using visual matching of ear parameters. 16th International Conference on Pattern Recognition, 2002. Proceedings. 3:1003-1006vol.3-1003-1006vol.3.

Zotkin DN, Duraiswami R, Davis LS.  2002.  Creation of virtual auditory spaces. 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). 2

Zotkin DN, Duraiswami R, Davis LS.  2002.  Customizable auditory displays. Proceedings of the International Conference on Auditory Display.
:167-176.

2001


Duraiswami R, Zotkin DN, Davis LS.  2001.  Active speech source localization by a dual coarse-to-fine search. 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2001. Proceedings. (ICASSP '01). 5:3309-3312vol.5-3309-3312vol.5.

Zotkin DN, Duraiswami R, Davis LS.  2001.  Multimodal 3-D tracking and event detection via the particle filter. IEEE Workshop on Detection and Recognition of Events in Video, 2001. Proceedings.
:20-27.

Haritaoglu I, Cozzi A, Koons D, Flickner M, Zotkin DN, Yacoob Y.  2001.  Attentive toys. International Conference on Multimedia and Expo. 22:25-25.

Duraiswami R, Gumerov NA, Zotkin DN, Davis LS.  2001.  Efficient evaluation of reverberant sound fields. Applications of Signal Processing to Audio and Acoustics, 2001 IEEE Workshop on the.
:203-206.

Zotkin DN, Duraiswami R, Nanda H, Davis LS.  2001.  Multimodal tracking for smart videoconferencing. Second International Conference on Multimedia and Expo, Tokyo, Japan.

Ghose K, Zotkin DN, Duraiswami R, Moss CF.  2001.  Multimodal localization of a flying bat. Acoustics, Speech, and Signal Processing, IEEE International Conference on. 5:3057-3060.

2000


Duraiswami R, Zotkin DN, Borovikov EA, Davis LS.  2000.  Active source location and beamforming. The Journal of the Acoustical Society of America. 107:2790-2790.

Zotkin DN, Keleher PJ, Perkovic D.  2000.  Attacking the bottlenecks of backfilling schedulers. Cluster Computing.

Zotkin DN, Duraiswami R, Davis LS, Haritaoglu I.  2000.  An audio-video front-end for multimedia applications. 2000 IEEE International Conference on Systems, Man, and Cybernetics. 2:786-791vol.2-786-791vol.2.

Zotkin DN, Duraiswami R, Philomin V, Davis LS.  2000.  Smart videoconferencing. 2000 IEEE International Conference on Multimedia and Expo, 2000. ICME 2000. 3:1597-1600vol.3-1597-1600vol.3.

1999


Zotkin DN, Duraiswami R, Hariatoglu I, Davis LS, Otsuka T.  1999.  A real-time audio–video front-end for multimedia applications. The Journal of the Acoustical Society of America. 106:2271-2271.

1998


Soffer A, Samet H, Zotkin DN.  1998.  Pictorial query trees for query specification in image databases. Fourteenth International Conference on Pattern Recognition, 1998. Proceedings. 1:919-921vol.1-919-921vol.1.