%0 Conference Paper %B Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on %D 2008 %T Image acquisition forensics: Forensic analysis to identify imaging source %A McKay,C. %A Swaminathan,A. %A Gou,Hongmei %A M. Wu %K ACQUISITION %K acquisition;image %K analysis; %K analysis;image %K analysis;interpolation;statistical %K cameras;color %K cell %K coefficients;computer %K colour %K editing %K forensics;image %K graphics;digital %K identification;noise %K images;forensic %K Interpolation %K phone %K processing;data %K softwares;imaging %K source %K statistics;scanners;signal %X With widespread availability of digital images and easy-to-use image editing softwares, the origin and integrity of digital images has become a serious concern. This paper introduces the problem of image acquisition forensics and proposes a fusion of a set of signal processing features to identify the source of digital images. Our results show that the devices' color interpolation coefficients and noise statistics can jointly serve as good forensic features to help accurately trace the origin of the input image to its production process and to differentiate between images produced by cameras, cell phone cameras, scanners, and computer graphics. Further, the proposed features can also be extended to determining the brand and model of the device. Thus, the techniques introduced in this work provide a unified framework for image acquisition forensics. %B Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. IEEE International Conference on %P 1657 - 1660 %8 2008/04/31/4 %G eng %R 10.1109/ICASSP.2008.4517945 %0 Journal Article %J Audio, Speech, and Language Processing, IEEE Transactions on %D 2007 %T Fast Evaluation of the Room Transfer Function Using Multipole Expansion %A Duraiswami, Ramani %A Zotkin,Dmitry N %A Gumerov, Nail A. %K acoustics;computational %K algorithm;frequency-domain %K Alien-Berkley %K analysis;reverberation; %K complexity;frequency-domain %K computations;image %K expansion;receivers;reverberant %K field;architectural %K fields;reverberation;room %K function;acoustic %K method;image %K potential;multipole %K sound %K source %K sources;monopole %K transfer %X Reverberation in rooms is often simulated with the image method due to Allen and Berkley (1979). This method has an asymptotic complexity that is cubic in terms of the simulated reverberation length. When employed in the frequency domain, it is relatively computationally expensive if there are many receivers in the room or if the source or receiver positions are changing with time. The computational complexity of the image method is due to the repeated summation of the fields generated by a large number of image sources. In this paper, a fast method to perform such summations is presented. The method is based on multipole expansion of the monopole source potential. For offline computation of the room transfer function for N image sources and M receiver points, use of the Allen-Berkley algorithm requires O(NM) operations, whereas use of the proposed method requires only O(N+M) operations, resulting in significantly faster computation of reverberant sound fields. The proposed method also has a considerable speed advantage in situations where the room transfer function must be rapidly updated online in response to source/receiver location changes. Simulation results are presented, and algorithm accuracy, speed, and implementation details are discussed. For problems that require frequency-domain computations, the algorithm is found to generate sound fields identical to the ones obtained with the frequency-domain version of the Allen-Berkley algorithm at a fraction of computational cost %B Audio, Speech, and Language Processing, IEEE Transactions on %V 15 %P 565 - 576 %8 2007/02// %@ 1558-7916 %G eng %N 2 %R 10.1109/TASL.2006.876753 %0 Conference Paper %B Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on %D 2005 %T Approximate expressions for the mean and the covariance of the maximum likelihood estimator for acoustic source localization %A Raykar,V.C. %A Duraiswami, Ramani %K (mathematics); %K acoustic %K approximate %K approximation %K array %K array; %K covariance %K estimation; %K expansion; %K expressions; %K function; %K likelihood %K localization; %K matrices; %K matrix; %K maximum %K mean %K microphone %K objective %K processing; %K series %K signal %K source %K Taylor %K theory; %K vector; %K vectors; %X Acoustic source localization using multiple microphones can be formulated as a maximum likelihood estimation problem. The estimator is implicitly defined as the minimum of a certain objective function. As a result, we cannot get explicit expressions for the mean and the covariance of the estimator. We derive approximate expressions for the mean vector and covariance matrix of the estimator using Taylor's series expansion of the implicitly defined estimator. The validity of our expressions is verified by Monte-Carlo simulations. We also study the performance of the estimator for different microphone array configurations. %B Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on %V 3 %P iii/73 - iii/76 Vol. 3 - iii/73 - iii/76 Vol. 3 %8 2005/03// %G eng %R 10.1109/ICASSP.2005.1415649 %0 Conference Paper %B INFOCOM 2005. 24th Annual Joint Conference of the IEEE Computer and Communications Societies. Proceedings IEEE %D 2005 %T Differentiated traffic engineering for QoS provisioning %A Tabatabaee,V. %A Bhattacharjee, Bobby %A La,R.J. %A Shayman,M.A. %K based %K Computer %K differentiated %K DiffServ %K DTE; %K engineering; %K evaluation; %K link; %K links; %K management; %K multipath %K network %K networks; %K nonconvex %K of %K optimisation; %K OPTIMIZATION %K packet %K performance %K problem; %K provisioning; %K QoS %K QUALITY %K routing; %K service; %K simulation-based %K source %K Telecommunication %K traffic %K traffic; %X We introduce a new approach for QoS provisioning in packet networks based on the notion of differentiated traffic engineering (DTE). We consider a single AS network capable of source based multi-path routing. We do not require sophisticated queuing or per-class scheduling at individual routers; instead, if a link is used to forward QoS sensitive packets, we maintain its utilization below a threshold. As a consequence, DTE eliminates the need for per-flow (IntServ) or per-class (DiffServ) packet processing tasks such as traffic classification, queueing, shaping, policing and scheduling in the core and hence poses a lower burden on the network management unit. Conversely, DTE utilizes network bandwidth much more efficiently than simple over-provisioning. In this paper, we propose a complete architecture and an algorithmic structure for DTE. We show that our scheme can be formulated as a non-convex optimization problem, and we present an optimal solution framework based on simulated annealing. We present a simulation-based performance evaluation of DTE, and compare our scheme to existing (gradient projection) methods. %B INFOCOM 2005. 24th Annual Joint Conference of the IEEE Computer and Communications Societies. Proceedings IEEE %V 4 %P 2349 - 2359 vol. 4 - 2349 - 2359 vol. 4 %8 2005/03// %G eng %R 10.1109/INFCOM.2005.1498521 %0 Conference Paper %B Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on %D 2005 %T The manifolds of spatial hearing %A Duraiswami, Ramani %A Raykar,V.C. %K acoustics; %K dimensional %K direction %K distance; %K EMBEDDING %K encoding; %K geodesic %K Head %K hearing %K hearing; %K HRIR %K impulse %K information %K interpolation; %K learned %K linear %K localization; %K locally %K low %K manifold %K manifold; %K manifolds; %K nonlinear %K perceptual %K related %K response; %K responses; %K sound %K source %K spatial %K structure; %K technique; %K transient %X We present exploratory studies on learning the non-linear manifold structure, in head related impulse responses (HRIRs). We use the recently popular locally linear embedding technique. The lower dimensional manifold encodes the perceptual information in the HRIRs, namely the direction of the sound source. Based on this, we propose a new method for HRIR interpolation. We also propose that the distance between two HRIRs of an individual be taken as the geodesic distance on the learned manifold. %B Acoustics, Speech, and Signal Processing, 2005. Proceedings. (ICASSP '05). IEEE International Conference on %V 3 %P iii/285 - iii/288 Vol. 3 - iii/285 - iii/288 Vol. 3 %8 2005/03// %G eng %R 10.1109/ICASSP.2005.1415702 %0 Journal Article %J Speech and Audio Processing, IEEE Transactions on %D 2005 %T Speaker Localization Using Excitation Source Information in Speech %A Raykar,V.C. %A Yegnanarayana,B. %A Prasanna,S. R.M %A Duraiswami, Ramani %K correlation %K correlation; %K cross %K Delay %K error %K error; %K estimation; %K excitation %K generalized %K information; %K localization; %K mean %K methods; %K processing; %K production; %K root %K source %K speaker %K speech %K square %K TIME %X This paper presents the results of simulation and real room studies for localization of a moving speaker using information about the excitation source of speech production. The first step in localization is the estimation of time-delay from speech collected by a pair of microphones. Methods for time-delay estimation generally use spectral features that correspond mostly to the shape of vocal tract during speech production. Spectral features are affected by degradations due to noise and reverberation. This paper proposes a method for localizing a speaker using features that arise from the excitation source during speech production. Experiments were conducted by simulating different noise and reverberation conditions to compare the performance of the time-delay estimation and source localization using the proposed method with the results obtained using the spectrum-based generalized cross correlation (GCC) methods. The results show that the proposed method shows lower number of discrepancies in the estimated time-delays. The bias, variance and the root mean square error (RMSE) of the proposed method is consistently equal or less than the GCC methods. The location of a moving speaker estimated using the time-delays obtained by the proposed method are closer to the actual values, than those obtained by the GCC method. %B Speech and Audio Processing, IEEE Transactions on %V 13 %P 751 - 761 %8 2005/09// %@ 1063-6676 %G eng %N 5 %R 10.1109/TSA.2005.851907 %0 Conference Paper %B Communications, 2004 IEEE International Conference on %D 2004 %T Distortion management of real-time MPEG-4 video over downlink multicode CDMA networks %A Su,Guan-Ming %A Han,Zhu %A Kwasinski,A. %A M. Wu %A Liu,K. J.R %A Farvardin,N. %K access; %K adaptation; %K allocation; %K CDMA %K channel %K code %K coding %K coding; %K combined %K communication; %K compression; %K control; %K data %K distortion %K division %K downlink %K links; %K management; %K MPEG-4 %K multicode %K multiple %K networks; %K power %K radio %K rate %K real-time %K resource %K source %K source-channel %K video %K video; %K visual %X In this paper, a protocol is designed to manage source rate/channel coding rate adaptation, code allocation, and power control to transmit real-time MPEG-4 FGS video over downlink multicode CDMA networks. We develop a fast adaptive scheme of distortion management to reduce the overall distortion received by all users subject, to the limited number of codes and maximal transmitted power. Compared with a modified greedy method in literature, our proposed algorithm can reduce the overall system's distortion by at least 45%. %B Communications, 2004 IEEE International Conference on %V 5 %P 3071 - 3075 Vol.5 - 3071 - 3075 Vol.5 %8 2004/06// %G eng %R 10.1109/ICC.2004.1313096