Audio Processing and Digital Acoustics

The audio processing group at LCAV performs research and education on various topics related to capturing, processing, coding, and rendering of acoustic signals with special focus on 3D-audio. We try to develop expertise in every aspect of this broad field, going from foundations of signal processing, through the physics of wave phenomena, all the way to the human auditory perception. The research is carried out in cooperation with partners from art, industry, and science.

Over the years, we have worked on a broad range of topics that includes:

  • Directional sound capture and playback (beamforming)

  • Room equalization and acoustic echo control

  • Room acoustics simulation

  • Virtual acoustics/auralization

  • Automatic multichannel format conversion (upmix)

  • Sound perception and spatial hearing

  • Sound field reproduction

  • Spatial audio coding

  • Spatial sampling and coding of sound fields

You can also consult our archives for a for a more detailed description of past projects.

Currently, LCAV focuses on various aspects of location-aware audio signal processing. We crafted this term to succinctly cover both typical and highly atypical problems where the terms sound and localization happen to coexist: from vanilla sound source localization with microphone arrays, through more unconventional simultaneous localization of sound sources and microphones and mapping of a room (the infamous acoustic SLAM), to finally localizing concurrent sound sources using a single, albeit unconventional microphone.

RECENT LCAV PUBLICATIONS IN THIS AREA

M. M. J.-A. Simeoni; S. Kashani; P. Hurley; M. Vetterli : DeepWave: A Recurrent Neural-Network for Real-Time Acoustic Imaging. 2019-05-27. Thirty-third Conference on Neural Information Processing Systems (NeurIPS), Vancouver, December 9-14, 2019.
M. Krekovic; G. Baechler; I. Dokmanic; M. Vetterli : Structure from sound with incomplete data. 2018. 43rd International Conference on Acoustics, Speech and Signal Processing, Calgary, Alberta, Canada, April 15–20, 2018.
E. Bezzam; R. Scheibler; J. Azcarreta; H. Pan; M. Simeoni et al. : Hardware And Software For Reproducible Research In Audio Array Signal Processing. 2017. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LANew Orleans, LA, USA, MAR 05-09, 20175-9 March, 2017. p. 6591-6592. DOI : 10.1109/ICASSP.2017.8005297.
H. Pan; R. Scheibler; E. F. Bezzam; I. Dokmanic; M. Vetterli : FRIDA: FRI-Based DOA Estimation For Arbitrary Array Layouts. 2017. ICASSP 2017, New Orleans, USA, March 5-9, 2017. p. 3186-3190. DOI : 10.1109/ICASSP.2017.7952744.
D. El Badawy; I. Dokmanic; M. Vetterli : Acoustic DoA Estimation by One Unsophisticated Sensor. 2017. 13th International Conference on Latent Variable Analysis and Signal Separation, Grenoble, France, February 21-23, 2017. DOI : 10.1007/978-3-319-53547-0_9.
M. Krekovic; I. Dokmanic; M. Vetterli : Omnidirectional bats, point-to-plane distances, and the price of uniqueness. 2017. 42nd International Conference on Acoustics, Speech and Signal Processing, New Orleans, USA, March 5-9, 2017. p. 3261-3265.
M. Krekovic; I. Dokmanic; M. Vetterli : Look, no beacons! Optimal all-in-one EchoSLAM. 2016. 50th Asilomar Conference on Signals, Systems, and Computers, Asilomar, Pacific Grove, CA, November 6-9, 2016.
M. Krekovic; I. Dokmanic; M. Vetterli : EchoSLAM: Simultaneous Localization and Mapping with Acoustic Echoes. 2016. 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), Shanghai, China, 20-25 March 2016. p. 11-15.
I. Dokmanic; L. Daudet; M. Vetterli : From Acoustic Room Reconstruction to SLAM. 2016. 41st International Conference on Acoustics, Speech, and Signal Processing, Shanghai, China, March 20-25, 2016. p. 6345-6349.
R. Scheibler; I. Dokmanic; M. Vetterli : Raking echoes in the time domain. 2015. ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), South Brisbane, Queensland, Australia, 19-24 April 2015. p. 554-558. DOI : 10.1109/ICASSP.2015.7178030.
I. Dokmanic; R. Scheibler; M. Vetterli : Raking the Cocktail Party; IEEE Journal of Selected Topics in Signal Processing. 2015. DOI : 10.1109/JSTSP.2015.2415761.
F. Pinto; M. Kolundzija; M. Vetterli : Digital acoustics: processing wave fields in space and time using DSP tools; APSIPA Transactions on Signal and Information Processing. 2014. DOI : 10.1017/ATSIP.2014.13.
I. Dokmanic; L. Daudet; M. Vetterli : How to Localize Ten Microphones in One Fingersnap. 2014. 22nd European Signal Processing Conference, Lisbon, Portugal, September 1-5, 2014. p. 2275-2279.
I. Dokmanic; R. Parhizkar; A. Walther; Y. M. Lu; M. Vetterli : Acoustic Echoes Reveal Room Shape; Proceedings of the National Academy of Sciences. 2013. DOI : 10.1073/pnas.1221464110.
M. Kolundzija; C. Faller; M. Vetterli : Multi-channel low-frequency room equalization using perceptually motivated constrained optimization. 2012. IEEE International Conference on Acoustics, Speech, and Signal Processing, Kyoto, Japan, March 25-30, 2012. p. 533-536.
M. Kolundzija; C. Faller; M. Vetterli : Reproducing Sound Fields Using MIMO Acoustic Channel Inversion; Journal of the Audio Engineering Society. 2011.
M. Kolundzija; C. Faller; M. Vetterli : Spatiotemporal Gradient Analysis of Differential Microphone Arrays; Journal of the Audio Engineering Society. 2011.
M. Kolundzija; C. Faller; M. Vetterli : Design of a Compact Cylindrical Loudspeaker Array for Spatial Sound Reproduction. 2011. AES 130th Convention, May 13-16, 2011.

LCAV-APDA