From sound waves to locations : computational models for sound source localization in the early auditory pathway
Erstveröffentlichung
2021-11-03Authors
Oess, Timo
Referee
Ernst, MarcNeumann, Heiko
Grothe, Benedikt
Dissertation
Faculties
Fakultät für Ingenieurwissenschaften, Informatik und PsychologieInstitutions
Institut für Psychologie und PädagogikInstitut für Neuroinformatik
Abstract
Audition is our only far-reach omni-directional sense and it plays an important role for survival. Detecting a prey hidden in bushes or a predator sneaking up behind us can be crucial. Our auditory system facilitates us to not just detect such sound sources but also precisely localize them.
This localization is not trivial, but requires intensive processing steps along the auditory pathway. These steps include the binaural integration of signals from the left and right ear and the identification of systematic modifications in the perceived spectrum of a sound. In particular, to localize sound sources in the horizontal plane the interaural time (ITD) and level difference (ILD) are computed.
However, these cues create ambiguous information in the vertical plane. Thus, elevation-dependent spectral cues created by the shape of the ear are identified to localize sounds in the vertical plane. Together with the ITD and ILD, these cues provide a 2D representation of auditory space.
This space is comprised of internally computed auditory cues and therefore needs to be calibrated to create an association with the external world. Such a calibration is realized with a visual guidance signal to further create a common frame of reference and a means for multisensory integration. Computational models that describe all these processing steps are currently either missing or merely specialized on a single stage of the processing pathway.
In this thesis, the auditory pathway for localizing sound \\ sources is investigated by developing canonical computational neural network models of consecutive processing stages. In particular, a model for ILD computation in the lateral superior olive is presented which incorporates a synaptic adaptation mechanism that leads to improved spatial localization of sounds in the horizontal plane. Furthermore, a novel model of spectral cue processing for vertical sound source localization is introduced. This model suggests that localization in the vertical plane is fundamentally binaural but allows for monaural sound source localization. A model of the inferior colliculus describes how cues for horizontal and vertical localization are aligned with a visual teacher signal to provide a common frame of reference. Finally, a multisensory integration model of audio-visual inputs of the superior colliculus demonstrates the importance of cortical feedback for top-down modulation of sensory input. The analysis provides insights on its effect on the Bayesian optimal fusion of multisensory input streams.
Model conception and simulation experiments in this thesis investigate underlying neural principles and provide new hypotheses for auditory sound source localization.
Date created
2020
Cumulative dissertation containing articles
• Timo Oess, Marc O. Ernst, and Heiko Neumann. “Computational principles of neural adaptation for binaural signal integration.” In: PLOS Computational Biology 16.7 (July 2020), pp. 1–23. https://doi.org/10.1371/journal.pcbi.1008020
• Timo Oess, Heiko Neumann, and Marc O. Ernst. "Two Are Better Than One: Solving the Problem of Vertical Sound Source Localization via Binaural Integration of HRTFs" (früherer Titel: “Binaural Signal Integration Solves the Problem of Vertical Sound Source Localization.”) https://doi.org/10.1101/2020.09.10.291468
• Timo Oess, Marc Ernst, and Heiko Neumann. “Computational investigation of visually guided learning of spatially aligned auditory maps in the colliculus.” In: Proceedings of the International Symposium on Auditory and Audiological Research 7 (2020), pp. 149–156. https://proceedings.isaar.eu/index.php/isaarproc/article/view/2019-18
• Timo Oess, Maximilian P. R. Löhr, Daniel Schmid, Marc O. Ernst, and Heiko Neumann. “From Near-Optimal Bayesian Integration to Neuromorphic Hardware: A Neural Network Model of Multisensory Integration.” In: Frontiers in Neurorobotics 14 (2020), p. 29. issn: 1662-5218. https://doi.org/10.3389/fnbot.2020.00029
• Timo Oess, Heiko Neumann, and Marc O. Ernst. "Two Are Better Than One: Solving the Problem of Vertical Sound Source Localization via Binaural Integration of HRTFs" (früherer Titel: “Binaural Signal Integration Solves the Problem of Vertical Sound Source Localization.”) https://doi.org/10.1101/2020.09.10.291468
• Timo Oess, Marc Ernst, and Heiko Neumann. “Computational investigation of visually guided learning of spatially aligned auditory maps in the colliculus.” In: Proceedings of the International Symposium on Auditory and Audiological Research 7 (2020), pp. 149–156. https://proceedings.isaar.eu/index.php/isaarproc/article/view/2019-18
• Timo Oess, Maximilian P. R. Löhr, Daniel Schmid, Marc O. Ernst, and Heiko Neumann. “From Near-Optimal Bayesian Integration to Neuromorphic Hardware: A Neural Network Model of Multisensory Integration.” In: Frontiers in Neurorobotics 14 (2020), p. 29. issn: 1662-5218. https://doi.org/10.3389/fnbot.2020.00029
Subject headings
[GND]: Neuronales Netz | Lokalisation | Hören[LCSH]: Neurosciences | Modeling | Sounds
[Free subject headings]: Hearing , Audition | Neural Network | Localization
[DDC subject group]: DDC 570 / Life sciences
Metadata
Show full item recordDOI & citation
Please use this identifier to cite or link to this item: http://dx.doi.org/10.18725/OPARU-39524
Oess, Timo (2021): From sound waves to locations : computational models for sound source localization in the early auditory pathway. Open Access Repositorium der Universität Ulm und Technischen Hochschule Ulm. Dissertation. http://dx.doi.org/10.18725/OPARU-39524
Citation formatter >