Last update: 10/22/2007

Sparse Respresentations for the Cocktail Party Problem

Asari,H., Pearlmutter, B.A., & Zador, A.M.


A striking feature of many sensory processing problems is that there appear to be many more neurons engaged in the internal representations of the signal than in its transduction. For example, humans have about 30,000 cochlear neurons, but at least a thousand times as many neurons in the auditory cortex. Such apparently redundant internal representations have sometimes been proposed as necessary to overcome neuronal noise. We instead posit that they directly subserve computations of interest. Here we provide an example of how sparse overcomplete linear representations can directly solve difficult acoustic signal processing problems, using as an example monaural source separation using solely the cues provided by the differential filtering imposed on a source by its path from its origin to the cochlea (the head-related transfer function, or HRTF). In contrast to much previous work, the HRTF is used here to separate auditory streams rather than to localize them in space. The experimentally testable predictions that arise from this model---including a novel method for estimating a neuron's optimal stimulus using data from a multi-neuron recording experiment---are generic, and apply to a wide range of sensory computations.

Poster (150KB, PDF): Gordon Research Conference (Sensory coding and the natural environment), Oxford, 2004.
Proceeding (287 KB, PDF): Forum Acusticum 2005, Budapest, Hungary, Distributed on CD-ROM at the conference.
Reprint (644KB, PDF): J. Neurosci. 2006; 26(28): 7477-7490.
See also Asari et al., (2007) and my dissertation (Chapter 2).