A Gestalt inference model for auditory scene segregation | Interagency Modeling and Analysis Group

What is being modeled?

Segregation of sounds in the auditory system

Description & purpose of resource

The auditory stream segregation model leverages the multiplexed, non-linear representation of sounds along the auditory hierarchy and learns the local and global statistical structure that emerges in natural, complex sounds. The architecture has three key components: (1) a stochastic network (RBM) layer that encodes the two-dimensional input spectrogram into localized spectro-temporal bases using short-term feature analysis; (2) a dynamic aRBM that captures long-term temporal dependencies across the spectro-temporal bases, characterizing the transformation of sound from fast-changing details to slower dynamics; and (3) a temporal coherence layer that mimics the Hebbian process of binding local and global details together, mediating the mapping from feature space to the formation of auditory objects.
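
The data flow through the three stages can be illustrated with a minimal NumPy sketch. This is not the published implementation: the weights are random rather than learned, a moving average stands in for the dynamic aRBM, and temporal coherence is approximated by pairwise correlation of unit responses over time. All function names, array shapes, and the 0.5 threshold are assumptions made for illustration only.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def rbm_hidden_probs(frames, W, b):
    """Stage 1 (assumed form): hidden-unit probabilities of a Bernoulli RBM,
    sigmoid(v W + b), computed for every spectrogram frame."""
    return sigmoid(frames @ W + b)

def slow_features(h, window):
    """Stage 2 stand-in: moving average over time for each unit.
    The real model uses a dynamic aRBM; this only mimics the shift
    from fast-changing details to slower dynamics."""
    kernel = np.ones(window) / window
    return np.apply_along_axis(
        lambda x: np.convolve(x, kernel, mode="same"), 0, h
    )

def coherence_matrix(responses):
    """Stage 3 stand-in: correlation between every pair of units over time."""
    return np.corrcoef(responses, rowvar=False)

def group_by_coherence(C, anchor, threshold=0.5):
    """Units whose responses co-vary with the anchor unit (hypothetical rule)."""
    return np.flatnonzero(C[anchor] > threshold)

rng = np.random.default_rng(0)
T, F, H = 200, 32, 16                    # time frames, frequency bins, hidden units
spectrogram = rng.random((T, F))         # placeholder input, not real audio
W = rng.normal(scale=0.5, size=(F, H))   # random weights; a trained RBM would supply these
b = np.zeros(H)

fast = rbm_hidden_probs(spectrogram, W, b)   # shape (T, H)
slow = slow_features(fast, window=9)         # shape (T, H)
C = coherence_matrix(slow)                   # shape (H, H)
group = group_by_coherence(C, anchor=0)      # indices of units bound to unit 0
```

With trained weights and real audio, the units grouped with a given anchor would be candidates for belonging to the same auditory object; with the random placeholders used here, the grouping carries no perceptual meaning.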

Has this resource been validated?

DOI link to publication describing this resource

http://doi.org/10.1371/journal.pcbi.1006711
