Sound texture perception via synthesisJ H McDermott, E P Simoncelli and A J Oxenham.Published in Annual Meeting, ARO, Feb 2008.This paper has been superseded by:
|
We tested whether rudimentary statistics computed from the responses of a bank of bandpass filters could produce compelling synthetic textures. Simply matching the marginal statistics (variance, kurtosis) of individual filter responses was generally insufficient to yield good results, but imposing various joint envelope statistics (cross-band correlations, autocorrelations within each band, and cross-band correlations across time) greatly improved the results, frequently producing synthetic textures that sounded natural and recognizable. Synthesizing some classes of textures may necessitate complex "feature detectors", but in many cases, textures with audible features (raindrops, crackles, insect/bird calls) emerge from the imposition of much simpler statistical constraints. The results suggest that the auditory system may rely on surprisingly simple statistics to recognize real-world sound textures.
[Supported by NIH grant R01DC07657 and the Howard Hughes Medical Institute].