Subunit models economically explain neuronal responses in macaque V1

B Vintch, A D Zaharia, J A Movshon and E P Simoncelli

Published in Annual Meeting, Neuroscience, Oct 2012.

This paper has been superseded by:
A convolutional subunit model for neuronal responses in macaque V1
B Vintch, J A Movshon and E P Simoncelli.
J Neurosci, vol.35 pp. 14829--14841, Nov 2015.


Many have sought a characterization of the receptive fields of neurons in primary visual cortex (V1) that is both efficient and biologically interpretable. Spike-triggered averaging (STA) recovers a single filter which describes the response of a V1 neuron to pseudorandom stimuli such as white noise. STA can successfully recover the filter for a simple cell, but the STA of a complex cell is typically unstructured and uninformative. Spike-triggered covariance (STC) extends the analysis to recover a set of filters which better describe complex cell responses; these filters, however, are not guaranteed to relate to underlying biological structures. It is thought that simple cells receive inputs predominantly from LGN, while complex cells receive inputs from simple cells in V1. A different and promising approach describes the receptive field as a combination of "subunits", each corresponding to an afferent. We have developed a method that explicitly models this pooling process and is able to capture the structure of the receptive fields of both simple and complex cells.

Our model consists of simple-like linear-nonlinear (LN) subunits, linearly pooled to generate neural responses. A complex cell would have many subunits distributed over space to yield phase invariance, while a simple cell would have only one. When fitting the three stages of this LN-L model (filter, output nonlinearity, spatial pooling), we assume that subunits differ from one another only in location. This constraint reduces the problem of model estimation to a simple well-behaved coordinate descent procedure. This basic method can be extended to create multiple subunit "channels" that add at the final linear stage, for example one excitatory and one suppressive.

We recorded from 38 cells in anesthetized macaque V1 and characterize their receptive fields with dense ternary white pixel noise. The model performs well: across V1 neurons, we measured an average cross-validated correlation coefficient between the actual and predicted firing rate of 0.52 +/- 0.04 (SEM). The model performed equally well for cells distributed along the simple-complex spectrum. Moreover, the subunit model provides a concise and interpretable depiction of the receptive field; the LN subunits describe the tuning parameters and selectivity, while the final linear pooling describes the spatial extent and phase invariance of the cell. The simplicity of the model also allows it to avoid overfitting, outperforming STC for an equivalent amount of data.


  • Listing of all publications