Ila Fiete
Dept. of Physics, Harvard U
Dept. Brain and Cognitive Sciences, MIT

Reinforcement Learning of Birdsong in a Spiking Neural Network Model

The birdsong motor circuit is a hierarchical structure: nucleus HVC projects to premotor nucleus RA, which in turn drives motor neurons. Recent experiments show that RA-projecting HVC neurons have temporally sparse neural sequences that drive activity in RA. In this context, the role of RA appears to be the conversion of abstract neural sequences in HVC into motor activity.

In this talk, I will illustrate how an appropriate map of HVC to motor activity could be learned via plastic connections between HVC and RA. Such learning is commonly thought to be driven by reinforcement (Doya & Sejnowski 1995, Troyer & Doupe 2000), with a reward signal generated by comparing the bird's vocal output with an internally stored copy (template) of its tutor song.

We have constructed a reinforcement model with spiking neurons that learns HVC-to-RA connections in a feedforward network of HVC, RA, and a motor layer. We assume that HVC provides a sparse sequence, and learning is governed by a synaptic plasticity rule that exploits correlations between fluctuations in the motor output due to noisy neural inputs, and a positive scalar global reward that depends on the match between network output and the stored template. We explore motor fluctuations arising from the inherent stochasticity of HVC-to-RA synapses, or from (possibly LMAN-generated) noise injected into RA. The learning rule performs stochastic gradient ascent on the reward, and is robust over a wide range of parameters.

If time permits, I will talk about the possible role of sparse premotor coding in the rapid learning of song.