|The striatum has long been implicated in reinforcement learning, and several neurophysiological studies point to it as the substrate for encoding the reward value of stimuli. Several basal ganglia models use reward prediction error (RPE) as the underlying learning signal, which gives them Pavlovian conditioning abilities of the kind that the Rescorla-Wagner model can simulate.
Lateral inhibition between striatal projection neurons was once thought to serve a winner-take-all function, useful for selecting between possible actions. However, it has been noted that the reciprocal connections this interpretation requires are too few, and that these synaptic connections are relatively weak. Still,...|
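
To make the link between RPE and the Rescorla-Wagner model concrete, here is a minimal sketch of the standard Rescorla-Wagner update, in which each stimulus present on a trial has its associative value moved by a fraction of the prediction error. It illustrates the textbook rule only and is not the implementation used in any of the basal ganglia models mentioned above. The function name `rescorla_wagner`, the learning rate `alpha`, and the trial format are assumptions made for this example.

```python
def rescorla_wagner(trials, n_stimuli, alpha=0.1):
    """Run the Rescorla-Wagner rule over a sequence of trials.

    trials: list of (present, reward) pairs, where `present` is a tuple of
            0/1 flags (one per stimulus) and `reward` is the outcome.
    alpha:  learning rate (assumed value, chosen for illustration).
    Returns the final values and the per-trial prediction errors.
    """
    values = [0.0] * n_stimuli
    errors = []
    for present, reward in trials:
        # Prediction is the summed value of all stimuli present on this trial.
        prediction = sum(v for v, p in zip(values, present) if p)
        # Reward prediction error: outcome minus prediction.
        rpe = reward - prediction
        # Only stimuli that were present are updated.
        values = [v + alpha * rpe if p else v for v, p in zip(values, present)]
        errors.append(rpe)
    return values, errors


# Hypothetical acquisition schedule: one stimulus, always followed by reward.
acquisition = [((1,), 1.0)] * 50
final_values, rpes = rescorla_wagner(acquisition, n_stimuli=1)
print(final_values[0], rpes[0], rpes[-1])
```

In this sketch the prediction error is largest on the first trial and shrinks as the stimulus value approaches the reward, which is the acquisition pattern associated with Pavlovian conditioning.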