Why does dopamine ramp up during approach to predictable rewards?
In this preprint with Luke Priestley, we explore the idea that dopamine ramps occur when reward predictions inferred from a world model are used to train striatal cached values.
Midbrain dopamine neurons are thought to implement a temporal difference (TD) reward prediction error (RPE) that updates cached values stored in striatum. This has been challenged by evidence that dop...
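For reference, the standard TD account can be sketched with tabular TD(0) on a linear track. This is a minimal textbook illustration, not the model from the preprint: the function name `run_td0`, the track length, and the values of `gamma` and `alpha` are all assumptions chosen for the example. In this simple setting, once learning has converged the cached values rise toward the reward while the RPE itself stays near zero at every step of the approach.

```python
def run_td0(n_states=10, gamma=0.9, alpha=0.1, n_episodes=2000, reward=1.0):
    """Tabular TD(0) on a hypothetical linear track.

    The agent walks from state 0 to state n_states - 1 and receives
    `reward` on leaving the final state. All settings are illustrative.
    """
    # One extra entry acts as the terminal state, whose value stays at 0.
    values = [0.0] * (n_states + 1)
    rpes = [0.0] * n_states

    for _ in range(n_episodes):
        for s in range(n_states):
            r = reward if s == n_states - 1 else 0.0
            # TD reward prediction error: delta = r + gamma * V(s') - V(s)
            delta = r + gamma * values[s + 1] - values[s]
            # Cached value update driven by the RPE
            values[s] += alpha * delta
            # Keep the RPE observed at each state in the most recent episode
            rpes[s] = delta

    return values[:n_states], rpes


if __name__ == "__main__":
    values, rpes = run_td0()
    print("cached values:", [round(v, 3) for v in values])
    print("RPEs (last episode):", [round(d, 6) for d in rpes])
```

With a discount factor below 1, the learned values approach `reward * gamma ** (n_states - 1 - s)`, so it is the value signal, not the converged RPE, that ramps in this sketch.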