Science Cast

Mesolimbic dopamine encodes reward prediction errors independent of learning rates

librarian, April 19, 2024 7:26am

Connected to paper. This paper is a preprint and has not been certified by peer review.

Mesolimbic dopamine encodes reward prediction errors independent of learning rates

bioRxiv, April 18, 2024 12:00am

Authors

Mah, A.; Golden, C. E. M.; Constantinople, C. M.

Abstract

Biological accounts of reinforcement learning posit that dopamine encodes reward prediction errors (RPEs), which are multiplied by a learning rate to update state or action values. These values are thought to be represented in synaptic weights in the striatum, and updated by dopamine-dependent plasticity, suggesting that dopamine release might reflect the product of the learning rate and RPE. Here, we leveraged the fact that animals learn faster in volatile environments to characterize dopamine encoding of learning rates. We trained rats on a task with semi-observable states offering different rewards, and rats adjusted how quickly they initiated trials across states using RPEs. Computational modeling and behavioral analyses showed that learning rates were higher following state transitions, and scaled with trial-by-trial changes in beliefs about hidden states, approximating normative Bayesian strategies. Notably, dopamine release in the nucleus accumbens encoded RPEs independent of learning rates, suggesting that dopamine-independent mechanisms instantiate dynamic learning rates.
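
The update rule summarized in the abstract, in which an RPE is multiplied by a learning rate to update a value, can be illustrated with a minimal sketch. This is not the authors' model: the function names `update_value` and `simulate`, the scalar value representation, and any reward or learning-rate numbers passed in are assumptions chosen only for illustration.

```python
def update_value(value, reward, learning_rate):
    """One delta-rule step (illustrative; not the paper's model).

    Returns the updated value and the reward prediction error (RPE).
    """
    rpe = reward - value                      # RPE: outcome minus expectation
    new_value = value + learning_rate * rpe   # update is the RPE scaled by the learning rate
    return new_value, rpe


def simulate(rewards, learning_rates, initial_value=0.0):
    """Apply the delta rule over trials with a per-trial learning rate.

    Returns a list of (rpe, applied_update) pairs and the final value.
    The two entries in each pair differ by that trial's learning rate.
    """
    value = initial_value
    history = []
    for reward, alpha in zip(rewards, learning_rates):
        new_value, rpe = update_value(value, reward, alpha)
        history.append((rpe, new_value - value))
        value = new_value
    return history, value
```

In this sketch the RPE and the applied update are separate quantities that differ by the learning rate. The abstract reports that dopamine release in the nucleus accumbens tracked the RPE rather than the product of the two.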
