Learning to express reward prediction error-like dopaminergic activity requires plastic representations of time
dc.citation.articleNumber | 5856 | en_US |
dc.citation.journalTitle | Nature Communications | en_US |
dc.citation.volumeNumber | 15 | en_US |
dc.contributor.author | Cone, Ian | en_US |
dc.contributor.author | Clopath, Claudia | en_US |
dc.contributor.author | Shouval, Harel Z. | en_US |
dc.date.accessioned | 2024-08-09T16:25:26Z | en_US |
dc.date.available | 2024-08-09T16:25:26Z | en_US |
dc.date.issued | 2024 | en_US |
dc.description.abstract | The dominant theoretical framework to account for reinforcement learning in the brain is temporal difference (TD) learning, whereby certain units signal reward prediction errors (RPE). The TD algorithm has been traditionally mapped onto the dopaminergic system, as firing properties of dopamine neurons can resemble RPEs. However, certain predictions of TD learning are inconsistent with experimental results, and previous implementations of the algorithm have made unscalable assumptions regarding stimulus-specific fixed temporal bases. We propose an alternate framework to describe dopamine signaling in the brain, FLEX (Flexibly Learned Errors in Expected Reward). In FLEX, dopamine release is similar, but not identical, to RPE, leading to predictions that contrast with those of TD. While FLEX itself is a general theoretical framework, we describe a specific, biophysically plausible implementation, the results of which are consistent with a preponderance of both existing and reanalyzed experimental data. | en_US |
dc.identifier.citation | Cone, I., Clopath, C., & Shouval, H. Z. (2024). Learning to express reward prediction error-like dopaminergic activity requires plastic representations of time. Nature Communications, 15(1), 5856. https://doi.org/10.1038/s41467-024-50205-3 | en_US |
dc.identifier.digital | s41467-024-50205-3 | en_US |
dc.identifier.doi | https://doi.org/10.1038/s41467-024-50205-3 | en_US |
dc.identifier.uri | https://hdl.handle.net/1911/117642 | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Springer Nature | en_US |
dc.rights | Except where otherwise noted, this work is licensed under a Creative Commons Attribution (CC BY) license. Permission to reuse, publish, or reproduce the work beyond the terms of the license or beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder. | en_US |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | en_US |
dc.title | Learning to express reward prediction error-like dopaminergic activity requires plastic representations of time | en_US |
dc.type | Journal article | en_US |
dc.type.dcmi | Text | en_US |
dc.type.publication | publisher version | en_US |
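For context on the framework the abstract contrasts FLEX against: temporal difference (TD) learning updates value estimates using a reward prediction error (RPE) signal. The sketch below is a generic, minimal tabular TD(0) example and is not taken from the paper or its FLEX implementation; the state names, reward values, and learning parameters are illustrative placeholders.

    import numpy as np

    def td0_update(V, s, r, s_next, alpha=0.1, gamma=0.95):
        """One TD(0) step: delta = r + gamma*V[s'] - V[s]; V[s] += alpha*delta."""
        delta = r + gamma * V[s_next] - V[s]   # reward prediction error (RPE)
        V[s] += alpha * delta
        return V, delta

    # Toy example: a cue state reliably precedes a rewarded state.
    V = {"cue": 0.0, "reward_state": 0.0, "terminal": 0.0}
    for trial in range(50):
        V, d_cue = td0_update(V, "cue", 0.0, "reward_state")
        V, d_rew = td0_update(V, "reward_state", 1.0, "terminal")

    # Over trials, the RPE at the rewarded state shrinks and value propagates
    # back to the cue, the dopamine-like RPE behavior that the abstract says
    # FLEX reproduces only approximately, with contrasting predictions.

This is only meant to make the RPE term concrete; the paper's point is that FLEX dopamine signals resemble, but do not equal, this TD-style RPE.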