PVLV
The primary value learned value (PVLV) model is a possible explanation for the reward-predictive firing properties of dopamine (DA) neurons.[1] It simulates behavioral and neural data on Pavlovian conditioning, including the firing of midbrain dopaminergic neurons in proportion to unexpected rewards. It is an alternative to the temporal-difference (TD) learning algorithm.[2]
It is used as part of Leabra.
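The idea that dopamine-like firing scales with unexpected reward can be illustrated with a generic delta-rule sketch. This is not the PVLV model itself or its published equations: it is a minimal illustration, assuming a single conditioned stimulus, a scalar reward, and a hypothetical function name and learning rate chosen only for this example.

```python
def simulate_prediction_errors(rewards, learning_rate=0.2):
    """Return the reward prediction error on each trial.

    Illustrative assumption: one stimulus with a single learned
    expectation, updated by a simple delta rule. The name and the
    default learning rate are hypothetical, not taken from PVLV.
    """
    expected = 0.0  # learned reward expectation for the stimulus
    errors = []
    for reward in rewards:
        # The error is the unexpected portion of the reward; it plays
        # the role of the dopamine-like signal in this sketch.
        delta = reward - expected
        errors.append(delta)
        # Move the expectation a fraction of the way toward the outcome.
        expected += learning_rate * delta
    return errors


# Twenty rewarded trials followed by one trial where reward is omitted.
errors = simulate_prediction_errors([1.0] * 20 + [0.0])
```

In this sketch the error is largest on the first rewarded trial, shrinks as the reward becomes expected, and turns negative when an expected reward is omitted.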
References
- [1] http://psych.colorado.edu/~oreilly/pubs-abstr.html#OReillyFrankHazyEtAl07
- [2] http://grey.colorado.edu/emergent/index.php/Leabra_PVLV
This article is issued from Wikipedia - version of Wednesday, April 30, 2014. The text is available under the Creative Commons Attribution/Share Alike license, but additional terms may apply for the media files.