Jaldert Rombouts - Neurally plausible reinforcement learning of memory representations in delayed-response tasks
A key function of brains is the abstraction and maintenance of information from the environment for later use. Neurons in association cortex play an important role in this process: during learning, these neurons become tuned to relevant features and represent the information that is required later as a persistent elevation of their activity. It is, however, not well understood how these neurons acquire their task-relevant tuning.
Here we present a biologically plausible neural network model based on reinforcement learning that explains how neurons learn to represent task-relevant information in delayed-response tasks. This model generalizes the Attention-Gated Reinforcement Learning (AGREL) model by Roelfsema and van Ooyen (2005) to the temporal domain. An attention-based feedback signal from the motor layer to earlier processing layers is combined with a novel memory mechanism to solve the structural and temporal credit-assignment problems. We show that, on average, the resulting weight updates are equal to those of a variant of the error-backpropagation algorithm.
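To make the idea of feedback-gated learning concrete, here is a minimal sketch of a single learning step in a small two-layer network. It is an illustration under stated assumptions, not the model presented in the talk: the memory mechanism is left out, the task is reduced to a single time step, and the function names (`forward`, `gated_update`), the squared-error form of the prediction error, and the learning rate `beta` are all hypothetical choices. The point it shows is that only the selected action sends feedback to the hidden layer, and that for this action the update coincides with a gradient step of the kind used in error backpropagation.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def forward(x, V, W):
    """Return hidden activities y and one predicted value per action."""
    y = [sigmoid(sum(v * x_i for v, x_i in zip(row, x))) for row in V]
    q = [sum(w * y_j for w, y_j in zip(row, y)) for row in W]
    return y, q


def gated_update(x, V, W, action, reward, beta=0.1):
    """One feedback-gated update (hypothetical sketch, single time step).

    V[j][i]: input i -> hidden j weights; W[k][j]: hidden j -> action k.
    Returns updated copies of V and W plus the prediction error delta.
    """
    y, q = forward(x, V, W)
    delta = reward - q[action]      # global prediction error for the chosen action
    feedback = list(W[action])      # feedback comes from the selected action only

    new_W = [list(row) for row in W]
    for j, y_j in enumerate(y):
        new_W[action][j] += beta * delta * y_j

    new_V = [list(row) for row in V]
    for j, y_j in enumerate(y):
        # Plasticity of hidden unit j is gated by the feedback it receives.
        gate = feedback[j] * y_j * (1.0 - y_j)
        for i, x_i in enumerate(x):
            new_V[j][i] += beta * delta * gate * x_i
    return new_V, new_W, delta


# Toy values, chosen only for illustration.
x = [1.0, 0.0]
V = [[0.5, -0.3], [0.1, 0.8]]
W = [[0.2, -0.4], [0.7, 0.3]]
V2, W2, delta = gated_update(x, V, W, action=0, reward=1.0)
```

In this sketch the weights of the non-selected action are left untouched, and repeating the step for the same input and action moves the predicted value toward the received reward.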
The model can explain how neurons in lateral intraparietal cortex (LIP) learn to represent task-relevant information in 1) a memory (anti)saccade task, 2) an orientation discrimination task and 3) a probabilistic classification task. Comparisons with experimental results from animals trained on these same tasks show that the model neurons learn representations that are similar to those observed in biological neurons.
This is joint work with Pieter Roelfsema and Sander Bohte.
Last modified: 10 February 2021 14:56