Jaldert Rombouts - Neurally plausible reinforcement learning of memory representations in delayed-response tasks
A key function of brains is undoubtedly the abstraction and maintenance of information from the environment for later use. Neurons in association cortex play an important role in this process: during learning these neurons become tuned to relevant features and represent the information that is required later as a persistent elevation of their activity. It is however not well known how these neurons acquire their task-relevant tuning.
Here we present a biologically plausible neural network model based on reinforcement learning that explains how neurons learn to represent task-relevant information in delayed response tasks. This model generalizes the Attention-Gated Reinforcement Learning (AGREL) model by Roelfsema and van Ooyen (2005) to the temporal domain. An attention-based feedback signal from the motor layer to earlier processing layers is combined with a novel memory mechanism to solve the structural and temporal credit-assignment problems. We can show that on average the updates are equal to a variant of the Error-Backpropagation algorithm.
The model can explain how neurons in lateral intraparietal cortex (LIP) learn to represent task-relevant information in 1) a memory (anti)saccade task, 2) an orientation discrimination task and 3) a probabilistic classification task. Comparisons with experimental results from animals trained on these same tasks show that the model neurons learn representations that are similar to those observed in biological neurons.
This is joint work with Pieter Roelfsema and Sander Bohte
Last modified: | 10 February 2021 2.56 p.m. |
More news
-
10 September 2025
Funding for Feringa and Minnaard from National Growth Fund project Big Chemistry
Two UG research projects have received funding from the National Growth Fund project Big Chemistry via NWO.
-
09 September 2025
The carbon cycle as Earth’s thermostat
Earth's natural carbon cycle becomes unbalanced if we, humans, continue to release extra carbon dioxide (CO2) into the atmosphere. In this overview article about the carbon cycle, you can find out how Earth generally keeps itself in balance and how...
-
09 September 2025
Carbon dioxide’s fingerprint
In the year 2000, Harro Meijer, Professor of Isotope Physics at the University of Groningen, set up the Lutjewad Measurement Station near Hornhuizen. There, researchers from Groningen are mapping where CO2 in the atmosphere originates and where it...