Eligibility sweeps

This repo contains code for replicating the results in the ICLR 2023 paper (Tiny Papers track): "Theta sequences as eligibility traces: A biological solution to credit assignment".

In this paper I show how theta sequences (fast hippocampal playthroughs of awake behaviour) enable agents to learn under a regime effectively equivalent to TD($\lambda$) (i.e. learning with bio-implausibly long memory traces). Here's the abstract:

Abstract

Credit assignment problems, for example policy evaluation in RL, often require bootstrapping prediction errors through preceding states or maintaining temporally extended memory traces; solutions which are unfavourable or implausible for biological networks of neurons. We propose theta sequences - chains of neural activity during theta oscillations in the hippocampus, thought to represent rapid playthroughs of awake behaviour - as a solution. By analysing and simulating a model for theta sequences we show they compress behaviour such that existing but short $\mathsf{O}(10)$ ms neuronal memory traces are effectively extended allowing for bootstrap-free credit assignment without long memory traces, equivalent to the use of eligibility traces in TD($\lambda$).
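For readers unfamiliar with TD($\lambda$), here is a minimal sketch of tabular policy evaluation with accumulating eligibility traces on a deterministic linear track. It is a generic textbook illustration, not the theta-sequence model from the paper and not code from EligibilitySequences.ipynb. The function name, the track layout, and all parameter values are assumptions made for this example.

```python
import numpy as np


def td_lambda_linear_track(n_states=10, n_episodes=200, alpha=0.1,
                           gamma=0.9, lam=0.8):
    """Tabular TD(lambda) on a hypothetical deterministic linear track.

    The agent walks from state 0 to state n_states - 1 and receives a
    reward of 1 on the final transition (out of the last state).
    """
    values = np.zeros(n_states)
    for _ in range(n_episodes):
        trace = np.zeros(n_states)  # eligibility trace, reset each episode
        for s in range(n_states):
            is_last = s == n_states - 1
            reward = 1.0 if is_last else 0.0
            next_value = 0.0 if is_last else values[s + 1]
            td_error = reward + gamma * next_value - values[s]
            trace *= gamma * lam   # decay memory of previously visited states
            trace[s] += 1.0        # mark the current state as eligible
            values += alpha * td_error * trace
    return values


# After one episode, lam = 0 credits only the state next to the reward,
# while lam > 0 spreads credit back along the whole track.
print(td_lambda_linear_track(n_episodes=1, lam=0.0))
print(td_lambda_linear_track(n_episodes=1, lam=0.8))
```

With `lam=0.0` the first pass updates only the final state, so earlier states have to wait for later episodes to bootstrap from it. With `lam=0.8` every visited state receives some credit on the first pass, which is the long-memory-trace behaviour the abstract refers to.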

Main paper figure

To run the code:

The Jupyter notebook EligibilitySequences.ipynb replicates the paper figure. It can also be run in Google Colab.

The repository contains an images folder, EligibilitySequences.ipynb, EligibilitySequences_utils.py, and README.md.
