Authors

* External authors

Venue

Date

Share

ProtoCRL: Prototype-based Network for Continual Reinforcement Learning

Michela Proietti*

Peter R. Wurman

Peter Stone

Roberto Capobianco

* External authors

RLC-25

2025

Abstract

The purpose of continual reinforcement learning is to train an agent on a sequence of tasks such that it learns the ones that appear later in the sequence while retaining the
ability to perform the tasks that appeared earlier. Experience replay is a popular method used to make the agent remember previous tasks, but its effectiveness strongly relies on
the selection of experiences to store. Kompella et al. (2023) proposed organizing the experience replay buffer into partitions, each storing transitions leading to a rare but
crucial event, such that these key experiences get revisited more often during training.
However, the method is sensitive to the manual selection of event states. To address this issue, we introduce ProtoCRL, a prototype-based architecture leveraging a variational
Gaussian mixture model to automatically discover effective event states and build the associated partitions in the experience replay buffer. The proposed approach is tested
on a sequence of MiniGrid environments, demonstrating the agent’s ability to adapt and learn new skills incrementally.

Related Publications

XAI-Guided Continual Learning: Rationale, Methods, and Future Directions

WIREs, 2025
Michela Proietti*, Alessio Ragno*, Roberto Capobianco

Providing neural networks with the ability to learn new tasks sequentially represents one of the main challenges in artificial intelligence. Unlike humans, neural networks are prone to losing previously acquired knowledge upon learning new information, a phenomenon known as …

Interpretable Memory-based Prototypical Pooling

WSDM, 2025
Alessio Ragno*, Roberto Capobianco

Graph Neural Networks (GNNs) have proven their effectiveness in various graph-structured data applications. However, one of the significant challenges in the realm of GNNs is representation learning, a critical concept that bridges graph pooling, aimed at creating compressed…

Intermediate Layers of LLMs Align Best With the Brain by Balancing Short- and Long-Range Information

CCN, 2025
Michela Proietti*, Roberto Capobianco, Mariya Toneva

Contextual integration is fundamental to human language comprehension. Language models are a powerful tool for studying how contextual information influences brain activity. In this work, we analyze the brain alignment of three types of language models, which vary in how the…

  • HOME
  • Publications
  • ProtoCRL: Prototype-based Network for Continual Reinforcement Learning

JOIN US

Shape the Future of AI with Sony AI

We want to hear from those of you who have a strong desire
to shape the future of AI.