Neural Reward Machines

Elena Umili*

Francesco Argenziano*

Roberto Capobianco

* External authors

ECAI-24

2024

Abstract

Non-Markovian Reinforcement Learning (RL) tasks are very hard to solve, because agents must consider the entire history of state-action pairs to act rationally in the environment. Most works use symbolic formalisms (such as Linear Temporal Logic or automata) to specify the temporally extended task. These approaches only work in finite and discrete state environments, or in continuous problems for which a mapping between the raw state and a symbolic interpretation, known as a symbol grounding (SG) function, is available. Here, we define Neural Reward Machines (NRM), an automata-based neurosymbolic framework that can be used for both reasoning and learning in non-symbolic, non-Markovian RL domains, and which is based on the probabilistic relaxation of Moore Machines. We combine RL with semi-supervised symbol grounding (SSSG) and show that NRMs can exploit high-level symbolic knowledge in non-symbolic environments without any knowledge of the SG function, outperforming Deep RL methods that cannot incorporate prior knowledge. Moreover, we advance the research in SSSG, proposing an algorithm for analysing the groundability of temporal specifications that is more efficient than baseline techniques by a factor of 10³.
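To make the core idea more concrete, below is a minimal, hypothetical sketch (not the authors' released code) of a probabilistic relaxation of a Moore machine in PyTorch: transition and output functions are relaxed into row-stochastic matrices via softmax, the machine state is propagated as a probability distribution, and per-step symbol probabilities (e.g., produced by a learned symbol grounding network) drive the transitions. The class name, parameter shapes, and the initial-state convention are illustrative assumptions, not the paper's exact formulation.

```python
# Hypothetical sketch of a probabilistic relaxation of a Moore machine.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ProbabilisticMooreMachine(nn.Module):
    def __init__(self, n_states: int, n_symbols: int, n_outputs: int):
        super().__init__()
        # Unnormalized transition logits: one state-to-state matrix per symbol.
        self.trans_logits = nn.Parameter(torch.randn(n_symbols, n_states, n_states))
        # Unnormalized output (e.g., reward) logits per state, as in a Moore machine.
        self.out_logits = nn.Parameter(torch.randn(n_states, n_outputs))
        self.n_states = n_states

    def forward(self, symbol_probs: torch.Tensor) -> torch.Tensor:
        """symbol_probs: (T, n_symbols) distribution over symbols at each step.
        Returns a (T, n_outputs) distribution over machine outputs per step."""
        # Start with all probability mass on state 0 (assumed initial state).
        state = torch.zeros(self.n_states)
        state[0] = 1.0
        trans = F.softmax(self.trans_logits, dim=-1)  # row-stochastic, per symbol
        out = F.softmax(self.out_logits, dim=-1)      # output distribution per state
        outputs = []
        for p_sym in symbol_probs:
            # Expected transition matrix under the current symbol distribution.
            expected_trans = torch.einsum("s,sij->ij", p_sym, trans)
            state = state @ expected_trans            # propagate the state distribution
            outputs.append(state @ out)               # expected Moore output at this step
        return torch.stack(outputs)
```

Because every operation in this sketch is differentiable, gradients from a downstream loss can flow both into the relaxed machine parameters and into whatever network produces `symbol_probs`, which is the property that allows symbolic knowledge and symbol grounding to be learned jointly in non-symbolic environments.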

