Roberto
Capobianco

Profile

Roberto is a research scientist at Sony AI and external professor at Sapienza University of Rome. Previously, he was a research scientist at Cogitai and an assistant professor at Sapienza University of Rome, where he also received his Ph.D., under the supervision of Prof. Daniele Nardi. Roberto was a visiting Research Scholar at the Robotics Institute - Carnegie Mellon University (Pittsburgh, USA), under the supervision of Prof. J. Andrew Bagnell. He has been awarded a 3-years fellowship, a Research Starting Grant by Sapienza University of Rome, and a Robotics Fellowship in 2016 by AAAI. Roberto completed his MSc in AI and robotics in 2013, at Sapienza University of Rome, and his BSc in computer engineering at University of Pisa in 2011.

Message

“At Sony AI, I currently focus my research on reinforcement learning, including model-based approaches, and robot learning. I am also very interested in Explainable AI and Ethics. My long-term goal at Sony is to create ethical learning agents that can interact and communicate with humans in dynamic environments/domains, to empower their creativity and self-realization capabilities. At the same time, I really hope that through such empowered creativity we can better understand what intelligence really is.”

Publications

Real-time Trajectory Generation via Dynamic Movement Primitives for Autonomous Racing

ACC, 2024
Catherine Weaver*, Roberto Capobianco, Peter R. Wurman, Peter Stone, Masayoshi Tomizuka*

We employ sequences of high-order motion primitives for efficient online trajectory planning, enabling competitive racecar control even when the car deviates from an offline demonstration. Dynamic Movement Primitives (DMPs) utilize a target-driven non-linear differential equ…

Towards a fuller understanding of neurons with Clustered Compositional Explanations

NeurIPS, 2023
Biagio La Rosa*, Leilani H. Gilpin*, Roberto Capobianco

Compositional Explanations is a method for identifying logical formulas of concepts that approximate the neurons' behavior. However, these explanations are linked to the small spectrum of neuron activations used to check the alignment (i.e., the highest ones), thus lacking c…

Memory Replay For Continual Learning With Spiking Neural Networks

IEEE MSLP, 2023
Michela Proietti*, Alessio Ragno*, Roberto Capobianco

Two of the most impressive features of biological neural networks are their high energy efficiency and their ability to continuously adapt to varying inputs. On the contrary, the amount of power required to train top-performing deep learning models rises as they become more …

Explainable AI in drug discovery: self-interpretable graph neural network for molecular property prediction using concept whi…

Machine Learning, 2023
Michela Proietti*, Alessio Ragno*, Biagio La Rosa*, Rino Ragno*, Roberto Capobianco

Molecular property prediction is a fundamental task in the field of drug discovery. Several works use graph neural networks to leverage molecular graph representations. Although they have been successfully applied in a variety of applications, their decision process is not t…

Understanding Deep RL agent decisions: a novel interpretable approach with trainable prototypes

AIxIA, 2023
Caterina Borzillo*, Alessio Ragno*, Roberto Capobianco

Deep reinforcement learning (DRL) models have shown great promise in various applications, but their practical adoption in critical domains is limited due to their opaque decision-making processes. To address this challenge, explainable AI (XAI) techniques aim to enhance tra…

Grounding LTLf Specifcations in Image Sequences

KR, 2023
Elena Umili*, Roberto Capobianco, Giuseppe De Giacomo*

A critical challenge in neuro-symbolic (NeSy) approaches is to handle the symbol grounding problem without direct supervision. That is mapping high-dimensional raw data into an interpretation over a finite set of abstract concepts with a known meaning, without using labels. …

Visual Reward Machines

NeSy, 2023
Elena Umili*, Francesco Argenziano*, Aymeric Barbin*, Roberto Capobianco

Non-markovian Reinforcement Learning (RL) tasks are extremely hard to solve, because intelligent agents must consider the entire history of state-action pairs to act rationally in the environment. Most works use Linear Temporal Logic (LTL) to specify temporally-extended task…

An Overview of Environmental Features that Impact Deep Reinforcement Learning in Sparse-Reward Domains

JAIR, 2023
Jim Martin Catacora Ocaña*, Roberto Capobianco, Daniele Nardi*

Deep reinforcement learning has achieved impressive results in recent years; yet, it is still severely troubled by environments showcasing sparse rewards. On top of that, not all sparse-reward environments are created equal, ie, they can differ in the presence or absence of …

A self-interpretable module for deep image classification on small data

APIN, 2023
Jim Martin Catacora Ocaña*, Roberto Capobianco, Daniele Nardi*

Deep neural networks are the driving force of the recent explosion of machine learning applications in everyday life. However, they usually require a lot of training data to work well, and they act as black-boxes, making predictions without any explanation about them. This p…

State of the art of visual analytics for explainable deep learning

CGF, 2023
Biagio La Rosa*, Graziano Blasilli*, R Bourqui*, D Auber*, Giuseppe Santucci*, Roberto Capobianco, Enrico Bertini*, Romain Giot*, Marco Angelini*

The use and creation of machine-learning-based solutions to solve problems or reduce their computational costs are becoming increasingly widespread in many domains. Deep Learning plays a large part in this growth. However, it has drawbacks such as a lack of explainability an…

Prototype-based Interpretable Graph Neural Networks

IEEE TAI, 2022
Alessio Ragno*, Biagio La Rosa*, Roberto Capobianco

Graph neural networks have proved to be a key tool for dealing with many problems and domains such as chemistry, natural language processing and social networks. While the structure of the layers is simple, it is difficult to identify the patterns learned by the graph neural…

Grounding LTLf specifications in images

NeSy, 2022
Elena Umili*, Roberto Capobianco, Giuseppe De Giacomo*

A critical challenge in neurosymbolic approaches is to handle the symbol grounding problem without direct supervision. That is mapping high-dimensional raw data into an interpretation over a finite set of abstract concepts with a known meaning, without using labels. In this …

Ligand-based and structure-based studies to develop predictive models for SARS-CoV-2 main protease inhibitors through the 3d-…

Journal of Computer-Aided Molecular Design, 2022
Eleonora Proia*, Alessio Ragno*, Lorenzo Antonini*, Manuela Sabatino*, Milan Mladenovič*, Roberto Capobianco, Rino Ragno*

The main protease (Mpro) of SARS-Cov-2 is the essential enzyme for maturation of functional proteins implicated in viral replication and transcription. The peculiarity of its specific cleavage site joint with its high degree of conservation among all coronaviruses promote it…

Deep Reinforcement Learning for Pin-Point Autonomous Lunar Landing: Trajectory Recalculation for Obstacle Avoidance

AII, 2022
Giulia Ciabatti*, Dario Spiller, Shreyansh Daftry*, Roberto Capobianco, Fabio Curti*

This work aims to present a method to perform autonomous precision landing—pin-point landing—on a planetary environment and perform trajectory recalculation for fault recovery where necessary. In order to achieve this, we choose to implement a Deep Reinforcement Learning—DRL…

Outracing Champion Gran Turismo Drivers with Deep Reinforcement Learning

Nature, 2022
Pete Wurman, Samuel Barrett, Kenta Kawamoto, James MacGlashan, Kaushik Subramanian, Thomas Walsh, Roberto Capobianco, Alisa Devlic, Franziska Eckert, Florian Fuchs, Leilani Gilpin, Piyush Khandelwal, Varun Kompella, Hao Chih Lin, Patrick MacAlpine, Declan Oller, Takuma Seno, Craig Sherstan, Michael D. Thomure, Houmehr Aghabozorgi, Leon Barrett, Rory Douglas, Dion Whitehead Amago, Peter Dürr, Peter Stone, Michael Spranger, Hiroaki Kitano

Many potential applications of artificial intelligence involve making real-time decisions in physical systems while interacting with humans. Automobile racing represents an extreme example of these conditions; drivers must execute complex tactical manoeuvres to pass or block…

Planetary Environment Prediction Using Generative Modeling

AIAA SciTech Forum, 2022
Shrijit Singh*, Shreyansh Daftry*, Roberto Capobianco

Planetary rovers have a limited sensory horizon and operate in environments where limited information about the surrounding terrain is available. The rough and unknown nature of the terrain in planetary environments potentially leads to scenarios where the rover gets stuckan…

Tafl-ES: Exploring Evolution Strategies for Asymmetrical Board Games

AIxIA, 2021
Roberto Gallotta*, Roberto Capobianco

NeuroEvolution Strategies (NES) are a subclass of Evolution Strategies (ES). While their application to games and board games have been studied in the past [11], current state of the art in most of the games is still held by classic RL models, such as AlphaGo Zero [16]. This…

Exploration-Intensive Distractors: Two Environment Proposals and a Benchmarking

AIxIA, 2021
Jim Martin Catacora Ocaña*, Roberto Capobianco, Daniele Nardi*

Sparse-reward environments are famously challenging for deep reinforcement learning (DRL) algorithms. Yet, the prospect of solving intrinsically sparse tasks in an end-to-end fashion without any extra reward engineering is highly appealing. Such aspiration has recently led t…

Detection Accuracy for Evaluating Compositional Explanations of Units

AIxIA, 2021
Sayo M. Makinwa*, Biagio La Rosa*, Roberto Capobianco

The recent success of deep learning models in solving complex problems and in different domains has increased interest in understanding what they learn. Therefore, different approaches have been employed to explain these models, one of which uses human-understandable concept…

A Discussion about Explainable Inference on Sequential Data via Memory-Tracking

AIxIA, 2021
Biagio La Rosa*, Roberto Capobianco, Daniele Nardi*

The recent explosion of deep learning techniques boosted the application of Artificial Intelligence in a variety of domains, thanks to their high performance. However, performance comes at the cost of interpretability: deep models contain hundred of nested non-linear operati…

Learning Transferable Policies for Autonomous Planetary Landing via Deep Reinforcement Learning

ASCEND, 2021
Giulia Ciabatti*, Shreyansh Daftry*, Roberto Capobianco

In this work, we develop an application for autonomous landing, exploiting the properties of Deep Reinforcement Learning and Transfer Learning in order to tackle the problem of planetary landing on unknown or barely-known extra-terrestrial environments by learning good-perfo…

Agent-Based Markov Modeling for Improved COVID-19 Mitigation Policies

JAIR, 2021
Roberto Capobianco, Varun Kompella, James Ault*, Guni Sharon*, Stacy Jong*, Spencer Fox*, Lauren Meyers*, Pete Wurman, Peter Stone

The year 2020 saw the covid-19 virus lead to one of the worst global pandemics in history. As a result, governments around the world have been faced with the challenge of protecting public health while keeping the economy running to the greatest extent possible. Epidemiologi…

Autonomous Planetary Landing via Deep Reinforcement Learning and Transfer Learning

CVPR, 2021
Giulia Ciabatti*, Shreyansh Daftry*, Roberto Capobianco

The aim of this work is to develop an application for autonomous landing. We exploit the properties of Deep Reinforcement Learning and Transfer Learning, in order to tackle the problem of planetary landing on unknown or barely-known extra-terrestrial environments by learning…

Multiagent Epidemiologic Inference through Realtime Contact Tracing

AAMAS, 2021
Guni Sharon*, James Ault*, Peter Stone, Varun Kompella, Roberto Capobianco

This paper addresses an epidemiologic inference problem where, given realtime observation of test results, presence of symptoms,and physical contacts, the most likely infected individuals need to be inferred. The inference problem is modeled as a hidden Markovmodel where inf…

Explainable Inference on Sequential Data via Memory-Tracking

IJCAI, 2021
Biagio La Rosa*, Roberto Capobianco, Daniele Nardi*

In this paper we present a novel mechanism to get explanations that allow to better understand network predictions when dealing with sequential data. Specifically, we adopt memory-based networks — Differential Neural Computers — to exploit their capability of storing data in…

Reinforcement Learning for Optimization of COVID-19 Mitigation Policies

AAAI AI for Social Good, 2020
Varun Kompella, Roberto Capobianco, Stacy Jong*, Jonathan Browne*, Spencer Fox*, Lauren Meyers*, Pete Wurman, Peter Stone

The year 2020 has seen the COVID-19 virus lead to one of the worst global pandemics in history. As a result, governments around the world are faced with the challenge of protecting public health, while keeping the economy running to the greatest extent possible. Epidemiologi…

Blog

November 29, 2021 | Life at Sony AI

Meet the Team #2: Lingjuan, Jerone and Roberto

What do privacy, pattern recognition, and percussion all have in common? They are concepts and creative endeavors that have inspired Sony AI team members Lingjuan, Jerone and Roberto. Read on to learn more about these three Sony…

What do privacy, pattern recognition, and percussion all have in common? They are concepts and creative endeavors that have insp…

March 3, 2021 | Life at Sony AI

The Challenge to Create a Pandemic Simulator

The thing I like most about working at Sony AI is the quality of the projects we're working on, both for their scientific challenges and for their potential for improving the world. What could be more exciting than magnifying hu…

The thing I like most about working at Sony AI is the quality of the projects we're working on, both for their scientific challen…

JOIN US

Shape the Future of AI with Sony AI

We want to hear from those of you who have a strong desire
to shape the future of AI.