Learning to solve arithmetic problems with a virtual abacus

01/17/2023
by Flavio Petruzzellis, et al.

Acquiring mathematical skills is considered a key challenge for modern Artificial Intelligence systems. Inspired by the way humans discover numerical knowledge, we introduce a deep reinforcement learning framework that makes it possible to simulate how cognitive agents could gradually learn to solve arithmetic problems by interacting with a virtual abacus. The proposed model successfully learns to perform multi-digit additions and subtractions, achieving an error rate below 1% during training. We also compare the performance of learning agents that receive different amounts of explicit supervision, and we analyze the most common error patterns to better understand the limitations and biases resulting from our design choices.

research
11/06/2016

Learning to Perform Physics Experiments via Deep Reinforcement Learning

When encountering novel objects, humans are able to infer a wide range o...
research
07/06/2022

Transformers discover an elementary calculation system exploiting local attention and grid-like problem representation

Mathematical reasoning is one of the most impressive achievements of hum...
research
10/14/2022

Adaptive patch foraging in deep reinforcement learning agents

Patch foraging is one of the most heavily studied behavioral optimizatio...
research
05/07/2013

Projective simulation for classical learning agents: a comprehensive investigation

We study the model of projective simulation (PS), a novel approach to ar...
research
12/26/2020

Towards sample-efficient episodic control with DAC-ML

The sample-inefficiency problem in Artificial Intelligence refers to the...
research
10/16/2021

Learning to Solve Complex Tasks by Talking to Agents

Humans often solve complex problems by interacting (in natural language)...
research
11/08/2022

A study on the ephemeral nature of knowledge shared within multiagent systems

Achieving knowledge sharing within an artificial swarm system could lead...
