Limits of an AI program for solving college math problems

08/14/2022
by   Ernest Davis, et al.
0

Drori et al. (2022) report that "A neural network solves, explains, and generates university math problems by program synthesis and few-shot learning at human level ... [It] automatically answers 81% of university-level mathematics problems." The system they describe is indeed impressive; however, the above description is very much overstated. The work of solving the problems is done, not by a neural network, but by the symbolic algebra package Sympy. Problems of various formats are excluded from consideration. The so-called "explanations" are just rewordings of lines of code. Answers are marked as correct that are not in the form specified in the problem. Most seriously, it seems that in many cases the system uses the correct answer given in the test corpus to guide its path to solving the problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/16/2021

Solving Linear Algebra by Program Synthesis

We solve MIT's Linear Algebra 18.06 course and Columbia University's Com...
research
06/11/2022

A Dataset and Benchmark for Automatically Answering and Generating Machine Learning Final Exams

Can a machine learn machine learning? We propose to answer this question...
research
11/06/2017

Learning Solving Procedure for Artificial Neural Network

It is expected that progress toward true artificial intelligence will be...
research
12/02/2019

Deep Learning for Symbolic Mathematics

Neural networks have a reputation for being better at solving statistica...
research
11/18/2019

Program synthesis performance constrained by non-linear spatial relations in Synthetic Visual Reasoning Test

Despite remarkable advances in automated visual recognition by machines,...
research
06/26/2018

Clustering Complex Zeros of Triangular Systems of Polynomials

This paper gives the first algorithm for finding a set of natural ϵ-clus...
research
11/11/2019

(When) Is Truth-telling Favored in AI Debate?

For some problems, humans may not be able to accurately judge the goodne...

Please sign up or login with your details

Forgot password? Click here to reset