GSSF: A Generative Sequence Similarity Function based on a Seq2Seq model for clustering online handwritten mathematical answers

05/21/2021
by   Huy Quang Ung, et al.
0

Toward a computer-assisted marking for descriptive math questions,this paper presents clustering of online handwritten mathematical expressions (OnHMEs) to help human markers to mark them efficiently and reliably. We propose a generative sequence similarity function for computing a similarity score of two OnHMEs based on a sequence-to-sequence OnHME recognizer. Each OnHME is represented by a similarity-based representation (SbR) vector. The SbR matrix is inputted to the k-means algorithm for clustering OnHMEs. Experiments are conducted on an answer dataset (Dset_Mix) of 200 OnHMEs mixed of real patterns and synthesized patterns for each of 10 questions and a real online handwritten mathematical answer dataset of 122 student answers at most for each of 15 questions (NIER_CBT). The best clustering results achieved around 0.916 and 0.915 for purity, and around 0.556 and 0.702 for the marking cost on Dset_Mix and NIER_CBT, respectively. Our method currently outperforms the previous methods for clustering HMEs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/04/2019

Answer-based Adversarial Training for Generating Clarification Questions

We present an approach for generating clarification questions with the g...
research
06/21/2020

Match^2: A Matching over Matching Model for Similar Question Identification

Community Question Answering (CQA) has become a primary means for people...
research
06/17/2021

Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention

The recognition of handwritten mathematical expressions in images and vi...
research
08/08/2016

Database of handwritten Arabic mathematical formulas images

Although publicly available, ground-truthed database have proven useful ...
research
01/10/2022

Fully automatic scoring of handwritten descriptive answers in Japanese language tests

This paper presents an experiment of automatically scoring handwritten d...
research
11/04/2020

Answer Identification in Collaborative Organizational Group Chat

We present a simple unsupervised approach for answer identification in o...
research
08/17/2015

A Generative Model for Multi-Dialect Representation

In the era of deep learning several unsupervised models have been develo...

Please sign up or login with your details

Forgot password? Click here to reset