CRL+: A Novel Semi-Supervised Deep Active Contrastive Representation Learning-Based Text Classification Model for Insurance Data

02/08/2023
by   Amir Namavar Jahromi, et al.
0

Financial sector and especially the insurance industry collect vast volumes of text on a daily basis and through multiple channels (their agents, customer care centers, emails, social networks, and web in general). The information collected includes policies, expert and health reports, claims and complaints, results of surveys, and relevant social media posts. It is difficult to effectively extract label, classify, and interpret the essential information from such varied and unstructured material. Therefore, the Insurance Industry is among the ones that can benefit from applying technologies for the intelligent analysis of free text through Natural Language Processing (NLP). In this paper, CRL+, a novel text classification model combining Contrastive Representation Learning (CRL) and Active Learning is proposed to handle the challenge of using semi-supervised learning for text classification. In this method, supervised (CRL) is used to train a RoBERTa transformer model to encode the textual data into a contrastive representation space and then classify using a classification layer. This (CRL)-based transformer model is used as the base model in the proposed Active Learning mechanism to classify all the data in an iterative manner. The proposed model is evaluated using unstructured obituary data with objective to determine the cause of the death from the data. This model is compared with the CRL model and an Active Learning model with the RoBERTa base model. The experiment shows that the proposed method can outperform both methods for this specific task.

READ FULL TEXT

page 1

page 5

research
07/16/2021

The Application of Active Query K-Means in Text Classification

Active learning is a state-of-art machine learning approach to deal with...
research
12/21/2022

Text classification in shipping industry using unsupervised models and Transformer based supervised models

Obtaining labelled data in a particular context could be expensive and t...
research
01/28/2022

Dominant Set-based Active Learning for Text Classification and its Application to Online Social Media

Recent advances in natural language processing (NLP) in online social me...
research
04/27/2021

Multi-class Text Classification using BERT-based Active Learning

Text Classification finds interesting applications in the pickup and del...
research
05/12/2021

Mining Legacy Issues in Open Pit Mining Sites: Innovation Support of Renaturalization and Land Utilization

Open pit mines left many regions worldwide inhospitable or uninhabitable...
research
09/22/2020

ALICE: Active Learning with Contrastive Natural Language Explanations

Training a supervised neural network classifier typically requires many ...
research
01/20/2020

Early Forecasting of Text Classification Accuracy and F-Measure with Active Learning

When creating text classification systems, one of the major bottlenecks ...

Please sign up or login with your details

Forgot password? Click here to reset