Implementing Deep Learning-Based Approaches for Article Summarization in Indian Languages

12/12/2022
by   Rahul Tangsali, et al.
0

The research on text summarization for low-resource Indian languages has been limited due to the availability of relevant datasets. This paper presents a summary of various deep-learning approaches used for the ILSUM 2022 Indic language summarization datasets. The ISUM 2022 dataset consists of news articles written in Indian English, Hindi, and Gujarati respectively, and their ground-truth summarizations. In our work, we explore different pre-trained seq2seq models and fine-tune those with the ILSUM 2022 datasets. In our case, the fine-tuned SoTA PEGASUS model worked the best for English, the fine-tuned IndicBART model with augmented data for Hindi, and again fine-tuned PEGASUS model along with a translation mapping-based approach for Gujarati. Our scores on the obtained inferences were evaluated using ROUGE-1, ROUGE-2, and ROUGE-4 as the evaluation metrics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/23/2023

Abstractive Text Summarization for Resumes With Cutting Edge NLP Transformers and LSTM

Text summarization is a fundamental task in natural language processing ...
research
08/24/2020

A Baseline Analysis for Podcast Abstractive Summarization

Podcast summary, an important factor affecting end-users' listening deci...
research
12/04/2020

CUED_speech at TREC 2020 Podcast Summarisation Track

In this paper, we describe our approach for the Podcast Summarisation ch...
research
11/08/2020

Bait and Switch: Online Training Data Poisoning of Autonomous Driving Systems

We show that by controlling parts of a physical environment in which a p...
research
05/05/2022

Quantifying Language Variation Acoustically with Few Resources

Deep acoustic models represent linguistic information based on massive a...
research
03/07/2023

ChatGPT: Beginning of an End of Manual Linguistic Data Annotation? Use Case of Automatic Genre Identification

ChatGPT has shown strong capabilities in natural language generation tas...
research
03/19/2023

Bangla Grammatical Error Detection Using T5 Transformer Model

This paper presents a method for detecting grammatical errors in Bangla ...

Please sign up or login with your details

Forgot password? Click here to reset