Uzbek text summarization based on TF-IDF

03/01/2023
by   Khabibulla Madatov, et al.
0

The volume of information is increasing at an incredible rate with the rapid development of the Internet and electronic information services. Due to time constraints, we don't have the opportunity to read all this information. Even the task of analyzing textual data related to one field requires a lot of work. The text summarization task helps to solve these problems. This article presents an experiment on summarization task for Uzbek language, the methodology was based on text abstracting based on TF-IDF algorithm. Using this density function, semantically important parts of the text are extracted. We summarize the given text by applying the n-gram method to important parts of the whole text. The authors used a specially handcrafted corpus called "School corpus" to evaluate the performance of the proposed method. The results show that the proposed approach is effective in extracting summaries from Uzbek language text and can potentially be used in various applications such as information retrieval and natural language processing. Overall, this research contributes to the growing body of work on text summarization in under-resourced languages.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/25/2022

Information Retrieval in Friction Stir Welding of Aluminum Alloys by using Natural Language Processing based Algorithms

Text summarization is a technique for condensing a big piece of text int...
research
12/27/2021

Hamtajoo: A Persian Plagiarism Checker for Academic Manuscripts

In recent years, due to the high availability of electronic documents th...
research
01/18/2021

Neural Abstractive Text Summarizer for Telugu Language

Abstractive Text Summarization is the process of constructing semantical...
research
06/20/2023

One model to rule them all: ranking Slovene summarizers

Text summarization is an essential task in natural language processing, ...
research
10/16/2019

Automated Text Summarization for the Enhancement of Public Services

Natural language processing and machine learning algorithms have been sh...
research
02/05/2016

Utilização de Grafos e Matriz de Similaridade na Sumarização Automática de Documentos Baseada em Extração de Frases

The internet increased the amount of information available. However, the...
research
06/09/2017

Collaborative Summarization of Topic-Related Videos

Large collections of videos are grouped into clusters by a topic keyword...

Please sign up or login with your details

Forgot password? Click here to reset