A Novel Approach to Document Classification using WordNet

10/04/2015
by Koushiki Sarkar, et al.

Content-based document classification is one of the biggest challenges in free-text mining. Current document classification algorithms mostly rely on cluster analysis built on the bag-of-words approach. That method is still applied to many modern scientific problems, and it has established a strong enough presence in fields such as economics and social science to merit serious attention from researchers. In this paper we propose and explore an alternative grounded more securely in the dictionary classification and correlatedness of words and phrases. We expect that applying existing knowledge about the underlying classification structure may improve the classifier's performance.
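
As a rough illustration of why a dictionary-grounded alternative might help, the sketch below builds plain bag-of-words count vectors and compares them with cosine similarity. It is not the paper's method: the tokenizer, the helper names `bag_of_words` and `cosine_similarity`, and the example sentences are all assumptions made for illustration. It only shows that two texts built from related but different words share no bag-of-words features.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Lowercase the text, split on whitespace, and count each token."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine similarity between two sparse token-count vectors."""
    # Only tokens present in both documents contribute to the dot product.
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical example documents (not taken from the paper).
doc_a = bag_of_words("the physician treated the patient")
doc_b = bag_of_words("the doctor treated the patient")
doc_c = bag_of_words("physician heals illness")
doc_d = bag_of_words("doctor cures disease")

# Shared surface tokens give a high score even though one word differs.
sim_ab = cosine_similarity(doc_a, doc_b)

# No shared surface tokens, so the score is zero despite related meaning.
sim_cd = cosine_similarity(doc_c, doc_d)

print(sim_ab, sim_cd)
```

The second pair scores zero because bag-of-words treats "physician" and "doctor" as unrelated features. A lexical resource such as WordNet records relations between such words, which is the kind of prior knowledge the abstract proposes to exploit.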


