Text classification based on ensemble extreme learning machine

05/10/2018
by   Ming Li, et al.
0

In this paper, we propose a novel approach based on cost-sensitive ensemble weighted extreme learning machine; we call this approach AE1-WELM. We apply this approach to text classification. AE1-WELM is an algorithm including balanced and imbalanced multiclassification for text classification. Weighted ELM assigning the different weights to the different samples improves the classification accuracy to a certain extent, but weighted ELM considers the differences between samples in the different categories only and ignores the differences between samples within the same categories. We measure the importance of the documents by the sample information entropy, and generate cost-sensitive matrix and factor based on the document importance, then embed the cost-sensitive weighted ELM into the AdaBoost.M1 framework seamlessly. Vector space model(VSM) text representation produces the high dimensions and sparse features which increase the burden of ELM. To overcome this problem, we develop a text classification framework combining the word vector and AE1-WELM. The experimental results show that our method provides an accurate, reliable and effective solution for text classification.

READ FULL TEXT
POST COMMENT

Comments

There are no comments yet.

Authors

page 1

page 2

page 3

page 4

09/23/2010

A hybrid learning algorithm for text classification

Text classification is the process of classifying documents into predefi...
11/28/2016

Improving Multi-Document Summarization via Text Classification

Developed so far, multi-document summarization has reached its bottlenec...
09/28/2019

W-RNN: News text classification based on a Weighted RNN

Most of the information is stored as text, so text mining is regarded as...
05/14/2018

Utilizing Probase in Open Directory Project-based Text Classification

Open Directory Project (ODP) has been successfully utilized in text clas...
10/22/2020

Classification with Rejection Based on Cost-sensitive Classification

The goal of classification with rejection is to avoid risky misclassific...
05/03/2014

Kaggle LSHTC4 Winning Solution

Our winning submission to the 2014 Kaggle competition for Large Scale Hi...
11/14/2021

"Will You Find These Shortcuts?" A Protocol for Evaluating the Faithfulness of Input Salience Methods for Text Classification

Feature attribution a.k.a. input salience methods which assign an import...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.