Interactive Semantic Featuring for Text Classification

06/24/2016
by Camille Jandot, et al.

In text classification, dictionaries can be used to define human-comprehensible features. We propose an improvement to dictionary features called smoothed dictionary features. These features recognize document contexts instead of n-grams. We describe a principled methodology to solicit dictionary features from a teacher, and present results showing that models built using these human-comprehensible features are competitive with models trained with Bag of Words features.
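To make the idea of a dictionary feature concrete, here is a minimal sketch of a plain, unsmoothed one: a feature that counts how many tokens of a document belong to a teacher-supplied word list. It does not implement the smoothed dictionary features proposed in the paper, which the abstract does not specify. The function name, the tokenization rule, and the example dictionary are all assumptions made for illustration.

```python
import re


def dictionary_feature(document, dictionary):
    """Count the tokens of `document` that appear in `dictionary`.

    Illustrative only: a plain dictionary feature, not the smoothed
    variant described in the paper.
    """
    # Assumed tokenization: lowercase runs of letters, digits, apostrophes.
    tokens = re.findall(r"[a-z0-9']+", document.lower())
    return sum(1 for token in tokens if token in dictionary)


# Hypothetical dictionary a teacher might supply for a "sports" concept.
SPORTS_TERMS = {"goal", "match", "striker", "league"}

docs = [
    "The striker scored a late goal in the league match.",
    "The central bank raised interest rates again.",
]

# One human-comprehensible feature value per document.
features = [dictionary_feature(doc, SPORTS_TERMS) for doc in docs]
print(features)
```

Because the feature is tied to a named word list, a person can read off why a document scored high or low, which is the sense in which such features are human-comprehensible.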


