Evolving Character-level Convolutional Neural Networks for Text Classification

12/03/2020
by   Trevor Londt, et al.
0

Character-level convolutional neural networks (char-CNN) require no knowledge of the semantic or syntactic structure of the language they classify. This property simplifies its implementation but reduces its classification accuracy. Increasing the depth of char-CNN architectures does not result in breakthrough accuracy improvements. Research has not established which char-CNN architectures are optimal for text classification tasks. Manually designing and training char-CNNs is an iterative and time-consuming process that requires expert domain knowledge. Evolutionary deep learning (EDL) techniques, including surrogate-based versions, have demonstrated success in automatically searching for performant CNN architectures for image analysis tasks. Researchers have not applied EDL techniques to search the architecture space of char-CNNs for text classification tasks. This article demonstrates the first work in evolving char-CNN architectures using a novel EDL algorithm based on genetic programming, an indirect encoding and surrogate models, to search for performant char-CNN architectures automatically. The algorithm is evaluated on eight text classification datasets and benchmarked against five manually designed CNN architecture and one long short-term memory (LSTM) architecture. Experiment results indicate that the algorithm can evolve architectures that outperform the LSTM in terms of classification accuracy and five of the manually designed CNN architectures in terms of classification accuracy and parameter count.

READ FULL TEXT

page 1

page 8

research
12/03/2020

Evolving Character-Level DenseNet Architectures using Genetic Programming

DenseNet architectures have demonstrated impressive performance in image...
research
10/28/2018

Automatically Evolving CNN Architectures Based on Blocks

The performance of Convolutional Neural Networks (CNNs) highly relies on...
research
03/13/2023

SA-CNN: Application to text categorization issues using simulated annealing-based convolutional neural network optimization

Convolutional neural networks (CNNs) are a representative class of deep ...
research
06/05/2022

Performance Comparison of Simple Transformer and Res-CNN-BiLSTM for Cyberbullying Classification

The task of text classification using Bidirectional based LSTM architect...
research
12/07/2021

Evaluating Generic Auto-ML Tools for Computational Pathology

Image analysis tasks in computational pathology are commonly solved usin...
research
04/04/2019

Text Classification Components for Detecting Descriptions and Names of CAD models

We apply text analysis approaches for a specialized search engine for 3D...
research
02/16/2020

ArcText: An Unified Text Approach to Describing Convolutional Neural Network Architectures

Numerous Convolutional Neural Network (CNN) models have demonstrated the...

Please sign up or login with your details

Forgot password? Click here to reset