TechTexC: Classification of Technical Texts using Convolution and Bidirectional Long Short Term Memory Network

12/21/2020
by   Omar Sharif, et al.
0

This paper illustrates the details description of technical text classification system and its results that developed as a part of participation in the shared task TechDofication 2020. The shared task consists of two sub-tasks: (i) first task identify the coarse-grained technical domain of given text in a specified language and (ii) the second task classify a text of computer science domain into fine-grained sub-domains. A classification system (called 'TechTexC') is developed to perform the classification task using three techniques: convolution neural network (CNN), bidirectional long short term memory (BiLSTM) network, and combined CNN with BiLSTM. Results show that CNN with BiLSTM model outperforms the other techniques concerning task-1 of sub-tasks (a, b, c and g) and task-2a. This combined model obtained f1 scores of 82.63 (sub-task a), 81.95 (sub-task b), 82.39 (sub-task c), 84.37 (sub-task g), and 67.44 (task-2a) on the development dataset. Moreover, in the case of test set, the combined CNN with BiLSTM approach achieved that higher accuracy for the subtasks 1a (70.76 (70.14

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/20/2021

An Attention Ensemble Approach for Efficient Text Classification of Indian Languages

The recent surge of complex attention-based deep learning architectures ...
research
08/11/2017

N-gram and Neural Language Models for Discriminating Similar Languages

This paper describes our submission (named clac) to the 2016 Discriminat...
research
03/06/2020

Brazilian Lyrics-Based Music Genre Classification Using a BLSTM Network

Organize songs, albums, and artists in groups with shared similarity cou...
research
05/16/2020

Radial Loss for Learning Fine-grained Video Similarity Metric

In this paper, we propose the Radial Loss which utilizes category and su...
research
04/12/2019

IIT (BHU) Varanasi at MSR-SRST 2018: A Language Model Based Approach for Natural Language Generation

This paper describes our submission system for the Shallow Track of Surf...
research
11/07/2016

AC-BLSTM: Asymmetric Convolutional Bidirectional LSTM Networks for Text Classification

Recently deeplearning models have been shown to be capable of making rem...
research
06/26/2023

Integrating Bidirectional Long Short-Term Memory with Subword Embedding for Authorship Attribution

The problem of unveiling the author of a given text document from multip...

Please sign up or login with your details

Forgot password? Click here to reset