Native Language Identification using Stacked Generalization

03/19/2017
by   Shervin Malmasi, et al.
0

Ensemble methods using multiple classifiers have proven to be the most successful approach for the task of Native Language Identification (NLI), achieving the current state of the art. However, a systematic examination of ensemble methods for NLI has yet to be conducted. Additionally, deeper ensemble architectures such as classifier stacking have not been closely evaluated. We present a set of experiments using three ensemble-based models, testing each with multiple configurations and algorithms. This includes a rigorous application of meta-classification models for NLI, achieving state-of-the-art results on three datasets from different languages. We also present the first use of statistical significance testing for comparing NLI systems, showing that our results are significantly better than the previous state of the art. We make available a collection of test set predictions to facilitate future statistical tests.

READ FULL TEXT

page 7

page 8

page 22

page 25

research
07/22/2017

Native Language Identification on Text and Speech

This paper presents an ensemble system combining the output of multiple ...
research
07/16/2017

Open-Set Language Identification

We present the first open-set language identification experiments using ...
research
04/15/2021

HIVE-COTE 2.0: a new meta ensemble for time series classification

The Hierarchical Vote Collective of Transformation-based Ensembles (HIVE...
research
01/30/2021

Hellinger Distance Weighted Ensemble for Imbalanced Data Stream Classification

The imbalanced data classification remains a vital problem. The key is t...
research
11/18/2022

Scaling Native Language Identification with Transformer Adapters

Native language identification (NLI) is the task of automatically identi...
research
08/13/2023

An Ensemble Approach to Question Classification: Integrating Electra Transformer, GloVe, and LSTM

This paper introduces a novel ensemble approach for question classificat...
research
06/05/2020

"To Target or Not to Target": Identification and Analysis of Abusive Text Using Ensemble of Classifiers

With rising concern around abusive and hateful behavior on social media ...

Please sign up or login with your details

Forgot password? Click here to reset