1Cademy @ Causal News Corpus 2022: Leveraging Self-Training in Causality Classification of Socio-Political Event Data

11/04/2022
by   Adam Nik, et al.
0

This paper details our participation in the Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE) workshop @ EMNLP 2022, where we take part in Subtask 1 of Shared Task 3. We approach the given task of event causality detection by proposing a self-training pipeline that follows a teacher-student classifier method. More specifically, we initially train a teacher model on the true, original task data, and use that teacher model to self-label data to be used in the training of a separate student model for the final task prediction. We test how restricting the number of positive or negative self-labeled examples in the self-training process affects classification performance. Our final results show that using self-training produces a comprehensive performance improvement across all models and self-labeled training sets tested within the task of event causality sequence classification. On top of that, we find that self-training performance did not diminish even when restricting either positive/negative examples used in training. Our code is be publicly available at https://github.com/Gzhang-umich/1CademyTeamOfCASE.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/10/2023

Customizing Synthetic Data for Data-Free Student Learning

Data-free knowledge distillation (DFKD) aims to obtain a lightweight stu...
research
09/03/2022

STAD: Self-Training with Ambiguous Data for Low-Resource Relation Extraction

We present a simple yet effective self-training approach, named as STAD,...
research
10/26/2022

Causality Detection using Multiple Annotation Decision

The paper describes the work that has been submitted to the 5th workshop...
research
07/15/2022

Segment-level Metric Learning for Few-shot Bioacoustic Event Detection

Few-shot bioacoustic event detection is a task that detects the occurren...
research
09/12/2023

Self-Training and Multi-Task Learning for Limited Data: Evaluation Study on Object Detection

Self-training allows a network to learn from the predictions of a more c...
research
08/17/2021

Challenges and Applications of Automated Extraction of Socio-political Events from Text (CASE 2021): Workshop and Shared Task Report

This workshop is the fourth issue of a series of workshops on automatic ...

Please sign up or login with your details

Forgot password? Click here to reset