Un duel probabiliste pour départager deux présidents (LIA @ DEFT'2005)

03/11/2019
by   Marc El-Bèze, et al.
0

We present a set of probabilistic models applied to binary classification as defined in the DEFT'05 challenge. The challenge consisted a mixture of two differents problems in Natural Language Processing : identification of author (a sequence of François Mitterrand's sentences might have been inserted into a speech of Jacques Chirac) and thematic break detection (the subjects addressed by the two authors are supposed to be different). Markov chains, Bayes models and an adaptative process have been used to identify the paternity of these sequences. A probabilistic model of the internal coherence of speeches which has been employed to identify thematic breaks. Adding this model has shown to improve the quality results. A comparison with different approaches demostrates the superiority of a strategy that combines learning, coherence and adaptation. Applied to the DEFT'05 data test the results in terms of precision (0.890), recall (0.955) and Fscore (0.925) measure are very promising.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/02/2023

Semantic Coherence Markers for the Early Diagnosis of the Alzheimer Disease

In this work we explore how language models can be employed to analyze l...
research
03/20/2020

Probabilistic learning of boolean functions applied to the binary classification problem with categorical covariates

In this work we cast the problem of binary classification in terms of es...
research
03/08/2000

Coherence, Belief Expansion and Bayesian Networks

We construct a probabilistic coherence measure for information sets whic...
research
03/11/2015

Convolutional Neural Network Architectures for Matching Natural Language Sentences

Semantic matching is of central importance to many natural language task...
research
05/31/2019

Improving Open Information Extraction via Iterative Rank-Aware Learning

Open information extraction (IE) is the task of extracting open-domain a...
research
07/27/2021

Unsupervised Domain Adaptation for Hate Speech Detection Using a Data Augmentation Approach

Online harassment in the form of hate speech has been on the rise in rec...

Please sign up or login with your details

Forgot password? Click here to reset