Data Collection and Analysis of French Dialects

08/01/2022
by   Omar Shaur Choudhry, et al.
0

This paper discusses creating and analysing a new dataset for data mining and text analytics research, contributing to a joint Leeds University research project for the Corpus of National Dialects. This report investigates machine learning classifiers to classify samples of French dialect text across various French-speaking countries. Following the steps of the CRISP-DM methodology, this report explores the data collection process, data quality issues and data conversion for text analysis. Finally, after applying suitable data mining techniques, the evaluation methods, best overall features and classifiers and conclusions are discussed.

READ FULL TEXT
research
01/29/2020

Proceedings of Symposium on Data Mining Applications 2014

The Symposium on Data Mining and Applications (SDMA 2014) is aimed to ga...
research
12/19/2022

Very Large Language Model as a Unified Methodology of Text Mining

Text data mining is the process of deriving essential information from l...
research
01/10/2012

Pbm: A new dataset for blog mining

Text mining is becoming vital as Web 2.0 offers collaborative content cr...
research
03/04/2015

The concept "altruism" for sociological research: from conceptualization to operationalization

This article addresses the question of the relevant conceptualization of...
research
08/26/2020

Data Mining Approach to Analyze Covid19 Dataset of Brazilian Patients

The pandemic originated by coronavirus(covid-19), name coined by World H...
research
05/28/2018

Core Conflictual Relationship: Text Mining to Discover What and When

Following detailed presentation of the Core Conflictual Relationship The...

Please sign up or login with your details

Forgot password? Click here to reset