TeleCrowd: A Crowdsourcing Approach to Create Informal to Formal Text Corpora

04/24/2020
by   Vahid Masoumi, et al.
0

Crowdsourcing has been widely used recently as an alternative to traditional annotations that is costly and usually done by experts. However, crowdsourcing tasks are not interesting by themselves, therefore, combining tasks with game will increase both participants motivation and engagement. In this paper, we have proposed a gamified crowdsourcing platform called TeleCrowd based on Telegram Messenger to use its social power as a base platform and facilitator for accomplishing crowdsourcing projects. Furthermore, to evaluate the performance of the proposed platform, we ran an experimental crowdsourcing project consisting of 500 informal Persian sentences in which participants were supposed to provide candidates that were the formal equivalent of sentences or qualify other candidates by upvoting or downvoting them. In this study, 2700 candidates and 21000 votes were submitted by the participants and a parallel dataset using candidates with the highest points, sum of their upvotes and downvotes, as the best candidates was built. As the evaluation, BLEU score of 0.54 was achieved on the collected dataset which shows that our proposed platform can be used to create large corpora. Also, this platform is highly efficient in terms of time period and cost price in comparison with other related works, because the whole duration of the project was 28 days at a cost of 40 dollars.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/07/2022

Older Adults' Motivation and Engagement with Diverse Crowdsourcing Citizen Science Tasks

In this exploratory study we evaluated the engagement, performance and p...
research
12/21/2020

RC-chain: Reputation-based Crowdsourcing Blockchain for Vehicular Networks

As the commercial use of 5G technologies has grown more prevalent, smart...
research
07/23/2023

Milimili. Collecting Parallel Data via Crowdsourcing

We present a methodology for gathering a parallel corpus through crowdso...
research
06/15/2021

Benchmark dataset of memes with text transcriptions for automatic detection of multi-modal misogynistic content

In this paper we present a benchmark dataset generated as part of a proj...
research
07/29/2019

Small Profits and Quick Returns: An Incentive Mechanism Design for IoT-based Crowdsourcing under Continuous Platform Competition

Crowdsourcing can be applied to the Internet-of-Things (IoT) systems to ...
research
12/19/2019

Developing a Multi-Platform Speech Recording System Toward Open Service of Building Large-Scale Speech Corpora

This paper briefly reports our ongoing attempt at the development of a m...
research
06/11/2023

To Save Crowdsourcing from Cheap-Talk: Strategic Learning from Biased Users

Today many users are invited by a crowdsourcing platform (e.g., TripAdvi...

Please sign up or login with your details

Forgot password? Click here to reset