The EcoLexicon English Corpus as an open corpus in Sketch Engine

07/16/2018
by Pilar Leon-Arauz, et al.

The EcoLexicon English Corpus (EEC) is a 23.1-million-word corpus of contemporary environmental texts. It was compiled by the LexiCon research group for the development of EcoLexicon (Faber, Leon-Arauz & Reimerink 2016; San Martin et al. 2017), a terminological knowledge base on the environment. It is available as an open corpus in the corpus query system Sketch Engine (Kilgarriff et al. 2014), which means that any user, even without a subscription, can freely access and query it. This paper introduces the EEC by describing how it was compiled and how it can be queried and exploited, drawing both on the functionalities provided by Sketch Engine and on the parameters according to which the texts in the EEC are classified.


