NoReC: The Norwegian Review Corpus

10/15/2017
by   Erik Velldal, et al.
0

This paper presents the Norwegian Review Corpus (NoReC), created for training and evaluating models for document-level sentiment analysis. The full-text reviews have been collected from major Norwegian news sources and cover a range of different domains, including literature, movies, video games, restaurants, music and theater, in addition to product reviews across a range of categories. Each review is labeled with a manually assigned score of 1-6, as provided by the rating of the original author. This first release of the corpus comprises more than 35,000 reviews. It is distributed using the CoNLL-U format, pre-processed using UDPipe, along with a rich set of metadata. The work reported in this paper forms part of the SANT initiative (Sentiment Analysis for Norwegian Text), a project seeking to provide resources and tools for sentiment analysis and opinion mining for Norwegian. As resources for sentiment analysis have so far been unavailable for Norwegian, NoReC represents a highly valuable and sought-after addition to Norwegian language technology.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2023

AlbMoRe: A Corpus of Movie Reviews for Sentiment Analysis in Albanian

Lack of available resources such as text corpora for low-resource langua...
research
02/05/2016

Mining Software Quality from Software Reviews: Research Trends and Open Issues

Software review text fragments have considerably valuable information ab...
research
11/18/2020

Improving Document-Level Sentiment Analysis with User and Product Context

Past work that improves document-level sentiment analysis by encoding us...
research
12/05/2016

The Evolution of Sentiment Analysis - A Review of Research Topics, Venues, and Top Cited Papers

Sentiment analysis is one of the fastest growing research areas in compu...
research
08/24/2016

Semantic descriptions of 24 evaluational adjectives, for application in sentiment analysis

We apply the Natural Semantic Metalanguage (NSM) approach (Goddard and W...
research
05/08/2022

Multi-Domain Targeted Sentiment Analysis

Targeted Sentiment Analysis (TSA) is a central task for generating insig...
research
04/04/2023

Polarity based Sarcasm Detection using Semigraph

Sarcasm is an advanced linguistic expression often found on various onli...

Please sign up or login with your details

Forgot password? Click here to reset