Sentiment Analysis of Citations Using Word2vec

04/01/2017
by   Haixia Liu, et al.
0

Citation sentiment analysis is an important task in scientific paper analysis. Existing machine learning techniques for citation sentiment analysis are focusing on labor-intensive feature engineering, which requires large annotated corpus. As an automatic feature extraction tool, word2vec has been successfully applied to sentiment analysis of short texts. In this work, I conducted empirical research with the question: how well does word2vec work on the sentiment analysis of citations? The proposed method constructed sentence vectors (sent2vec) by averaging the word embeddings, which were learned from Anthology Collections (ACL-Embeddings). I also investigated polarity-specific word embeddings (PS-Embeddings) for classifying positive and negative citations. The sentence vectors formed a feature space, to which the examined citation sentence was mapped to. Those features were input into classifiers (support vector machines) for supervised classification. Using 10-cross-validation scheme, evaluation was conducted on a set of annotated citations. The results showed that word embeddings are effective on classifying positive and negative citations. However, hand-crafted features performed better for the overall classification.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2017

Automatic Argumentative-Zoning Using Word2vec

In comparison with document summarization on the articles from social me...
research
05/10/2020

Article citation study: Context enhanced citation sentiment detection

Citation sentimet analysis is one of the little studied tasks for scient...
research
10/07/2019

SentiCite: An Approach for Publication Sentiment Analysis

With the rapid growth in the number of scientific publications, year aft...
research
11/01/2019

Efficient Feature Selection techniques for Sentiment Analysis

Sentiment analysis is a domain of study that focuses on identifying and ...
research
04/16/2021

Citations are not opinions: a corpus linguistics approach to understanding how citations are made

Citation content analysis seeks to understand citations based on the lan...
research
12/31/2019

Revisiting Paraphrase Question Generator using Pairwise Discriminator

In this paper, we propose a method for obtaining sentence-level embeddin...
research
01/05/2020

Generating Word and Document Embeddings for Sentiment Analysis

Sentiments of words differ from one corpus to another. Inducing general ...

Please sign up or login with your details

Forgot password? Click here to reset