Hierarchical classification of e-commerce related social media

11/26/2015
by Matthew Long, et al.

In this paper, we attempt to classify tweets into root categories of the Amazon browse node hierarchy using a set of tweets with browse node ID labels, a much larger set of tweets without labels, and a set of Amazon reviews. Examining Twitter data presents unique challenges: the samples are short (under 140 characters) and often contain misspellings or abbreviations that are trivial for a human to decipher but difficult for a computer to parse. We implement a variety of query and document expansion techniques in an effort to improve information retrieval, with modest success.
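To make the idea of query expansion concrete, the following is a minimal sketch and not the authors' method, which the abstract does not detail. It assumes a pseudo-relevance-feedback style of expansion: a short tweet is augmented with terms that co-occur with it in an unlabeled collection, then assigned to the category whose review text overlaps most with the expanded terms. All function names, category names, and sample texts are hypothetical and chosen only for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def expand_query(query, corpus, top_k=3):
    """Add the top_k most frequent new terms from corpus documents
    that share at least one term with the query (hypothetical helper)."""
    q_terms = set(tokenize(query))
    counts = Counter()
    for doc in corpus:
        d_terms = tokenize(doc)
        if q_terms & set(d_terms):
            counts.update(t for t in d_terms if t not in q_terms)
    # Sort by frequency (descending), then alphabetically, for determinism.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return tokenize(query) + [t for t, _ in ranked[:top_k]]


def classify(terms, category_docs):
    """Return the category whose documents contain the given terms most often."""
    scores = {}
    for category, docs in category_docs.items():
        vocab = Counter(t for d in docs for t in tokenize(d))
        scores[category] = sum(vocab[t] for t in terms)
    # Ties are broken alphabetically by category name.
    return max(sorted(scores), key=scores.get)


# Toy stand-ins for the unlabeled tweets and the Amazon reviews (made up).
unlabeled_tweets = [
    "new lens for my dslr",
    "dslr camera lens sale",
    "love this novel",
]
reviews_by_category = {
    "Electronics": ["great camera and lens", "camera battery lasts long"],
    "Books": ["a gripping novel", "this novel is long"],
}

tweet = "got a dslr"
plain_label = classify(tokenize(tweet), reviews_by_category)
expanded_label = classify(expand_query(tweet, unlabeled_tweets), reviews_by_category)
```

In this toy setup the unexpanded tweet shares only the word "a" with the review text, so it is matched to the wrong category, while the expanded tweet picks up "lens" and "camera" from the unlabeled collection and is matched to the electronics category. A real system would at least remove stop words and weight terms (for example with TF-IDF) rather than use raw counts.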


