Twitter-based traffic information system based on vector representations for words

12/04/2018
by   Sina Dabiri, et al.
0

Recently, researchers have shown an increased interest in harnessing Twitter data for dynamic monitoring of traffic conditions. Bag-of-words representation is a common method in literature for tweet modeling and retrieving traffic information, yet it suffers from the curse of dimensionality and sparsity. To address these issues, our specific objective is to propose a simple and robust framework on the top of word embedding for distinguishing traffic-related tweets against non-traffic-related ones. In our proposed model, a tweet is classified as traffic-related if semantic similarity between its words and a small set of traffic keywords exceeds a threshold value. Semantic similarity between words is captured by means of word-embedding models, which is an unsupervised learning tool. The proposed model is as simple as having only one trainable parameter. The model takes advantage of outstanding merits, which are demonstrated through several evaluation steps. The state-of-the-art test accuracy for our proposed model is 95.9

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2020

Attention Word Embedding

Word embedding models learn semantically rich vector representations of ...
research
12/23/2021

Traffic event description based on Twitter data using Unsupervised Learning Methods for Indian road conditions

Non-recurrent and unpredictable traffic events directly influence road t...
research
07/25/2020

Effect of Text Processing Steps on Twitter Sentiment Classification using Word Embedding

Processing of raw text is the crucial first step in text classification ...
research
08/23/2020

Augmenting Semantic Representation of Depressive Language: from Forums to Microblogs

We discuss and analyze the process of creating word embedding feature re...
research
06/20/2016

Uncertainty in Neural Network Word Embedding: Exploration of Threshold for Similarity

Word embedding, specially with its recent developments, promises a quant...
research
03/02/2017

Discovery of Evolving Semantics through Dynamic Word Embedding Learning

During the course of human language evolution, the semantic meanings of ...
research
05/10/2021

Measuring Economic Policy Uncertainty Using an Unsupervised Word Embedding-based Method

Economic Policy Uncertainty (EPU) is a critical indicator in economic st...

Please sign up or login with your details

Forgot password? Click here to reset