Understanding Chat Messages for Sticker Recommendation in Hike Messenger

02/07/2019
by   Abhishek Laddha, et al.
0

Stickers are popularly used in messaging apps such as Hike to visually express a nuanced range of thoughts and utterances and convey exaggerated emotions. However, discovering the right sticker at the right time in a chat from a large and ever expanding pool of stickers can be cumbersome. In this paper, we describe a system for recommending stickers as users chat based on what the user is typing and the conversational context. We decompose the sticker recommendation problem into two steps. First, we predict the next message that the user is likely to send in the chat. Second, we substitute the predicted message with an appropriate sticker. Majority of Hike's users transliterate messages from their native language to English. This leads to numerous orthographic variations of the same message and thus complicates message prediction. To address this issue, we cluster the messages that have the same meaning and predict the message cluster instead of the message. We experiment with different approaches to train embedding for chat messages and study their efficacy in learning similar dense representations for messages that have the same intent. We propose a novel hybrid message prediction model, which can run with low latency on low end phones that have severe computational limitations.

READ FULL TEXT
research
03/05/2022

ECMG: Exemplar-based Commit Message Generation

Commit messages concisely describe the content of code diffs (i.e., code...
research
05/11/2017

Content-based Approach for Vietnamese Spam SMS Filtering

Short Message Service (SMS) spam is a serious problem in Vietnam because...
research
10/12/2021

Impact of delay classes on the data structure in IOTA

In distributed ledger technologies (DLTs) with a directed acyclic graph ...
research
08/25/2023

The Arrangement of Marks Impacts Afforded Messages: Ordering, Partitioning, Spacing, and Coloring in Bar Charts

Data visualizations present a massive number of potential messages to an...
research
03/27/2021

Abuse is Contextual, What about NLP? The Role of Context in Abusive Language Annotation and Detection

The datasets most widely used for abusive language detection contain lis...
research
07/07/2021

POSLAN: Disentangling Chat with Positional and Language encoded Post Embeddings

Most online message threads inherently will be cluttered and any new use...
research
01/20/2014

A Genetic Algorithm to Optimize a Tweet for Retweetability

Twitter is a popular microblogging platform. When users send out message...

Please sign up or login with your details

Forgot password? Click here to reset