Document understanding and information extraction include different task...
Natural language understanding typically maps single utterances to a dua...
Text Classification is the most essential and fundamental problem in Nat...
Document-based Visual Question Answering examines the document understan...
Compared to general document analysis tasks, form document structure
und...
There has been growing interest in applying NLP techniques in the financ...
When a human communicates with a machine using natural language on the w...
We propose a PiggyBack, a Visual Question Answering platform that allows...
In-game toxic language becomes the hot potato in the gaming industry and...
Scene Graph Generation (SGG) serves a comprehensive representation of th...
Collaborative filtering problems are commonly solved based on matrix
com...
Online Hate speech detection has become important with the growth of dig...
Recognizing the layout of unstructured digital documents is crucial when...
Attention mechanism has been used as an important component across
Visio...
For preventing youth suicide, social media platforms have received much
...
Text classification aims to assign labels to textual units by making use...
We propose V-Doc, a question-answering tool using document images and PD...
Pretrained models have produced great success in both Computer Vision (C...
Compared to sequential learning models, graph-based neural networks exhi...
Graph Convolutional Networks (GCN) have been effective at tasks that hav...
Intent classification and slot filling are two critical tasks for natura...
Recommender systems typically operate on high-dimensional sparse user-it...
The Federal Reserve System (the Fed) plays a significant role in affecti...
Traditional toxicity detection models have focused on the single utteran...
As an essential component of human cognition, cause-effect relations app...
Online abusive language detection (ALD) has become a societal issue of
i...
Text-to-image multimodal tasks, generating/retrieving an image from a gi...
Visual question answering (VQA) is a challenging multi-modal task that
r...