
-
Minimally-Supervised Structure-Rich Text Categorization via Learning on Text-Rich Networks
Text categorization is an essential task in Web content analysis. Consid...
read it
-
Data Profiling for Adversarial Training: On the Ruin of Problematic Data
Multiple intriguing problems hover in adversarial training, including ro...
read it
-
Sensei: Self-Supervised Sensor Name Segmentation
A sensor name, typically an alphanumeric string, encodes the key context...
read it
-
X-Class: Text Classification with Extremely Weak Supervision
In this paper, we explore to conduct text classification with extremely ...
read it
-
Overfitting or Underfitting? Understand Robustness Drop in Adversarial Training
Our goal is to understand why the robustness drops after conducting adve...
read it
-
User-Guided Aspect Classification for Domain-Specific Texts
Aspect classification, identifying aspects of text segments, facilitates...
read it
-
SetExpan: Corpus-Based Set Expansion via Context Feature Selection and Rank Ensemble
Corpus-based set expansion (i.e., finding the "complete" set of entities...
read it
-
FUSE: Multi-Faceted Set Expansion by Coherent Clustering of Skip-grams
Set expansion aims to expand a small set of seed entities into a complet...
read it
-
CrossWeigh: Training Named Entity Tagger from Imperfect Annotations
Everyone makes mistakes. So do human annotators when curating labels for...
read it
-
Raw-to-End Name Entity Recognition in Social Media
Taking word sequences as the input, typical named entity recognition (NE...
read it
-
Learning Named Entity Tagger using Domain-Specific Dictionary
Recent advances in deep neural models allow us to build reliable named e...
read it
-
Entity Set Search of Scientific Literature: An Unsupervised Ranking Approach
Literature search is critical for any scientific research. Different fro...
read it
-
Open Information Extraction with Global Structure Constraints
Extracting entities and their relations from text is an important task f...
read it
-
Integrating Local Context and Global Cohesiveness for Open Information Extraction
Extracting entities and their relations from text is an important task f...
read it
-
Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling
Many efforts have been made to facilitate natural language processing ta...
read it
-
Investigating Rumor News Using Agreement-Aware Search
In recent years, rumor news has been generated by humans as well as robo...
read it
-
Cross-type Biomedical Named Entity Recognition with Deep Multi-Task Learning
Motivation: Biomedical named entity recognition (BioNER) is the most fun...
read it
-
An Attention-based Collaboration Framework for Multi-View Network Representation Learning
Learning distributed node representations in networks has been attractin...
read it
-
Empower Sequence Labeling with Task-Aware Neural Language Model
Linguistic sequence labeling is a general modeling approach that encompa...
read it
-
MetaPAD: Meta Pattern Discovery from Massive Text Corpora
Mining textual patterns in news, tweets, papers, and many other kinds of...
read it
-
Automated Phrase Mining from Massive Text Corpora
As one of the fundamental tasks in text analysis, phrase mining aims at ...
read it
-
DPPred: An Effective Prediction Framework with Concise Discriminative Patterns
In the literature, two series of models have been proposed to address pr...
read it
-
A Parallel and Efficient Algorithm for Learning to Match
Many tasks in data mining and related fields can be formalized as matchi...
read it