We propose NECA, a deep representation learning method for categorical d...
Language models (LMs) have made remarkable progress, but still struggle ...
Named entity disambiguation (NED), which involves mapping textual mentio...
Hard parameter sharing for multi-task learning is widely used in empiric...
A challenge for named entity disambiguation (NED), the task of mapping
t...
Our goal is to enable machine learning systems to be trained interactive...
We investigate multi-task learning approaches that use a shared feature
...
Data augmentation is a powerful technique to improve performance in
appl...
A popular way to estimate the causal effect of a variable x on y from
ob...
Many industrial machine learning (ML) systems require frequent retrainin...
In real-world machine learning applications, data subsets correspond to
...
Labeling training data is increasingly the largest bottleneck in deployi...
Sparse coding (SC) is attracting more and more attention due to its
comp...
We present SwellShark, a framework for building biomedical named entity
...
Large labeled training sets are the critical building blocks of supervis...
Populating a database with unstructured information is a long-standing
p...