Data augmentation (DA) is widely used to improve the generalization of n...
This technical report presents AutoGen, a new framework that enables
dev...
Recent advances in large language models (LLMs) have demonstrated notabl...
Long-tailed recognition with imbalanced class distribution naturally eme...
In the last year alone, a surge of new benchmarks to measure composition...
Active learning (AL) aims to minimize the annotation cost by only queryi...
Machine learning tasks over image databases often generate masks that
an...
Large multimodal datasets have been instrumental in recent breakthroughs...
Time-series anomaly detection is an important task and has been widely
a...
Existing graph contrastive learning (GCL) typically requires two forward...
To obtain a large amount of training labels inexpensively, researchers h...
Programmatic Weak Supervision (PWS) has emerged as a widespread paradigm...
To create a large amount of training labels for machine learning models
...
To reduce the human annotation efforts, the programmatic weak supervisio...
Deep reinforcement learning gives the promise that an agent learns good
...
Programmatic Weak Supervision (PWS) aggregates the source votes of multi...
Graph contrastive learning (GCL) is the most representative and prevalen...
Graphs are ubiquitous in encoding relational information of real-world
o...
Weak Supervision (WS) techniques allow users to efficiently create large...
Labeling training data has become one of the major roadblocks to using
m...
Taxonomies are fundamental to many real-world applications in various
do...
Despite the great success of pre-trained language models (LMs) in many
n...
Creating labeled training sets has become one of the major roadblocks in...
Recent Weak Supervision (WS) approaches have had widespread success in
e...
Classification is one of the most important problems in machine learning...