The rapid advancement and widespread use of large language models (LLMs)...
Image classification has improved with the development of training
techn...
In order to build self-consistent personalized dialogue agents, previous...
Advances in Large Language Models (LLMs) have inspired a surge of resear...
Large language models for code have recently shown remarkable performanc...
Recently, large-scale vision-language pre-training models and visual sem...
Recovering 3D human mesh in the wild is greatly challenging as in-the-wi...
Supervised learning of image classifiers distills human knowledge into a...
This paper proposes a novel diffusion-based model, CompoDiff, for solvin...
We need billion-scale images to achieve more generalizable and
ground-br...
Diagnosing and cleaning datasets are crucial for building robust machine...
Vision Transformer (ViT) extracts the final representation from either c...
We propose the first unified theoretical analysis of mixed sample data
a...
Data augmentation has recently emerged as an essential component of mode...
The great success of machine learning with massive amounts of data comes...
Weakly supervised semantic segmentation (WSSS) methods are often built o...
Recent studies have demonstrated that gradient matching-based dataset
sy...
The problem of class imbalanced data lies in that the generalization
per...
Understanding document images (e.g., invoices) has been an important res...
Image-mixing augmentations (e.g., Mixup or CutMix), which typically mix ...
Deep neural networks (DNNs) often rely on easy-to-learn discriminatory
f...
Modern image files are usually progressively transmitted and provide a
p...
Weakly-supervised object localization (WSOL) enables finding an object u...
Vision Transformer (ViT) extends the application range of transformers f...
ImageNet has been arguably the most popular image classification benchma...
State-of-the-art video action classifiers often suffer from overfitting....
This paper addresses representational bottleneck in a network and propos...
Normalization techniques, such as batch normalization (BN), have led to
...
Despite apparent human-level performances of deep neural networks (DNN),...
We propose a generic confidence-based approximation that can be plugged ...
Many machine learning algorithms are trained and evaluated by splitting ...
This paper proposes a new high dimensional regression method by merging
...
In this paper, we propose a new multi-scale face detector having an extr...
Regional dropout strategies have been proposed to enhance the performanc...
Scene text detection methods based on neural networks have emerged recen...
Many new proposals for scene text recognition (STR) models have been
int...
We investigate the design aspects of feature distillation methods achiev...
The semantic segmentation requires a lot of computational cost. The dila...
An activation boundary for a neuron refers to a separating hyperplane th...
Many recent works on knowledge distillation have provided ways to transf...
Many recent works on knowledge distillation have provided ways to transf...
We propose a new context-aware correlation filter based tracking framewo...
This paper proposes a new algorithm for controlling classification resul...
We introduce a new problem of generating an image based on a small numbe...