Representation learning has significantly driven the field to develop
pr...
The rapid advancement and widespread use of large language models (LLMs)...
Training data attribution (TDA) techniques find influential training dat...
Large Language Models (LLMs) are transforming society and permeating int...
Supervised learning of image classifiers distills human knowledge into a...
Contrastively trained encoders have recently been proven to invert the
d...
Neural networks trained with ERM (empirical risk minimization) sometimes...
The favorable performance of Vision Transformers (ViTs) is often attribu...
The great success of machine learning with massive amounts of data comes...
Image-Test matching (ITM) is a common task for evaluating the quality of...
Weakly supervised semantic segmentation (WSSS) methods are often built o...
Data augmentation has been an important ingredient for boosting performa...
Deep neural networks (DNNs) often rely on easy-to-learn discriminatory
f...
The class activation mapping, or CAM, has been the cornerstone of featur...
Effective control and prediction of dynamical systems often require
appr...
Vision Transformer (ViT) extends the application range of transformers f...
Cross-modal retrieval methods build a common representation space for sa...
ImageNet has been arguably the most popular image classification benchma...
State-of-the-art video action classifiers often suffer from overfitting....
Weakly-supervised object localization (WSOL) has gained popularity over ...
Normalization techniques, such as batch normalization (BN), have led to
...
Despite apparent human-level performances of deep neural networks (DNN),...
Devising indicative evaluation metrics for the image generation task rem...
Weakly-supervised object localization (WSOL) has gained popularity over ...
Scene text recognition (STR) is the task of recognizing character sequen...
Many machine learning algorithms are trained and evaluated by splitting ...
Regional dropout strategies have been proposed to enhance the performanc...
Many new proposals for scene text recognition (STR) models have been
int...
Instance embeddings are an efficient and versatile image representation ...
Reinforcement learning (RL) has advanced greatly in the past few years w...
Machine Learning techniques are widely used by online services (e.g. Goo...
As more and more personal photos are shared online, being able to obfusc...
Many deployed learned models are black boxes: given input, returns outpu...
People nowadays share large parts of their personal lives through social...
Learning how to generate descriptions of images or videos received major...
Users like sharing personal photos with others through social media. At ...
There have been remarkable improvements in the semantic labelling task i...
As we shift more of our lives into the virtual domain, the volume of dat...
Recognising persons in everyday photos presents major challenges (occlud...