Dialect identification is a critical task in speech processing and langu...
Masked image modeling has been demonstrated as a powerful pretext task f...
We propose JEDI, a multi-dataset semi-supervised learning method, which
...
Text-to-image diffusion models such as Stable Diffusion have recently
at...
We propose a novel cascaded cross-modal transformer (CCMT) that combines...
We propose an efficient abnormal event detection model based on a lightw...
The performance of neural networks in content-based image retrieval (CBI...
To date, query performance prediction (QPP) in the context of content-ba...
We present a novel corpus for French dialect identification comprising
4...
Can we leverage the audiovisual information already present in video to
...
We propose a very fast frame-level model for anomaly detection in video,...
Medical image segmentation is an actively studied task in medical imagin...
Anomaly detection has recently gained increasing attention in the field ...
Denoising diffusion models represent a recent emerging topic in computer...
A self-supervised multi-task learning (SSMTL) framework for video anomal...
The DarkWeb represents a hotbed for illicit activity, where users commun...
Most curriculum learning methods require an approach to sort the data sa...
Super-resolving medical images can help physicians in providing more acc...
Following the successful application of vision transformers in multiple
...
During the training process, deep neural networks implicitly learn to
re...
Recent studies revealed that convolutional neural networks do not genera...
An important preliminary step of optical character recognition systems i...
We study a series of recognition tasks in two realistic scenarios requir...
Anomaly detection is commonly pursued as a one-class classification prob...
Detecting abnormal events in video is commonly framed as a one-class
cla...
We propose a novel approach to translate unpaired contrast computed
tomo...
The COVID-19 pandemic raises the problem of adapting face recognition sy...
We propose contextual convolution (CoConv) for visual recognition. CoCon...
We propose an enhanced version of the Authentication with Built-in Camer...
In this work, we introduce a corpus for satire detection in Romanian new...
In this paper, we introduce FreSaDa, a French Satire Data Set, which is
...
Combining multiple machine learning models into an ensemble is known to
...
We explore different curriculum learning methods for training convolutio...
In this work, we describe our approach addressing the Social Media Varie...
Training machine learning models in a meaningful order, from the easy sa...
Romanian is one of the understudied languages in computational linguisti...
The class distribution of data is one of the factors that regulates the
...
Anomaly detection in video is a challenging computer vision problem. Due...
We study the task of replicating the functionality of black-box neural
m...
Deep learning methods for automotive radar interference mitigation can
s...
In this work, we introduce the methods proposed by the UnibucKernel team...
We propose a deep learning method to automatically detect personal prote...
For the time being, mobile devices employ implicit authentication mechan...
Nowadays, commonly-used authentication systems for mobile device users, ...
Abnormal event detection in video is a complex computer vision problem t...
Radar sensors are gradually becoming a wide-spread equipment for road
ve...
In this paper, we study the task of facial expression recognition under
...
In this paper, we present our system for the RSNA Intracranial Hemorrhag...
In this work, we provide a follow-up on the Moldavian versus Romanian
Cr...
The task of detecting whether a person wears a face mask from speech is
...