In this paper, we propose Docprompt for document question answering task...
Within the context of autonomous driving, encountering unknown objects
b...
Spread spectrum multiple access systems demand minimum possible
cross-co...
The advent of deep learning has brought a revolutionary transformation t...
Emerging large-scale text-to-image generative models, e.g., Stable Diffu...
The generalization with respect to domain shifts, as they frequently app...
Large text-to-image diffusion models have impressive capabilities in
gen...
Clustering continues to be a significant and challenging task. Recent st...
Modern predictive models are often deployed to environments in which
com...
Large-scale pre-trained models have achieved remarkable success in a var...
Avoiding the introduction of ghosts when synthesising LDR images as high...
In recent years, the development of deep learning has been pushing image...
Recent advances in deep learning have been pushing image denoising techn...
Data science workflows are human-centered processes involving on-demand
...
Despite excellent average-case performance of many image classifiers, th...
In this paper, we propose a novel self-distillation method for fake spee...
How humans infer discrete emotions is a fundamental research question in...
Human-centered AI workflows involve stakeholders with multiple roles
int...
We present MEGAnno, a novel exploratory annotation framework designed fo...
We address the task of open-world class-agnostic object detection, i.e.,...
Microprocessor architects are increasingly resorting to domain-specific
...
Pre-trained multilingual language models play an important role in
cross...
The generalization with respect to domain shifts, as they frequently app...
Semantic segmentation is a classic computer vision problem dedicated to
...
Dataset discovery from data lakes is essential in many real application
...
Joint synthesis of images and segmentation masks with generative adversa...
Automated detection of retinal structures, such as retinal vessels (RV),...
Accurate prediction of roll motion in high sea state is significant for ...
Retinal Optical Coherence Tomography Angiography (OCTA) with high-resolu...
As a rising task, panoptic segmentation is faced with challenges in both...
Emotion recognition plays a vital role in human-machine interactions and...
The rapidly-changing deep learning landscape presents a unique opportuni...
Magnetic induction tomography (MIT) is an efficient solution for long-te...
Post-hoc calibration is a common approach for providing high-quality
con...
Deep generative networks trained via maximum likelihood on a natural ima...
It is a recognized fact that the classification accuracy of unseen class...
Uncertainty estimates help to identify ambiguous, novel, or anomalous in...
Detecting the semantic types of data columns in relational tables is
imp...
Deep neural networks achieve state-of-the-art results on several tasks w...
POLSAR image has an advantage over optical image because it can be acqui...
Predicting the potential relations between nodes in networks, known as l...
Training of Generative Adversarial Networks (GANs) is notoriously fragil...
The adoption of differential privacy is growing but the complexity of
de...
For data pricing, data quality is a factor that must be considered. To k...
The approaches for analyzing the polarimetric scattering matrix of
polar...
Gastric cancer is the second leading cause of cancer-related deaths
worl...
In ISNN'04, a novel symmetric cipher was proposed, by combining a chaoti...