Unsupervised domain adaptation (UDA) has proven to be very effective in
...
The expanding market for e-comics has spurred interest in the developmen...
We present a novel vision-language prompt learning approach for few-shot...
Diffusion models have the ability to generate high quality images by
den...
Removing out-of-distribution (OOD) images from noisy images scraped from...
In recent years, the performance of novel view synthesis using perspecti...
Diverse image completion, a problem of generating various ways of fillin...
Deep image compression performs better than conventional codecs, such as...
Rotation is frequently listed as a candidate for data augmentation in
co...
360 images are informative – it contains omnidirectional visual
informat...
Image quality assessment (IQA) is a fundamental metric for image process...
Recognizing irregular texts has been a challenging topic in text recogni...
Designing fonts for Chinese characters is highly labor-intensive and
tim...
Movie-Map, an interactive first-person-view map that engages the user in...
360 images are widely available over the last few years. This paper
prop...
360 cameras have gained popularity over the last few years. In this
pape...
Supervised training of object detectors requires well-annotated large-sc...
Positive-unlabeled learning refers to the process of training a binary
c...
Scene text recognition (STR) task has a common practice: All state-of-th...
We propose a new Movie Map system, with an interface for exploring citie...
Designing fonts for languages with a large number of characters, such as...
We propose a new optimization framework for aleatoric uncertainty estima...
There are five features to consider when using generative adversarial
ne...
Semi-supervised learning (SSL) has been proposed to leverage unlabeled d...
Manga, or comics, which are a type of multimodal artwork, have been left...
Since deep learning models have been implemented in many commercial
appl...
Weakly supervised object detection (WSOD), where a detector is trained w...
We propose a novel method for mesh-based single-view depth estimation us...
The existing computational visual attention systems have focused on the
...
We investigate image recognition of multiple food items in a single phot...
Face hallucination is a technique that reconstruct high-resolution (HR) ...
Camera geo-localization from a monocular video is a fundamental task for...
In this work, travel destination and business location are taken as venu...
Currently, food image recognition tasks are evaluated against fixed data...
Convolutional neural network (CNN) architectures utilize downsampling la...
Can we detect common objects in a variety of image domains without
insta...
Deep neural networks (DNNs) trained on large-scale datasets have exhibit...
The extraction of useful deep features is important for many computer vi...
Data clustering is a fundamental operation in data analysis. For handlin...
The Japanese comic format known as Manga is popular all over the world. ...
We propose the residual expansion (RE) algorithm: a global (or near-glob...
Knowledge of the human visual system helps to develop better computation...
Manga (Japanese comics) are popular worldwide. However, current e-manga
...