Kiyoharu Aizawa

research

∙ 07/30/2023

Open-Set Domain Adaptation with Visual-Language Foundation Models

Unsupervised domain adaptation (UDA) has proven to be very effective in ...

0 Qing Yu, et al. ∙

research

∙ 06/30/2023

Manga109Dialog A Large-scale Dialogue Dataset for Comics Speaker Detection

The expanding market for e-comics has spurred interest in the developmen...

0 Yingxuan Li, et al. ∙

research

∙ 06/02/2023

LoCoOp: Few-Shot Out-of-Distribution Detection via Prompt Learning

We present a novel vision-language prompt learning approach for few-shot...

0 Atsuyuki Miyai, et al. ∙

research

∙ 05/05/2023

Guided Image Synthesis via Initial Image Editing in Diffusion Model

Diffusion models have the ability to generate high quality images by den...

0 Jiafeng Mao, et al. ∙

research

∙ 04/10/2023

Zero-Shot In-Distribution Detection in Multi-Object Settings Using Vision-Language Foundation Models

Removing out-of-distribution (OOD) images from noisy images scraped from...

0 Atsuyuki Miyai, et al. ∙

research

∙ 12/07/2022

Non-uniform Sampling Strategies for NeRF on 360° images

In recent years, the performance of novel view synthesis using perspecti...

0 Takashi Otonari, et al. ∙

research

∙ 11/18/2022

A Structure-Guided Diffusion Model for Large-Hole Diverse Image Completion

Diverse image completion, a problem of generating various ways of fillin...

0 Daichi Horita, et al. ∙

research

∙ 11/02/2022

Universal Deep Image Compression via Content-Adaptive Optimization with Adapters

Deep image compression performs better than conventional codecs, such as...

0 Koki Tsubota, et al. ∙

research

∙ 10/23/2022

Rethinking Rotation in Self-Supervised Contrastive Learning: Adaptive Positive or Negative Data Augmentation

Rotation is frequently listed as a candidate for data augmentation in co...

0 Atsuyuki Miyai, et al. ∙

research

∙ 09/08/2022

Saliency-based Multiple Region of Interest Detection from a Single 360° image

360 images are informative – it contains omnidirectional visual informat...

0 Yuuki Sawabe, et al. ∙

research

∙ 07/20/2022

Evaluating the Stability of Deep Image Quality Assessment With Respect to Image Scaling

Image quality assessment (IQA) is a fundamental metric for image process...

0 Koki Tsubota, et al. ∙

research

∙ 07/11/2022

COO: Comic Onomatopoeia Dataset for Recognizing Arbitrary or Truncated Texts

Recognizing irregular texts has been a challenging topic in text recogni...

0 Jeonghun Baek, et al. ∙

research

∙ 06/21/2022

SVG Vector Font Generation for Chinese Characters with Transformer

Designing fonts for Chinese characters is highly labor-intensive and tim...

0 Haruka Aoki, et al. ∙

research

∙ 04/10/2022

Intersection Prediction from Single 360° Image via Deep Detection of Possible Direction of Travel

Movie-Map, an interactive first-person-view map that engages the user in...

0 Naoki Sugimoto, et al. ∙

research

∙ 04/03/2022

Distortion-Aware Self-Supervised 360° Depth Estimation from A Single Equirectangular Projection Image

360 images are widely available over the last few years. This paper prop...

0 Yuya Hasegawa, et al. ∙

research

∙ 02/07/2022

Field-of-View IoU for Object Detection in 360° Images

360 cameras have gained popularity over the last few years. In this pape...

0 Miao Cao, et al. ∙

research

∙ 10/20/2021

Noisy Annotation Refinement for Object Detection

Supervised training of object detectors requires well-annotated large-sc...

0 Jiafeng Mao, et al. ∙

research

∙ 03/08/2021

A Novel Perspective for Positive-Unlabeled Learning via Noisy Labels

Positive-unlabeled learning refers to the process of training a binary c...

0 Daiki Tanaka, et al. ∙

research

∙ 03/07/2021

What If We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels

Scene text recognition (STR) task has a common practice: All state-of-th...

0 Jeonghun Baek, et al. ∙

research

∙ 11/17/2020

Building Movie Map – A Tool for Exploring Areas in a City – and its Evaluation

We propose a new Movie Map system, with an interface for exploring citie...

0 Naoki Sugimoto, et al. ∙

research

∙ 11/04/2020

Few-Shot Font Generation with Deep Metric Learning

Designing fonts for languages with a large number of characters, such as...

0 Haruka Aoki, et al. ∙

research

∙ 11/03/2020

The Aleatoric Uncertainty Estimation Using a Separate Formulation with Virtual Residuals

We propose a new optimization framework for aleatoric uncertainty estima...

23 Takumi Kawashima, et al. ∙

research

∙ 09/16/2020

SLGAN: Style- and Latent-guided Generative Adversarial Network for Desirable Makeup Transfer and Removal

There are five features to consider when using generative adversarial ne...

0 Daichi Horita, et al. ∙

research

∙ 07/22/2020

Multi-Task Curriculum Framework for Open-Set Semi-Supervised Learning

Semi-supervised learning (SSL) has been proposed to leverage unlabeled d...

0 Qing Yu, et al. ∙

research

∙ 05/09/2020

Building a Manga Dataset "Manga109" with Annotations for Multimedia Applications

Manga, or comics, which are a type of multimodal artwork, have been left...

12 Kiyoharu Aizawa, et al. ∙

research

∙ 08/14/2019

Unsupervised Out-of-Distribution Detection by Maximum Classifier Discrepancy

Since deep learning models have been implemented in many commercial appl...

8 Qing Yu, et al. ∙

research

∙ 08/10/2019

Object-Aware Instance Labeling for Weakly Supervised Object Detection

Weakly supervised object detection (WSOD), where a detector is trained w...

6 Satoshi Kosugi, et al. ∙

research

∙ 05/03/2019

MeshDepth: Disconnected Mesh-based Deep Depth Prediction

We propose a novel method for mesh-based single-view depth estimation us...

8 Masaya Kaneko, et al. ∙

research

∙ 04/18/2019

Computational Attention System for Children, Adults and Elderly

The existing computational visual attention systems have focused on the ...

0 Onkar Krishna, et al. ∙

research

∙ 03/03/2019

Recognition of Multiple Food Items in a Single Photo for Use in a Buffet-Style Restaurant

We investigate image recognition of multiple food items in a single phot...

0 Masashi Anzawa, et al. ∙

research

∙ 09/03/2018

Context-Patch Face Hallucination Based on Thresholding Locality-constrained Representation and Reproducing Learning

Face hallucination is a technique that reconstruct high-resolution (HR) ...

0 Junjun Jiang, et al. ∙

research

∙ 08/26/2018

Scale Drift Correction of Camera Geo-Localization using Geo-Tagged Images

Camera geo-localization from a monocular video is a fundamental task for...

2 Kazuya Iwami, et al. ∙

research

∙ 05/08/2018

Category-Based Deep CCA for Fine-Grained Venue Discovery from Multimodal Data

In this work, travel destination and business location are taken as venu...

0 Yi Yu, et al. ∙

research

∙ 04/08/2018

Personalized Classifier for Food Image Recognition

Currently, food image recognition tasks are evaluated against fixed data...

0 Shota Horiguchi, et al. ∙

research

∙ 03/30/2018

Parallel Grid Pooling for Data Augmentation

Convolutional neural network (CNN) architectures utilize downsampling la...

0 Akito Takeki, et al. ∙

research

∙ 03/30/2018

Cross-Domain Weakly-Supervised Object Detection through Progressive Domain Adaptation

Can we detect common objects in a variety of image domains without insta...

0 Naoto Inoue, et al. ∙

research

∙ 03/30/2018

Joint Optimization Framework for Learning with Noisy Labels

Deep neural networks (DNNs) trained on large-scale datasets have exhibit...

0 Daiki Tanaka, et al. ∙

research

∙ 12/29/2017

Significance of Softmax-based Features in Comparison to Distance Metric Learning-based Features

The extraction of useful deep features is important for many computer vi...

0 Shota Horiguchi, et al. ∙

research

∙ 09/12/2017

PQk-means: Billion-scale Clustering for Product-quantized Codes

Data clustering is a fundamental operation in data analysis. For handlin...

0 Yusuke Matsui, et al. ∙

research

∙ 06/21/2017

cGAN-based Manga Colorization Using a Single Training Image

The Japanese comic format known as Manga is popular all over the world. ...

0 Paulina Hensman, et al. ∙

research

∙ 05/26/2017

Residual Expansion Algorithm: Fast and Effective Optimization for Nonconvex Least Squares Problems

We propose the residual expansion (RE) algorithm: a global (or near-glob...

0 Daiki Ikami, et al. ∙

research

∙ 05/20/2017

Gaze Distribution Analysis and Saliency Prediction Across Age Groups

Knowledge of the human visual system helps to develop better computation...

0 Onkar Krishna, et al. ∙

research

∙ 10/15/2015

Sketch-based Manga Retrieval using Manga109 Dataset

Manga (Japanese comics) are popular worldwide. However, current e-manga ...

0 Yusuke Matsui, et al. ∙

Kiyoharu Aizawa

Featured Co-authors

Sign in with Google

Consider DeepAI Pro