b'Tae-Hyun Oh'

research

∙ 09/19/2023

Sound Source Localization is All about Cross-Modal Alignment

Humans can easily perceive the direction of sound sources in a visual sc...

0 Arda Senocak, et al. ∙

research

∙ 09/18/2023

An Iterative Method for Unsupervised Robust Anomaly Detection Under Data Contamination

Most deep anomaly detection models are based on learning normality from ...

0 Minkyung Kim, et al. ∙

research

∙ 08/14/2023

The Devil in the Details: Simple and Effective Optical Flow Synthetic Data Generation

Recent work on dense optical flow has shown significant progress, primar...

0 Kwon Byung-Ki, et al. ∙

research

∙ 08/02/2023

Exploiting Synthetic Data for Data Imbalance Problems: Baselines from a Data Perspective

We live in a vast ocean of data, and deep neural networks are no excepti...

0 Moon Ye-Bin, et al. ∙

research

∙ 07/27/2023

TextManiA: Enriching Visual Feature by Text-driven Manifold Augmentation

Recent label mix-based augmentation methods have shown their effectivene...

0 Moon Ye-Bin, et al. ∙

research

∙ 05/26/2023

Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis

Recently, zero-shot TTS and VC methods have gained attention due to thei...

0 Seongyeon Park, et al. ∙

research

∙ 03/30/2023

Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment

How does audio describe the world around us? In this paper, we propose a...

0 Kim Sung-Bin, et al. ∙

research

∙ 03/30/2023

Prefix tuning for automated audio captioning

Audio captioning aims to generate text descriptions from environmental s...

0 Minkyu Kim, et al. ∙

research

∙ 03/28/2023

Unsupervised Pre-Training For Data-Efficient Text-to-Speech On Low Resource Languages

Neural text-to-speech (TTS) models can synthesize natural human speech w...

0 Seongyeon Park, et al. ∙

research

∙ 02/20/2023

ENInst: Enhancing Weakly-supervised Low-shot Instance Segmentation

We address a weakly-supervised low-shot instance segmentation, an annota...

0 Moon Ye-Bin, et al. ∙

research

∙ 02/01/2023

Computational Discovery of Microstructured Composites with Optimal Strength-Toughness Trade-Offs

The conflict between strength and toughness is a fundamental problem in ...

0 Beichen Li, et al. ∙

research

∙ 01/26/2023

Semi-Supervised Image Captioning by Adversarially Propagating Labeled Data

We present a novel data-efficient semi-supervised framework to improve t...

0 Dong-Jin Kim, et al. ∙

research

∙ 10/16/2022

Scratching Visual Transformer's Back with Uniform Attention

The favorable performance of Vision Transformers (ViTs) is often attribu...

18 Nam Hyeon-Woo, et al. ∙

research

∙ 08/14/2022

HDR-Plenoxels: Self-Calibrating High Dynamic Range Radiance Fields

We propose high dynamic range (HDR) radiance fields, HDR-Plenoxels, that...

0 Kim Jun-Seong, et al. ∙

research

∙ 07/27/2022

Cross-Attention of Disentangled Modalities for 3D Human Mesh Recovery with Transformers

Transformer encoder architectures have recently achieved state-of-the-ar...

2 Junhyeong Cho, et al. ∙

research

∙ 06/09/2022

CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Meshes

We propose CLIP-Actor, a text-driven motion recommendation and neural me...

0 Kim Youwang, et al. ∙

research

∙ 02/12/2022

Audio-Visual Fusion Layers for Event Type Aware Video Recognition

Human brain is continuously inundated with the multisensory information ...

0 Arda Senocak, et al. ∙

research

∙ 11/03/2021

Unified 3D Mesh Recovery of Humans and Animals by Learning Animal Exercise

We propose an end-to-end unified 3D mesh recovery of humans and quadrupe...

0 Kim Youwang, et al. ∙

research

∙ 10/02/2021

FICGAN: Facial Identity Controllable GAN for De-identification

In this work, we present Facial Identity Controllable GAN (FICGAN) for n...

0 Yonghyun Jeong, et al. ∙

research

∙ 08/13/2021

FedPara: Low-rank Hadamard Product Parameterization for Efficient Federated Learning

To overcome the burdens on frequent model uploads and downloads during f...

0 Nam Hyeon-Woo, et al. ∙

research

∙ 10/08/2020

Dense Relational Image Captioning via Multi-task Triple-Stream Networks

We introduce dense relational captioning, a novel image captioning task ...

0 Dong-Jin Kim, et al. ∙

research

∙ 08/24/2020

Monocular Reconstruction of Neural Face Reflectance Fields

The reflectance field of a face describes the reflectance properties res...

1 Mallikarjun B R., et al. ∙

research

∙ 03/18/2020

Cross-domain Self-supervised Learning for Domain Adaptation with Few Source Labels

Existing unsupervised domain adaptation methods aim to transfer knowledg...

5 Donghyun Kim, et al. ∙

research

∙ 12/10/2019

Listen to Look: Action Recognition by Previewing Audio

In the face of the video data deluge, today's expensive clip-level class...

9 Ruohan Gao, et al. ∙

research

∙ 11/20/2019

Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications

Visual events are usually accompanied by sounds in our daily lives. Howe...

65 Arda Senocak, et al. ∙

research

∙ 09/16/2019

Visuomotor Understanding for Representation Learning of Driving Scenes

Dashboard cameras capture a tremendous amount of driving scene video eac...

1 Seokju Lee, et al. ∙

research

∙ 09/05/2019

Image Captioning with Very Scarce Supervised Data: Adversarial Semi-Supervised Learning Approach

Constructing an organized dataset comprised of a large number of images ...

0 Dong-Jin Kim, et al. ∙

research

∙ 05/23/2019

Speech2Face: Learning the Face Behind a Voice

How much can we infer about a person's looks from the way they speak? In...

22 Tae-Hyun Oh, et al. ∙

research

∙ 04/17/2019

Variational Prototyping-Encoder: One-Shot Learning with Prototypical Images

In daily life, graphic symbols, such as traffic signs and brand logos, a...

0 Junsik Kim, et al. ∙

research

∙ 03/14/2019

Dense Relational Captioning: Triple-Stream Networks for Relationship-Based Captioning

Our goal in this work is to train an image captioning model that generat...

0 Dong-Jin Kim, et al. ∙

research

∙ 02/07/2019

Neural Inverse Knitting: From Images to Manufacturing Instructions

Motivated by the recent potential of mass customization brought by whole...

0 Alexandre Kaspar, et al. ∙

research

∙ 11/27/2018

Noise-tolerant Audio-visual Online Person Verification using an Attention-based Neural Network Fusion

In this paper, we present a multi-modal online person verification syste...

0 Suwon Shon, et al. ∙

research

∙ 05/15/2018

On Learning Associations of Faces and Voices

In this paper, we study the associations between human faces and voices....

0 Changil Kim, et al. ∙

research

∙ 04/08/2018

Learning-based Video Motion Magnification

Video motion magnification techniques allow us to see small motions prev...

1 Tae-Hyun Oh, et al. ∙

research

∙ 03/10/2018

Learning to Localize Sound Source in Visual Scenes

Visual events are usually accompanied by sounds in our daily lives. We p...

0 Arda Senocak, et al. ∙

research

∙ 02/14/2018

Disjoint Multi-task Learning between Heterogeneous Human-centric Tasks

Human behavior understanding is arguably one of the most important mid-l...

0 Dong-Jin Kim, et al. ∙

research

∙ 12/05/2017

Co-domain Embedding using Deep Quadruplet Networks for Unseen Traffic Sign Recognition

Recent advances in visual recognition show overarching success by virtue...

0 Junsik Kim, et al. ∙

research

∙ 08/24/2017

Gradient-based Camera Exposure Control for Outdoor Mobile Platforms

We introduce a novel method to automatically adjust camera exposure for ...

0 Inwook Shim, et al. ∙

research

∙ 08/09/2017

Personalized Cinemagraphs using Semantic Understanding and Collaborative Learning

Cinemagraphs are a compelling way to convey dynamic aspects of a scene. ...

0 Tae-Hyun Oh, et al. ∙

research

∙ 08/09/2017

Weakly- and Self-Supervised Learning for Content-Aware Deep Image Retargeting

This paper proposes a weakly- and self-supervised deep convolutional neu...

0 Donghyeon Cho, et al. ∙

research

∙ 02/06/2017

Textually Customized Video Summaries

The best summary of a long video differs among different people due to i...

0 Jinsoo Choi, et al. ∙

research

∙ 01/12/2016

Human Attention Estimation for Natural Images: An Automatic Gaze Refinement Approach

Photo collections and its applications today attempt to reflect user int...

0 Jinsoo Choi, et al. ∙

research

∙ 12/07/2015

Pseudo-Bayesian Robust PCA: Algorithms and Analyses

Commonly used in computer vision and other applications, robust PCA repr...

0 Tae-Hyun Oh, et al. ∙

research

∙ 09/01/2015

Fast Randomized Singular Value Thresholding for Low-rank Optimization

Rank minimization can be converted into tractable surrogate problems, su...

0 Tae-Hyun Oh, et al. ∙

research

∙ 03/04/2015

Partial Sum Minimization of Singular Values in Robust PCA: Algorithm and Applications

Robust Principal Component Analysis (RPCA) via rank minimization is a po...

0 Tae-Hyun Oh, et al. ∙

Tae-Hyun Oh

Featured Co-authors

Sign in with Google

Consider DeepAI Pro