Hyung-Min Park

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Kwanghee Choi
15 publications
Seongkyu Mun
13 publications
Seung-Hyun Lee
10 publications
Changsoo Je
7 publications
Soyeon Choe
7 publications
Myungwoo Oh
3 publications
Rae-Hong Park
3 publications
Yong-Hyeok Lee
2 publications
Jun-Hwan Ahn
2 publications
Jeongkyun Park
2 publications
Jong-Hyeon Park
1 publication

research

∙ 06/13/2023

Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition

In this paper, we present a statistical beamforming algorithm as a pre-p...

0 Ui-Hyeop Shin, et al. ∙

research

∙ 04/08/2023

Unsupervised Speech Representation Pooling Using Vector Quantization

With the advent of general-purpose speech representations from large-sca...

0 Jeongkyun Park, et al. ∙

research

∙ 01/16/2023

OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset

Inspired by humans comprehending speech in a multi-modal manner, various...

0 Jeongkyun Park, et al. ∙

research

∙ 06/25/2022

Distilling a Pretrained Language Model to a Multilingual ASR Model

Multilingual speech data often suffer from long-tailed language distribu...

0 Kwanghee Choi, et al. ∙

research

∙ 07/10/2020

Overcoming label noise in audio event detection using sequential labeling

This paper addresses the noisy label issue in audio event detection (AED...

0 Jae-Bin Kim, et al. ∙

research

∙ 04/12/2019

Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition

In general, the performance of automatic speech recognition (ASR) system...

0 Jong-Hyeon Park, et al. ∙

research

∙ 08/25/2015

BREN: Body Reflection Essence-Neuter Model for Separation of Reflection Components

We propose a novel reflection color model consisting of body essence and...

0 Changsoo Je, et al. ∙

Success!

An error occurred

Hyung-Min Park

Featured Co-authors

Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition

Unsupervised Speech Representation Pooling Using Vector Quantization

OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset

Distilling a Pretrained Language Model to a Multilingual ASR Model

Overcoming label noise in audio event detection using sequential labeling

Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition

BREN: Body Reflection Essence-Neuter Model for Separation of Reflection Components

Sign in with Google

Consider DeepAI Pro