Yun Tang

research

∙ 07/17/2023

Domain Knowledge Distillation from Large Language Model: An Empirical Study in the Autonomous Driving Domain

Engineering knowledge-based (or expert) systems require extensive manual...

0 Yun Tang, et al. ∙

research

∙ 06/01/2023

Exploration on HuBERT with Multiple Resolutions

Hidden-unit BERT (HuBERT) is a widely-used self-supervised learning (SSL...

0 Jiatong Shi, et al. ∙

research

∙ 05/04/2023

Hybrid Transducer and Attention based Encoder-Decoder Modeling for Speech-to-Text Tasks

Transducer and Attention based Encoder-Decoder (AED) are two widely used...

0 Yun Tang, et al. ∙

research

∙ 04/10/2023

Enhancing Speech-to-Speech Translation with Multiple TTS Targets

It has been known that direct speech-to-speech translation (S2ST) models...

0 Jiatong Shi, et al. ∙

research

∙ 04/10/2023

ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit

ESPnet-ST-v2 is a revamp of the open-source ESPnet-ST toolkit necessitat...

0 Brian Yan, et al. ∙

research

∙ 12/15/2022

UnitY: Two-pass Direct Speech-to-speech Translation with Discrete Units

Direct speech-to-speech translation (S2ST), in which all components can ...

2 Hirofumi Inaguma, et al. ∙

research

∙ 11/05/2022

High Capacity Reversible Data Hiding for Encrypted 3D Mesh Models Based on Topology

Reversible data hiding in encrypted domain(RDH-ED) can not only protect ...

0 Yun Tang, et al. ∙

research

∙ 10/26/2022

Improving Speech-to-Speech Translation Through Unlabeled Text

Direct speech-to-speech translation (S2ST) is among the most challenging...

3 Xuan-Phi Nguyen, et al. ∙

research

∙ 10/21/2022

Named Entity Detection and Injection for Direct Speech Translation

In a sentence, certain words are critical for its semantic. Among them, ...

1 Marco Gaido, et al. ∙

research

∙ 10/18/2022

Simple and Effective Unsupervised Speech Translation

The amount of labeled data to train models for speech tasks is limited f...

0 Changhan Wang, et al. ∙

research

∙ 04/11/2022

Unified Speech-Text Pre-training for Speech Translation and Recognition

We describe a method to jointly pre-train speech and text in an encoder-...

1 Yun Tang, et al. ∙

research

∙ 10/15/2021

Direct simultaneous speech to speech translation

We present the first direct simultaneous speech-to-speech translation (S...

0 Xutai Ma, et al. ∙

research

∙ 10/15/2021

Incremental Speech Synthesis For Speech-To-Speech Translation

In a speech-to-speech translation (S2ST) pipeline, the text-to-speech (T...

0 Danni Liu, et al. ∙

research

∙ 07/14/2021

FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task

In this paper, we describe our end-to-end multilingual speech translatio...

7 Yun Tang, et al. ∙

research

∙ 07/12/2021

Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task

Pretraining and multitask learning are widely used to improve the speech...

12 Yun Tang, et al. ∙

research

∙ 07/12/2021

Direct speech-to-speech translation with discrete units

We present a direct speech-to-speech translation (S2ST) model that trans...

0 Ann Lee, et al. ∙

research

∙ 06/21/2021

Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling

Multi-head attention has each of the attention heads collect salient inf...

4 Hongyu Gong, et al. ∙

research

∙ 10/24/2020

Cross-Modal Transfer Learning for Multilingual Speech-to-Text Translation

We propose an effective approach to utilize pretrained speech and text m...

8 Chau Tran, et al. ∙

research

∙ 10/21/2020

A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks

Attention-based sequence-to-sequence modeling provides a powerful and el...

0 Yun Tang, et al. ∙

research

∙ 10/11/2020

fairseq S2T: Fast Speech-to-Text Modeling with fairseq

We introduce fairseq S2T, a fairseq extension for speech-to-text (S2T) m...

0 Changhan Wang, et al. ∙

research

∙ 06/03/2020

Self-Training for End-to-End Speech Translation

One of the main challenges for end-to-end speech translation is data sca...

0 Juan Pino, et al. ∙

research

∙ 11/09/2019

Orthogonal Relation Transforms with Graph Context Modeling for Knowledge Graph Embedding

Translational distance-based knowledge graph embedding has shown progres...

12 Yun Tang, et al. ∙

research

∙ 10/23/2019

Relation Module for Non-answerable Prediction on Question Answering

Machine reading comprehension(MRC) has attracted significant amounts of ...

13 Kevin Huang, et al. ∙

research

∙ 08/29/2019

Zero-shot Text-to-SQL Learning with Auxiliary Task

Recent years have seen great success in the use of neural seq2seq models...

0 Shuaichen Chang, et al. ∙

research

∙ 05/17/2019

Multi-hop Reading Comprehension across Multiple Documents by Reasoning over Heterogeneous Graphs

Multi-hop reading comprehension (RC) across documents poses new challeng...

0 Ming Tu, et al. ∙

research

∙ 04/16/2019

I4U Submission to NIST SRE 2018: Leveraging from a Decade of Shared Experiences

The I4U consortium was established to facilitate a joint entry to NIST s...

0 Kong Aik Lee, et al. ∙

research

∙ 03/22/2019

Towards adversarial learning of speaker-invariant representation for speech emotion recognition

Speech emotion recognition (SER) has attracted great attention in recent...

0 Ming Tu, et al. ∙

research

∙ 02/21/2019

Deep Speaker Embedding Learning with Multi-Level Pooling for Text-Independent Speaker Verification

This paper aims to improve the widely used deep speaker embedding x-vect...

0 Yun Tang, et al. ∙

research

∙ 11/11/2018

End-to-end Structure-Aware Convolutional Networks for Knowledge Base Completion

Knowledge graph embedding has been an active research topic for knowledg...

4 Chao Shang, et al. ∙

Yun Tang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro