Han Lu

research

∙ 09/14/2023

USM-SCD: Multilingual Speaker Change Detection Based on Large Pretrained Foundation Models

We introduce a multilingual speaker change detection model (USM-SCD) tha...

0 Guanlong Zhao, et al. ∙

research

∙ 05/12/2023

Rethinking k-means from manifold learning perspective

Although numerous clustering algorithms have been developed, many existi...

0 Quanxue Gao, et al. ∙

research

∙ 04/10/2023

Scalable Randomized Kernel Methods for Multiview Data Integration and Prediction

We develop scalable randomized kernel methods for jointly associating da...

0 Sandra E. Safo, et al. ∙

research

∙ 03/25/2023

Active Finetuning: Exploiting Annotation Budget in the Pretraining-Finetuning Paradigm

Given the large-scale data and the high annotation cost, pretraining-fin...

0 Yichen Xie, et al. ∙

research

∙ 02/15/2023

Interpretable Deep Learning Methods for Multiview Learning

Technological advances have enabled the generation of unique and complem...

0 Hengkang Wang, et al. ∙

research

∙ 11/11/2022

Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss

In this work we propose a novel token-based training strategy that impro...

0 Guanlong Zhao, et al. ∙

research

∙ 10/25/2022

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering

While recent research advances in speaker diarization mostly focus on im...

0 Quan Wang, et al. ∙

research

∙ 05/27/2022

Contrastive Siamese Network for Semi-supervised Speech Recognition

This paper introduces contrastive siamese (c-siam) network, an architect...

5 Soheil Khorram, et al. ∙

research

∙ 04/12/2022

Continual Predictive Learning from Videos

Predictive learning ideally builds the world model of physical processes...

0 Wendong Zhang, et al. ∙

research

∙ 12/28/2021

Mind Your Solver! On Adversarial Attack and Defense for Combinatorial Optimization

Combinatorial optimization (CO) is a long-standing challenging task not ...

0 Han Lu, et al. ∙

research

∙ 12/03/2021

Generalized Transitional Markov Chain Monte Carlo Sampling Technique for Bayesian Inversion

In the context of Bayesian inversion for scientific and engineering mode...

0 Han Lu, et al. ∙

research

∙ 09/23/2021

Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection

In this paper, we present a novel speaker diarization system for streami...

0 Wei Xia, et al. ∙

research

∙ 06/14/2021

Learning-Aided Heuristics Design for Storage System

Computer systems such as storage systems normally require transparent wh...

0 Yingtian Tang, et al. ∙

research

∙ 05/06/2021

Reducing Streaming ASR Model Delay with Self Alignment

Reducing prediction delay for streaming end-to-end ASR models with minim...

4 Jaeyoung Kim, et al. ∙

research

∙ 10/07/2020

Transformer Transducer: One Model Unifying Streaming and Non-streaming Speech Recognition

In this paper we present a Transformer-Transducer model architecture and...

0 Anshuman Tripathi, et al. ∙

research

∙ 04/17/2020

Detailed 2D-3D Joint Representation for Human-Object Interaction

Human-Object Interaction (HOI) detection lies at the core of action unde...

7 Yong-Lu Li, et al. ∙

research

∙ 02/07/2020

Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss

In this paper we present an end-to-end speech recognition model with Tra...

0 Qian Zhang, et al. ∙

research

∙ 08/22/2017

Handling Homographs in Neural Machine Translation

Homographs, words with different meanings but the same surface form, hav...

0 Frederick Liu, et al. ∙

research

∙ 04/17/2017

Learning Character-level Compositionality with Visual Features

Previous work has modeled the compositionality of words by creating char...

0 Frederick Liu, et al. ∙

Han Lu

Featured Co-authors

Sign in with Google

Consider DeepAI Pro