Qiao Liang

research

∙ 06/08/2023

ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases

Enabling large language models to effectively utilize real-world tools i...

0 Qiaoyu Tang, et al. ∙

research

∙ 08/29/2022

A Language Agnostic Multilingual Streaming On-Device ASR System

On-device end-to-end (E2E) models have shown improvements over a convent...

1 Bo Li, et al. ∙

research

∙ 08/29/2022

Streaming Intended Query Detection using E2E Modeling for Continued Conversation

In voice-enabled applications, a predetermined hotword isusually used to...

0 Shuo-yiin Chang, et al. ∙

research

∙ 08/29/2022

Turn-Taking Prediction for Natural Conversational Speech

While a streaming voice assistant system has been used in many applicati...

0 Shuo-yiin Chang, et al. ∙

research

∙ 04/13/2022

A Unified Cascaded Encoder ASR Model for Dynamic Model Sizes

In this paper, we propose a dynamic cascaded encoder Automatic Speech Re...

0 Shaojin Ding, et al. ∙

research

∙ 04/08/2022

Personal VAD 2.0: Optimizing Personal Voice Activity Detection for On-Device Speech Recognition

Personalization of on-device speech recognition (ASR) has seen explosive...

0 Shaojin Ding, et al. ∙

research

∙ 02/24/2022

Closing the Gap between Single-User and Multi-User VoiceFilter-Lite

VoiceFilter-Lite is a speaker-conditioned voice separation model that pl...

0 Rajeev Rikhye, et al. ∙

research

∙ 07/02/2021

Multi-user VoiceFilter-Lite via Attentive Speaker Embedding

In this paper, we propose a solution to allow speaker conditioned speech...

0 Rajeev Rikhye, et al. ∙

research

∙ 04/28/2021

Personalized Keyphrase Detection using Speaker and Environment Information

In this paper, we introduce a streaming keyphrase detection system that ...

0 Rajeev Rikhye, et al. ∙

research

∙ 02/18/2021

JST-RR Model: Joint Modeling of Ratings and Reviews in Sentiment-Topic Prediction

Analysis of online reviews has attracted great attention with broad appl...

0 Qiao Liang, et al. ∙

research

∙ 11/21/2020

A Better and Faster End-to-End Model for Streaming ASR

End-to-end (E2E) models have shown to outperform state-of-the-art conven...

0 Bo Li, et al. ∙

research

∙ 11/11/2020

Efficient Knowledge Distillation for RNN-Transducer Models

Knowledge Distillation is an effective method of transferring knowledge ...

0 Sankaran Panchapagesan, et al. ∙

research

∙ 05/16/2020

Dynamic Sparsity Neural Networks for Automatic Speech Recognition

In automatic speech recognition (ASR), model pruning is a widely adopted...

0 Zhaofeng Wu, et al. ∙

research

∙ 03/28/2020

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Thus far, end-to-end (E2E) models have not been shown to outperform stat...

0 Tara N. Sainath, et al. ∙

research

∙ 08/29/2019

Two-Pass End-to-End Speech Recognition

The requirements for many applications of state-of-the-art speech recogn...

0 Tara N. Sainath, et al. ∙

research

∙ 02/21/2019

Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling

Lingvo is a Tensorflow framework offering a complete solution for collab...

13 Jonathan Shen, et al. ∙

research

∙ 11/15/2018

Streaming End-to-end Speech Recognition For Mobile Devices

End-to-end (E2E) models, which directly predict output character sequenc...

0 Yanzhang He, et al. ∙

research

∙ 06/19/2015

To Know Where We Are: Vision-Based Positioning in Outdoor Environments

Augmented reality (AR) displays become more and more popular recently, b...

0 Kuan-Wen Chen, et al. ∙

Qiao Liang

Featured Co-authors

Sign in with Google

Consider DeepAI Pro