Xian Li

research

∙ 09/20/2023

Chain-of-Verification Reduces Hallucination in Large Language Models

Generation of plausible yet incorrect factual information, termed halluc...

0 Shehzaad Dhuliawala, et al. ∙

research

∙ 09/15/2023

Fine-tune the pretrained ATST model for sound event detection

Sound event detection (SED) often suffers from the data deficiency probl...

0 Nian Shao, et al. ∙

research

∙ 08/24/2023

Capacity Analysis and Throughput Maximization of NOMA with Nonlinear Power Amplifier Distortion

In future B5G/6G broadband communication systems, non-linear signal dist...

0 Xiaojia Wang, et al. ∙

research

∙ 08/11/2023

Self-Alignment with Instruction Backtranslation

We present a scalable method to build a high quality instruction followi...

0 Xian Li, et al. ∙

research

∙ 07/04/2023

Concept2Box: Joint Geometric Embeddings for Learning Two-View Knowledge Graphs

Knowledge graph embeddings (KGE) have been extensively studied to embed ...

0 Zijie Huang, et al. ∙

research

∙ 06/07/2023

Self-supervised Audio Teacher-Student Transformer for Both Clip-level and Frame-level Tasks

In recent years, self-supervised learning (SSL) has emerged as a popular...

0 Xian Li, et al. ∙

research

∙ 06/01/2023

PV2TEA: Patching Visual Modality to Textual-Established Information Extraction

Information extraction, e.g., attribute value extraction, has been exten...

4 Hejie Cui, et al. ∙

research

∙ 05/26/2023

Towards Open-World Product Attribute Mining: A Lightly-Supervised Approach

We present a new task setting for attribute mining on e-commerce product...

0 Liyan Xu, et al. ∙

research

∙ 05/23/2023

Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model

Large and sparse feed-forward networks (S-FFN) such as Mixture-of-Expert...

0 Leo Z. Liu, et al. ∙

research

∙ 05/09/2023

Large Language Model Programs

In recent years, large pre-trained language models (LLMs) have demonstra...

0 Imanol Schlag, et al. ∙

research

∙ 12/22/2022

OPT-IML: Scaling Language Model Instruction Meta Learning through the Lens of Generalization

Recent work has shown that fine-tuning large pre-trained language models...

0 Srinivasan Iyer, et al. ∙

research

∙ 06/05/2022

Multilingual Neural Machine Translation with Deep Encoder and Multiple Shallow Decoders

Recent work in multilingual translation advances translation quality sur...

27 Xiang Kong, et al. ∙

research

∙ 05/25/2022

ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection

Hate speech detection is complex; it relies on commonsense reasoning, kn...

0 Badr AlKhamissi, et al. ∙

research

∙ 05/24/2022

SFace: Sigmoid-Constrained Hypersphere Loss for Robust Face Recognition

Deep face recognition has achieved great success due to large-scale trai...

12 Yaoyao Zhong, et al. ∙

research

∙ 05/12/2022

Lifting the Curse of Multilinguality by Pre-training Modular Transformers

Multilingual pre-trained models are known to suffer from the curse of mu...

4 Jonas Pfeiffer, et al. ∙

research

∙ 05/02/2022

OPT: Open Pre-trained Transformer Language Models

Large language models, which are often trained for hundreds of thousands...

8 Susan Zhang, et al. ∙

research

∙ 04/29/2022

OA-Mine: Open-World Attribute Mining for E-Commerce Products with Weak Supervision

Automatic extraction of product attributes from their textual descriptio...

9 Xinyang Zhang, et al. ∙

research

∙ 04/26/2022

ATST: Audio Representation Learning with Teacher-Student Transformer

Self-supervised learning (SSL) learns knowledge from a large amount of u...

5 Xian Li, et al. ∙

research

∙ 04/11/2022

Unified Speech-Text Pre-training for Speech Translation and Recognition

We describe a method to jointly pre-train speech and text in an encoder-...

1 Yun Tang, et al. ∙

research

∙ 03/14/2022

Efficient Language Modeling with Sparse all-MLP

All-MLP architectures have attracted increasing interest as an alternati...

7 Ping Yu, et al. ∙

research

∙ 12/20/2021

Efficient Large Scale Language Modeling with Mixtures of Experts

Mixture of Experts layers (MoEs) enable efficient scaling of language mo...

10 Mikel Artetxe, et al. ∙

research

∙ 12/20/2021

Few-shot Learning with Multilingual Language Models

Large-scale autoregressive language models such as GPT-3 are few-shot le...

8 Xi Victoria Lin, et al. ∙

research

∙ 11/26/2021

Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs

Do language models have beliefs about the world? Dennett (1995) famously...

5 Peter Hase, et al. ∙

research

∙ 11/04/2021

Energy-Efficient Online Data Sensing and Processing in Wireless Powered Edge Computing Systems

This paper focuses on developing energy-efficient online data processing...

0 Xian Li, et al. ∙

research

∙ 09/09/2021

Distributionally Robust Multilingual Machine Translation

Multilingual neural machine translation (MNMT) learns to translate multi...

9 Chunting Zhou, et al. ∙

research

∙ 07/14/2021

FST: the FAIR Speech Translation System for the IWSLT21 Multilingual Shared Task

In this paper, we describe our end-to-end multilingual speech translatio...

7 Yun Tang, et al. ∙

research

∙ 07/12/2021

Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task

Pretraining and multitask learning are widely used to improve the speech...

12 Yun Tang, et al. ∙

research

∙ 06/30/2021

Unaware Fairness: Hierarchical Random Forest for Protected Classes

Procedural fairness has been a public concern, which leads to controvers...

15 Xian Li, et al. ∙

research

∙ 06/27/2021

Online Cognitive Data Sensing and Processing Optimization in Energy-harvesting Edge Computing Systems

Mobile edge computing (MEC) has recently become a prevailing technique t...

0 Xian Li, et al. ∙

research

∙ 06/21/2021

Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling

Multi-head attention has each of the attention heads collect salient inf...

4 Hongyu Gong, et al. ∙

research

∙ 06/01/2021

Gender Bias Amplification During Speed-Quality Optimization in Neural Machine Translation

Is bias amplified when neural machine translation (NMT) models are optim...

5 Adithya Renduchintala, et al. ∙

research

∙ 04/15/2021

Demystify Optimization Challenges in Multilingual Transformers

Multilingual Transformer improves parameter efficiency and crosslingual ...

12 Xian Li, et al. ∙

research

∙ 04/15/2021

Adaptive Sparse Transformer for Multilingual Translation

Multilingual machine translation has attracted much attention recently d...

10 Hongyu Gong, et al. ∙

research

∙ 04/14/2021

An Integrated Optimization-Learning Framework for Online Combinatorial Computation Offloading in MEC Networks

Mobile edge computing (MEC) is a promising paradigm to accommodate the i...

0 Xian Li, et al. ∙

research

∙ 02/28/2021

On the Subbagging Estimation for Massive Data

This article introduces subbagging (subsample aggregating) estimation ap...

6 Tao Zou, et al. ∙

research

∙ 12/30/2020

Improving Zero-Shot Translation by Disentangling Positional Information

Multilingual neural machine translation has shown the capability of dire...

6 Danni Liu, et al. ∙

research

∙ 12/29/2020

Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms

Active learning (AL) algorithms may achieve better performance with fewe...

15 Yilun Zhou, et al. ∙

research

∙ 10/24/2020

Cross-Modal Transfer Learning for Multilingual Speech-to-Text Translation

We propose an effective approach to utilize pretrained speech and text m...

8 Chau Tran, et al. ∙

research

∙ 10/23/2020

DeFuzz: Deep Learning Guided Directed Fuzzing

Fuzzing is one of the most effective technique to identify potential sof...

11 Xiaogang Zhu, et al. ∙

research

∙ 09/28/2020

Deep Transformers with Latent Depth

The Transformer model has achieved state-of-the-art performance in many ...

0 Xian Li, et al. ∙

research

∙ 08/02/2020

Multilingual Translation with Extensible Multilingual Pretraining and Finetuning

Recent work demonstrates the potential of multilingual pretraining of cr...

0 Yuqing Tang, et al. ∙

research

∙ 06/24/2020

AutoKnow: Self-Driving Knowledge Collection for Products of Thousands of Types

Can one build a knowledge graph (KG) for all products in the world? Know...

12 Xin Luna Dong, et al. ∙

research

∙ 06/16/2020

Cross-lingual Retrieval for Iterative Self-Supervised Training

Recent studies have demonstrated the cross-lingual alignment ability of ...

0 Chau Tran, et al. ∙

research

∙ 06/15/2020

Automatic Validation of Textual Attribute Values in E-commerce Catalog by Learning with Limited Labeled Data

Product catalogs are valuable resources for eCommerce website. In the ca...

0 Yaqing Wang, et al. ∙

research

∙ 06/06/2020

Citing is earlier than Cited?

Generally, it is common that cited papers are earlier than citing papers...

0 Xian Li, et al. ∙

research

∙ 04/30/2020

Recipes for Adapting Pre-trained Monolingual and Multilingual Models to Machine Translation

There has been recent success in pre-training on monolingual data and fi...

0 Asa Cooper Stickland, et al. ∙

research

∙ 02/24/2020

Computation Rate Maximization in Wireless Powered MEC with Spread Spectrum Multiple Access

The integration of mobile edge computing (MEC) and wireless power transf...

0 Yuegui Chen, et al. ∙

research

∙ 01/22/2020

Multilingual Denoising Pre-training for Neural Machine Translation

This paper demonstrates that multilingual denoising pre-training produce...

0 Yinhan Liu, et al. ∙

research

∙ 09/19/2019

Improved Variational Neural Machine Translation by Promoting Mutual Information

Posterior collapse plagues VAEs for text, especially for conditional tex...

0 Arya D. McCarthy, et al. ∙

research

∙ 09/05/2019

FlowSeq: Non-Autoregressive Conditional Sequence Generation with Generative Flow

Most sequence-to-sequence (seq2seq) models are autoregressive; they gene...

0 Xuezhe Ma, et al. ∙

Xian Li

Featured Co-authors

Sign in with Google

Consider DeepAI Pro