As the model size of pre-trained language models (PLMs) grows rapidly, f...
With the continuous emergence of Chinese Large Language Models (LLMs), h...
For years, model performance in machine learning obeyed a power-law r...
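(The sentence above is truncated; it presumably refers to the familiar power-law relationship between loss and scale. As an illustrative reminder, and not necessarily the paper's exact parameterization, such scaling laws are commonly written as

    L(N) = (N_c / N)^{\alpha_N},

where L is the test loss, N the scale variable (parameters, data, or compute), and N_c, \alpha_N are fitted constants.)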
Existing multimodal conversation agents have shown impressive abilities ...
Recent advances in large-scale pre-training provide large models with th...
Pre-trained language models (PLMs) like BERT have made significant progr...
Structured pruning has been extensively studied on monolingual pre-train...
Despite recent progress in open-domain dialogue evaluation, how to devel...
This paper describes the submissions of the NiuTrans Team to the WNGT 20...
Improving Transformer efficiency has become increasingly attractive rece...
Encoder pre-training is promising in end-to-end Speech Translation (ST),...
The large attention-based encoder-decoder network (Transformer) has beco...
Unsupervised Bilingual Dictionary Induction methods based on the initial...
Knowledge distillation has been proven to be effective in model accelera...
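(For context on the technique this abstract names: in the standard formulation, which may differ from the paper's specific variant, knowledge distillation trains a compact student to match a larger teacher's temperature-softened predictions,

    \mathcal{L}_{\mathrm{KD}} = (1-\lambda)\,\mathrm{CE}(y, p_s) + \lambda T^2\,\mathrm{KL}\big(p_t^{(T)} \,\|\, p_s^{(T)}\big),

where p^{(T)} = \mathrm{softmax}(z/T) for teacher (t) and student (s) logits z, and \lambda, T are hyperparameters.)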
8-bit integer inference, as a promising direction in reducing both the l...
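(To make the direction concrete, below is a minimal sketch of symmetric per-tensor INT8 quantization, one common scheme behind 8-bit integer inference; the function names are illustrative assumptions, not the paper's implementation.

    import numpy as np

    def quantize_int8(x: np.ndarray):
        # Symmetric per-tensor quantization: map floats onto [-127, 127].
        scale = max(float(np.abs(x).max()) / 127.0, 1e-12)  # avoid div-by-zero
        q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
        return q, scale

    # Example: run the matmul in integers (accumulate in int32), rescale once.
    a = np.random.randn(4, 8).astype(np.float32)
    b = np.random.randn(8, 3).astype(np.float32)
    qa, sa = quantize_int8(a)
    qb, sb = quantize_int8(b)
    y = (qa.astype(np.int32) @ qb.astype(np.int32)).astype(np.float32) * (sa * sb)
    # y approximates a @ b while the GEMM itself used only 8/32-bit integers.

)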
Neural machine translation systems require a number of stacked layers fo...
Though early successes of Statistical Machine Translation (SMT) systems ...