High-quality instruction-tuning data is critical to improving LLM
capabi...
While large language models (LLMs) have demonstrated remarkable capabili...
Safety lies at the core of the development of Large Language Models (LLM...
Modeling discourse – the linguistic phenomena that go beyond individual
...
One challenge in text-to-image (T2I) generation is the inadvertent refle...
Traditional multitask learning methods basically can only exploit common...
Most existing text generation models follow the sequence-to-sequence
par...
Although instruction-tuned large language models (LLMs) have exhibited
r...
Sentence embedding is one of the most fundamental tasks in Natural Langu...
Diffusion models developed on top of powerful text-to-image generation m...
Recent advances in large language models have enabled them to reach a le...
We introduce a frustratingly simple, super efficient and surprisingly
ef...
Zero pronouns (ZPs) are frequently omitted in pro-drop languages (e.g.
C...
Generating proper embedding of sentences through an unsupervised way is
...
Large language models (LLMs) like ChatGPT and GPT-4 have exhibited remar...
Large language models (LLMs) such as Chat-GPT can produce coherent, cohe...
The emergence of ChatGPT has recently garnered significant attention fro...
The spread of rumors along with breaking events seriously hinders the tr...
In this technical report, we introduce Effidit (Efficient and Intelligen...
People perceive the world with multiple senses (e.g., through hearing so...
We present SkillNet-NLG, a sparsely activated approach that handles many...
Chinese BERT models achieve remarkable progress in dealing with grammati...
Towards building intelligent dialogue agents, there has been a growing
i...
Current practices in metric evaluation focus on one single dataset, e.g....
In this paper, we present a substantial step in better understanding the...
We present a Chinese BERT model dubbed MarkBERT that uses word informati...
Structured prediction models aim at solving a type of problem where the
...
This paper presents a Pathways approach to handle many tasks at once. Ou...
While GPT has become the de-facto method for text generation tasks, its
...
The standard BERT adopts subword-based tokenization, which may break a w...
Paraphrase generation is an important NLP task that has achieved signifi...
Recently, it has been shown that natural language processing (NLP) model...
Pre-training (PT) and back-translation (BT) are two simple and powerful
...
In many situations (e.g., distant supervision), unlabeled entity problem...
Previous studies have shown that initializing neural machine translation...
We investigate the problem of Chinese Grammatical Error Correction (CGEC...
Self-training has proven effective for improving NMT performance by
augm...
Computer-aided translation (CAT), the use of software to assist a human
...
The lack of reliable automatic evaluation metrics is a major impediment ...
Automatic machine translation is super efficient to produce translations...
This technique report introduces TexSmart, a text understanding system t...
We study the learning of a matching model for dialogue response selectio...
The multiplayer online battle arena (MOBA) games have become increasingl...
In many scenarios, named entity recognition (NER) models severely suffer...
In this work, we present Lexical Unit Analysis (LUA), a framework for ge...
We address hypernymy detection, i.e., whether an is-a relationship exist...
There have been significant efforts to interpret the encoder of
Transfor...
Many efforts have been devoted to extracting constituency trees from
pre...
With the rapid prevalence and explosive development of MOBA esports
(Mul...
Recently many efforts have been devoted to interpreting the black-box NM...