Reward models (RMs) are crucial in aligning large language models (LLMs)...
Although dominant in natural language processing, transformer-based mode...
Transformers are central to recent successes in natural language process...
The explosive growth of language models and their applications have led ...
Pretraining on a large-scale corpus has become a standard method to buil...
The mixture proportions of pretraining data domains (e.g., Wikipedia, bo...
We propose Conditional Adapter (CoDA), a parameter-efficient transfer
le...
While large language models (LLM) have made impressive progress in natur...
Neural network (NN) algorithms have become the dominant tool in visual o...
With the advent of the Internet of Things, nanoelectronic devices or
mem...
While large language models (LLMs) have demonstrated impressive capabili...
Large language models have been shown to achieve remarkable performance
...
Sparsely-activated Mixture-of-experts (MoE) models allow the number of
p...
Scale has opened new frontiers in natural language processing – but at a...
Scaling language models with more data, compute and parameters has drive...
This paper explores a simple method for improving the zero-shot learning...
We propose a novel framework for modeling the interaction between graphi...
There is a growing interest in creating tools to assist in clinical note...
Clinical forecasting based on electronic medical records (EMR) can uncov...
Recently we proposed the Span Attribute Tagging (SAT) Model (Du et al., ...
This paper presents a novel framework, MGNER, for Multi-Grained Named En...
This paper describes novel models tailored for a new application, that o...
Being able to automatically discover synonymous entities from a large
fr...
Being able to recognize words as slots and detect the intent of an utter...
Answer selection and knowledge base question answering (KBQA) are two
im...
Generalized Chinese Remainder Theorem (CRT) has been shown to be a power...
Social goods, such as healthcare, smart city, and information networks, ...
Question answering (QA) has achieved promising progress recently. Howeve...
The past few years have witnessed the flourishing of crowdsourced medica...
Medical knowledge graph is the core component for various medical
applic...
Radiation therapy (RT) is a common treatment for head and neck (HaN) can...
Methods: Our deep learning model, called AnatomyNet, segments OARs from ...
The Internet has revolutionized healthcare by offering medical informati...
Variational inference provides approximations to the computationally
int...
The typical algorithmic problem in viral marketing aims to identify a se...