Differentially private stochastic gradient descent (DP-SGD) adds noise t...
We introduce MAmmoTH, a series of open-source large language models (LLM...
We introduce TacoBot, a user-centered task-oriented digital assistant
de...
We explore testing the reasoning ability of large language models (LLMs)...
A recent focus of large language model (LLM) development, as exemplified...
Privacy concerns have attracted increasing attention in data-driven prod...
We present TacoBot, a task-oriented dialogue system built for the inaugu...
We consider the problem of pretraining a two-stage open-domain question
...
Synthesizing QA pairs with a question generator (QG) on the target domai...
Texts convey sophisticated knowledge. However, texts also convey sensiti...
Clinical question answering (QA) aims to automatically answer questions ...
We present a large challenging dataset, COUGH, for COVID-19 FAQ retrieva...
De-identification is the task of identifying protected health informatio...
Machine reading comprehension has made great progress in recent years ow...
Annotating datasets for question answering (QA) tasks is very costly, as...
Document-level machine translation manages to outperform sentence level
...
Unstructured clinical texts contain rich health-related information. To
...
Motivation: Graph embedding learning which aims to automatically learn
l...