Despite the power of Large Language Models (LLMs) like GPT-4, they still...
Open-domain Multi-Document Summarization (ODMDS) is a critical tool for
...
Using large language models (LMs) for query or document expansion can im...
The integration of multi-document pre-training objectives into language
...
Large language models (LLMs) have shown remarkable ability on controllab...
Many real-world applications require surfacing extracted snippets to use...
People primarily consult tables to conduct data analysis or answer speci...
Recent studies have found that summaries generated by large language mod...
In-context learning (ICL) has emerged as a new approach to various natur...
Neural information retrieval often adopts a retrieve-and-rerank framewor...
Diffusion models have emerged as a powerful paradigm for generation,
obt...
While human evaluation remains best practice for accurately judging the
...
The volume of scientific output is creating an urgent need for automated...
Multi-document summarization (MDS) has traditionally been studied assumi...
Learned representations of scientific documents can serve as valuable in...
While research on scientific claim verification has led to the developme...
Training and inference with large neural models is expensive. However, f...
Automated methods have been widely used to identify and analyze mental h...
Automated scientific fact checking is difficult due to the complexity of...
Long-range transformer models have achieved encouraging results on
long-...
We introduce the LongChecker system for scientific claim verification. G...
We present Aspire, a new scientific document similarity model based on
m...
Recently proposed pre-trained generation models achieve strong performan...
Few-shot NLP research is highly active, yet conducted in disjoint resear...
Citation context analysis (CCA) is an important task in natural language...
Readers of academic research papers often read with the goal of answerin...
Managing the data for Information Retrieval (IR) experiments can be
chal...
We introduce a new pretraining approach for language models that are gea...
Prior work in document summarization has mainly focused on generating sh...
Numerous studies have demonstrated the effectiveness of pretrained
conte...
With worldwide concerns surrounding the Severe Acute Respiratory Syndrom...
With worldwide concerns surrounding the Severe Acute Respiratory Syndrom...
We introduce TLDR generation for scientific papers, a new automatic
summ...
We introduce the task of scientific fact-checking. Given a corpus of
sci...
Representation learning is a critical ingredient for natural language
pr...
Representation learning is a critical ingredient for natural language
pr...
Transformer-based models are unable to process long sequences due to the...
Medical errors are a major public health concern and a leading cause of ...
Dietary supplements are used by a large portion of the population, but
i...
As a step toward better document-level understanding, we explore
classif...
Automatically generating accurate summaries from clinical reports could ...
Although considerable attention has been given to neural ranking
archite...
Although considerable attention has been given to neural ranking
archite...
Identifying the intent of a citation in scientific papers (e.g., backgro...
Obtaining large-scale annotated data for NLP tasks in the scientific dom...
Many questions cannot be answered simply; their answers must include num...
Self-reported diagnosis statements have been widely employed in studying...
Mental health is a significant and growing public health concern. As lan...
Complex answer retrieval (CAR) is the process of retrieving answers to
q...
In recent years, online communities have formed around suicide and self-...