b'Caiming Xiong'

research

∙ 09/17/2023

Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles

Previous research in multi-document news summarization has typically con...

0 Kung-Hsiang Huang, et al. ∙

research

∙ 08/24/2023

Exploring the Integration Strategies of Retriever and Large Language Models

The integration of retrieved passages and large language models (LLMs), ...

0 Ye Liu, et al. ∙

research

∙ 08/16/2023

Enhancing Performance on Seen and Unseen Dialogue Scenarios using Retrieval-Augmented End-to-End Task-Oriented System

End-to-end task-oriented dialogue (TOD) systems have achieved promising ...

0 Jianguo Zhang, et al. ∙

research

∙ 08/11/2023

BOLAA: Benchmarking and Orchestrating LLM-augmented Autonomous Agents

The massive successes of large language models (LLMs) encourage the emer...

0 Zhiwei Liu, et al. ∙

research

∙ 08/04/2023

Retroformer: Retrospective Large Language Agents with Policy Gradient Optimization

Recent months have seen the emergence of a powerful new trend in which l...

0 Weiran Yao, et al. ∙

research

∙ 07/19/2023

DialogStudio: Towards Richest and Most Diverse Unified Dataset Collection for Conversational AI

Despite advancements in conversational AI, language models encounter cha...

0 Jianguo Zhang, et al. ∙

research

∙ 07/18/2023

REX: Rapid Exploration and eXploitation for AI Agents

In this paper, we propose an enhanced approach for Rapid Exploration and...

0 Rithesh Murthy, et al. ∙

research

∙ 07/06/2023

Sample-Efficient Learning of POMDPs with Multiple Observations In Hindsight

This paper studies the sample-efficiency of learning in Partially Observ...

0 Jiacheng Guo, et al. ∙

research

∙ 06/07/2023

Transformers as Statisticians: Provable In-Context Learning with In-Context Algorithm Selection

Neural sequence models based on the transformer architecture have demons...

6 Yu Bai, et al. ∙

research

∙ 06/01/2023

Did You Read the Instructions? Rethinking the Effectiveness of Task Definitions in Instruction Learning

Large language models (LLMs) have shown impressive performance in follow...

0 Fan Yin, et al. ∙

research

∙ 06/01/2023

Preference-grounded Token-level Guidance for Language Model Fine-tuning

Aligning language models (LMs) with preferences is an important problem ...

0 Shentao Yang, et al. ∙

research

∙ 05/30/2023

SWiPE: A Dataset for Document-Level Simplification of Wikipedia Pages

Text simplification research has mostly focused on sentence-level simpli...

0 Philippe Laban, et al. ∙

research

∙ 05/23/2023

LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond

With the recent appearance of LLMs in practical settings, having methods...

0 Philippe Laban, et al. ∙

research

∙ 05/18/2023

UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild

Achieving machine autonomy and human control often represent divergent o...

0 Can Qin, et al. ∙

research

∙ 05/12/2023

Answering Complex Questions over Text by Hybrid Question Parsing and Execution

The dominant paradigm of textual question answering systems is based on ...

1 Ye Liu, et al. ∙

research

∙ 05/12/2023

Zero-shot Item-based Recommendation via Multi-task Product Knowledge Graph Pre-Training

Existing recommender systems face difficulties with zero-shot items, i.e...

0 Ziwei Fan, et al. ∙

research

∙ 05/03/2023

CodeGen2: Lessons for Training LLMs on Programming and Natural Languages

Large language models (LLMs) have demonstrated remarkable abilities in r...

3 Erik Nijkamp, et al. ∙

research

∙ 04/03/2023

Efficiently Aligned Cross-Lingual Transfer Learning for Conversational Tasks using Prompt-Tuning

Cross-lingual transfer of language models trained on high-resource langu...

2 Lifu Tu, et al. ∙

research

∙ 03/17/2023

GlueGen: Plug and Play Multi-modal Encoders for X-to-image Generation

Text-to-image (T2I) models based on diffusion processes have achieved re...

0 Can Qin, et al. ∙

research

∙ 03/16/2023

HIVE: Harnessing Human Feedback for Instructional Visual Editing

Incorporating human feedback has been shown to be crucial to align text ...

1 Shu Zhang, et al. ∙

research

∙ 03/10/2023

On the Unlikelihood of D-Separation

Causal discovery aims to recover a causal graph from data generated by i...

0 Itai Feigenbaum, et al. ∙

research

∙ 03/07/2023

Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation

Interpretability and efficiency are two important considerations for the...

13 Yixin Liu, et al. ∙

research

∙ 02/20/2023

Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems

When learning task-oriented dialogue (ToD) agents, reinforcement learnin...

0 Yihao Feng, et al. ∙

research

∙ 02/18/2023

A Comprehensive Survey on Pretrained Foundation Models: A History from BERT to ChatGPT

The Pretrained Foundation Models (PFMs) are regarded as the foundation f...

0 Ce Zhou, et al. ∙

research

∙ 02/17/2023

Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions

Modern news aggregators do the hard work of organizing a large news stre...

0 Philippe Laban, et al. ∙

research

∙ 02/15/2023

Improved Online Conformal Prediction via Strongly Adaptive Online Learning

We study the problem of uncertainty quantification via prediction sets, ...

1 Aadyot Bhatnagar, et al. ∙

research

∙ 02/02/2023

Lower Bounds for Learning in Revealing POMDPs

This paper studies the fundamental limits of reinforcement learning (RL)...

9 Fan Chen, et al. ∙

research

∙ 01/06/2023

Model-Agnostic Hierarchical Attention for 3D Object Detection

Transformers as versatile network architectures have recently seen great...

0 Manli Shu, et al. ∙

research

∙ 12/15/2022

Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation

Human evaluation is the foundation upon which the evaluation of both sum...

12 Yixin Liu, et al. ∙

research

∙ 11/22/2022

Best-k Search Algorithm for Neural Text Generation

Modern natural language generation paradigms require a good decoding str...

0 Jiacheng Xu, et al. ∙

research

∙ 11/14/2022

SPE: Symmetrical Prompt Enhancement for Fact Probing

Pretrained language models (PLMs) have been shown to accumulate factual ...

0 Yiyuan Li, et al. ∙

research

∙ 11/11/2022

Improving Factual Consistency in Summarization with Compression-Based Post-Editing

State-of-the-art summarization models still struggle to be factually con...

0 Alexander R. Fabbri, et al. ∙

research

∙ 11/09/2022

Uni-Parser: Unified Semantic Parser for Question Answering on Knowledge Base and Database

Parsing natural language questions into executable logical forms is a us...

0 Ye Liu, et al. ∙

research

∙ 11/09/2022

Discord Questions: A Computational Approach To Diversity Analysis in News Coverage

There are many potential benefits to news readers accessing diverse sour...

0 Philippe Laban, et al. ∙

research

∙ 10/23/2022

Model ensemble instead of prompt fusion: a sample-specific knowledge transfer method for few-shot prompt tuning

Prompt tuning approaches, which learn task-specific soft prompts for a d...

0 Xiangyu Peng, et al. ∙

research

∙ 10/22/2022

Prompt-Tuning Can Be Much Better Than Fine-Tuning on Cross-lingual Understanding With Multilingual Language Models

Pre-trained multilingual language models show significant performance ga...

0 Lifu Tu, et al. ∙

research

∙ 10/06/2022

Binding Language Models in Symbolic Languages

Though end-to-end neural approaches have recently been dominating NLP ta...

2 Zhoujun Cheng, et al. ∙

research

∙ 08/07/2022

Generating Negative Samples for Sequential Recommendation

To make Sequential Recommendation (SR) successful, recent works focus on...

2 Yongjun Chen, et al. ∙

research

∙ 07/21/2022

BigIssue: A Realistic Bug Localization Benchmark

As machine learning tools progress, the inevitable question arises: How ...

15 Paul Kassianik, et al. ∙

research

∙ 07/18/2022

Marvista: A Human-AI Collaborative Reading Tool

We present Marvista – a human-AI collaborative tool that employs a suite...

0 Xiang 'Anthony' Chen, et al. ∙

research

∙ 06/06/2022

Policy Optimization for Markov Games: Unified Framework and Faster Convergence

This paper studies policy optimization algorithms for multi-agent reinfo...

2 Runyu Zhang, et al. ∙

research

∙ 05/31/2022

MACE: An Efficient Model-Agnostic Framework for Counterfactual Explanation

Counterfactual explanation is an important Explainable AI technique to e...

0 Wenzhuo Yang, et al. ∙

research

∙ 05/18/2022

Modeling Multi-hop Question Answering as Single Sequence Prediction

Fusion-in-decoder (Fid) (Izacard and Grave, 2020) is a generative questi...

3 Semih Yavuz, et al. ∙

research

∙ 05/17/2022

OneAligner: Zero-shot Cross-lingual Transfer with One Rich-Resource Language Pair for Low-Resource Sentence Retrieval

Aligning parallel sentences in multilingual corpora is essential to cura...

0 Tong Niu, et al. ∙

research

∙ 05/13/2022

Near-Negative Distinction: Giving a Second Life to Human Evaluation Datasets

Precisely assessing the progress in natural language generation (NLG) ta...

0 Philippe Laban, et al. ∙

research

∙ 05/03/2022

Quiz Design Task: Helping Teachers Create Quizzes with Automated Question Generation

Question generation (QGen) models are often evaluated with standardized ...

0 Philippe Laban, et al. ∙

research

∙ 04/27/2022

Use All The Labels: A Hierarchical Multi-Label Contrastive Learning Framework

Current contrastive learning frameworks focus on leveraging a single sup...

0 Shu Zhang, et al. ∙

research

∙ 04/11/2022

A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis

Sentiment analysis is an important task in natural language processing. ...

0 Ehsan Hosseini-Asl, et al. ∙

research

∙ 04/05/2022

ELECRec: Training Sequential Recommenders as Discriminators

Sequential recommendation is often considered as a generative task, i.e....

1 Yongjun Chen, et al. ∙

research

∙ 03/25/2022

A Conversational Paradigm for Program Synthesis

Program synthesis strives to generate a computer program as a solution t...

11 Erik Nijkamp, et al. ∙

Caiming Xiong

Featured Co-authors

Sign in with Google

Consider DeepAI Pro