Textual graphs (TGs) are graphs whose nodes correspond to text (sentence...
In this work, we study multi-source test-time model adaptation from user...
Chain-of-Thought and Program-Aided Language Models represent two distinc...
We endow Large Language Models (LLMs) with fine-grained self-evaluation ...
Many training algorithms of a deep neural network can be interpreted as
...
We present a simple self-training method that achieves 87.4
on ImageNet,...
Despite its success, deep learning still needs large labeled datasets to...
The human mind is a powerful multifunctional knowledge storage and manag...
Mixture of Softmaxes (MoS) has been shown to be effective at addressing ...
In this work, we study the credit assignment problem in reward augmented...
Cloze test is widely adopted in language exams to evaluate students' lan...
Learning meaningful representations that maintain the content necessary ...
Knowledge bases are important resources for a variety of natural languag...
We present RACE, a new dataset for benchmark evaluation of methods in th...
Dialogue state tracking (DST) is a process to estimate the distribution ...