As language models continue to be integrated into applications of person...
Online data streams make training machine learning models hard because o...
Previous work has examined how debiasing language models affect downstre...
Conversational surveys, where an agent asks open-ended questions through...
Hierarchical domain-specific classification schemas (or subject heading
...
In times of crisis, identifying the essential needs is a crucial step to...
Popular metrics used for evaluating image captioning systems, such as BL...
This paper presents a new metric called TIGEr for the automatic evaluati...