GLUECons: A Generic Benchmark for Learning Under Constraints

02/16/2023
by   Hossein Rajaby Faghihi, et al.
0

Recent research has shown that integrating domain knowledge into deep learning architectures is effective – it helps reduce the amount of required data, improves the accuracy of the models' decisions, and improves the interpretability of models. However, the research community is missing a convened benchmark for systematically evaluating knowledge integration methods. In this work, we create a benchmark that is a collection of nine tasks in the domains of natural language processing and computer vision. In all cases, we model external knowledge as constraints, specify the sources of the constraints for each task, and implement various models that use these constraints. We report the results of these models using a new set of extended evaluation criteria in addition to the task performances for a more in-depth analysis. This effort provides a framework for a more comprehensive and systematic comparison of constraint integration techniques and for identifying related research challenges. It will facilitate further research for alleviating some problems of state-of-the-art neural models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/27/2021

DomiKnowS: A Library for Integration of Symbolic Domain Knowledge in Deep Learning

We demonstrate a library for the integration of domain knowledge in deep...
research
07/22/2019

Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods

Integration of vision and language tasks has seen a significant growth i...
research
08/19/2023

FinEval: A Chinese Financial Domain Knowledge Evaluation Benchmark for Large Language Models

Large language models (LLMs) have demonstrated exceptional performance i...
research
05/23/2020

Learning Constraints for Structured Prediction Using Rectifier Networks

Various natural language processing tasks are structured prediction prob...
research
08/12/2022

USB: A Unified Semi-supervised Learning Benchmark

Semi-supervised learning (SSL) improves model generalization by leveragi...
research
05/30/2023

Beyond One-Model-Fits-All: A Survey of Domain Specialization for Large Language Models

Large language models (LLMs) have significantly advanced the field of na...
research
05/29/2018

Automating Personnel Rostering by Learning Constraints Using Tensors

Many problems in operations research require that constraints be specifi...

Please sign up or login with your details

Forgot password? Click here to reset