Reasoning about Actions over Visual and Linguistic Modalities: A Survey

07/15/2022
by   Shailaja Keyur Sampat, et al.
0

'Actions' play a vital role in how humans interact with the world and enable them to achieve desired goals. As a result, most common sense (CS) knowledge for humans revolves around actions. While 'Reasoning about Actions Change' (RAC) has been widely studied in the Knowledge Representation community, it has recently piqued the interest of NLP and computer vision researchers. This paper surveys existing tasks, benchmark datasets, various techniques and models, and their respective performance concerning advancements in RAC in the vision and language domain. Towards the end, we summarize our key takeaways, discuss the present challenges facing this research area, and outline potential directions for future research.

READ FULL TEXT

page 3

page 4

research
02/23/2021

Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and actions of others

To achieve human-like common sense about everyday life, machine learning...
research
06/18/2023

Advancing Biomedicine with Graph Representation Learning: Recent Progress, Challenges, and Future Directions

Graph representation learning (GRL) has emerged as a pivotal field that ...
research
07/22/2019

Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods

Integration of vision and language tasks has seen a significant growth i...
research
11/25/2022

TRAC: A Textual Benchmark for Reasoning about Actions and Change

Reasoning about actions and change (RAC) is essential to understand and ...
research
12/19/2022

Reasoning with Language Model Prompting: A Survey

Reasoning, as an essential ability for complex problem-solving, can prov...
research
12/27/2022

A Survey on Table-and-Text HybridQA: Concepts, Methods, Challenges and Future Directions

Table-and-text hybrid question answering (HybridQA) is a widely used and...
research
02/14/2023

UKnow: A Unified Knowledge Protocol for Common-Sense Reasoning and Vision-Language Pre-training

This work presents a unified knowledge protocol, called UKnow, which fac...

Please sign up or login with your details

Forgot password? Click here to reset