To integrate action recognition methods into autonomous robotic systems,...
Self-supervised representation learning for human action recognition has...
Current prevailing Video Object Segmentation (VOS) methods usually perfo...
Light field cameras can provide rich angular and spatial information to
...
The mobile robot relies on SLAM (Simultaneous Localization and Mapping) ...
Grounded Situation Recognition (GSR) is capable of recognizing and
inter...
Recently, the no-box adversarial attack, in which the attacker lacks acc...
The increasing concerns regarding the privacy of machine learning models...
The digitization of documents allows for wider accessibility and
reprodu...
Domain adaptation is essential for activity recognition, as common
spati...
A semantic map of the road scene, covering fundamental road elements, is...
The Segment Anything Model (SAM) is a new image segmentation tool traine...
This paper raises the new task of Fisheye Semantic Completion (FSC), whe...
Seeing only a tiny part of the whole is not knowing the full circumstanc...
Multimodal fusion can make semantic segmentation more robust. However, f...
In this paper, we tackle the new task of video-based Activated Muscle Gr...
Wearable robotics can improve the lives of People with Visual Impairment...
There is a growing interest in developing unlearnable examples (UEs) aga...
With the human friendly declarative intent policy expression, intent-dri...
Semantic scene understanding with Minimalist Optical Systems (MOS) in mo...
In this paper, we address panoramic semantic segmentation, which provide...
Humans have an innate ability to sense their surroundings, as they can
e...
Failure to timely diagnose and effectively treat depression leads to ove...
It has been observed that the unauthorized use of face recognition syste...
While vision-language pre-training model (VLP) has shown revolutionary
i...
Exploring an unfamiliar indoor environment and avoiding obstacles is
cha...
Autonomous vehicles utilize urban scene segmentation to understand the r...
Local feature matching is a computationally intensive task at the subpix...
The performance of semantic segmentation of RGB images can be advanced b...
Panoramic images with their 360-degree directional view encompass exhaus...
Traditional video-based human activity recognition has experienced remar...
For scene understanding in robotics and automated driving, there is a gr...
Automatically understanding human behaviour allows household robots to
i...
We explore the problem of automatically inferring the amount of kilocalo...
The robustness of semantic segmentation on edge cases of traffic scene i...
Autonomous vehicles clearly benefit from the expanded Field of View (FoV...
Transparent objects, such as glass walls and doors, constitute architect...
Lacking the ability to sense ambient environments effectively, blind and...
Intelligent vehicles clearly benefit from the expanded Field of View (Fo...
In spite of the successful application in many fields, machine learning
...
Data quality is a common problem in machine learning, especially in
high...
Independently exploring unknown spaces or finding objects in an indoor
e...
Common fully glazed facades and transparent objects present architectura...
At the heart of all automated driving systems is the ability to sense th...
Pre-training has enabled many state-of-the-art results on many tasks. In...
Drones have become a common tool, which is utilized in many tasks such a...
In recent years, the robotics community has made substantial progress in...
Convolutional Networks (ConvNets) excel at semantic segmentation and hav...
As the scene information, including objectness and scene type, are impor...
Classic computer vision algorithms, instance segmentation, and semantic
...