To integrate action recognition methods into autonomous robotic systems,...
Self-supervised representation learning for human action recognition has...
Recently, transformer-based methods have shown exceptional performance i...
Key-point-based scene understanding is fundamental for autonomous drivin...
Audio-guided Video Object Segmentation (A-VOS) and Referring Video Objec...
The recently emerging task of markup-to-image generation poses greater challenges ...
Light field cameras can provide rich angular and spatial information to ...
The mobile robot relies on SLAM (Simultaneous Localization and Mapping) ...
Grounded Situation Recognition (GSR) is capable of recognizing and inter...
Event cameras are capable of responding to log-brightness changes in mic...
High-quality panoramic images with a Field of View (FoV) of 360-degree a...
In this paper, we propose LF-PGVIO, a Visual-Inertial-Odometry (VIO) fra...
The integration of diverse visual prompts like clicks, scribbles, and bo...
Domain adaptive semantic segmentation enables robust pixel-wise understa...
Domain adaptation is essential for activity recognition, as common spati...
Transformer-based methods have demonstrated superior performance for mon...
Interactive Image Segmentation (IIS) has emerged as a promising techniqu...
A semantic map of the road scene, covering fundamental road elements, is...
Visual place recognition has received increasing attention in recent yea...
This paper raises the new task of Fisheye Semantic Completion (FSC), whe...
Seeing only a tiny part of the whole is not knowing the full circumstanc...
Multimodal fusion can make semantic segmentation more robust. However, f...
In this paper, we tackle the new task of video-based Activated Muscle Gr...
Wearable robotics can improve the lives of People with Visual Impairment...
Limited by hardware cost and system size, a camera's Field-of-View (FoV) i...
Semantic scene understanding with Minimalist Optical Systems (MOS) in mo...
Loop closure is an important component of Simultaneous Localization and ...
In this paper, we address panoramic semantic segmentation, which provide...
Humans have an innate ability to sense their surroundings, as they can e...
Failure to timely diagnose and effectively treat depression leads to ove...
In this work, we introduce panoramic panoptic segmentation, as the most ...
Panoramic Annular Lens (PAL), composed of few lenses, has great potentia...
Human Pose Estimation (HPE) based on RGB images has experienced a rapid ...
With the rapid development of high-speed communication and artificial in...
Structured Visual Content (SVC) such as graphs, flow charts, or the like...
Driver observation models are rarely deployed under perfect conditions. ...
Exploring an unfamiliar indoor environment and avoiding obstacles is cha...
Autonomous vehicles utilize urban scene segmentation to understand the r...
Local feature matching is a computationally intensive task at the subpix...
The performance of semantic segmentation of RGB images can be advanced b...
Panoramic images with their 360-degree directional view encompass exhaus...
Traditional video-based human activity recognition has experienced remar...
For scene understanding in robotics and automated driving, there is a gr...
Optical flow estimation is a basic task in self-driving and robotics sys...
Visual-inertial-odometry has attracted extensive attention in the field ...
Automatically understanding human behaviour allows household robots to i...
Optical flow estimation is an essential task in self-driving systems, wh...
We explore the problem of automatically inferring the amount of kilocalo...
The robustness of semantic segmentation on edge cases of traffic scene i...
Traditional frame-based cameras inevitably suffer from motion blur due t...