The contact-free sensing nature of Wi-Fi has been leveraged to achieve
p...
Wi-Fi signals may help realize low-cost and non-invasive human sensing, ...
Multi-Agent Path Finding (MAPF) is a fundamental problem in robotics tha...
Having been studied for more than a decade, Wi-Fi human sensing still fa...
Hand Pose Estimation (HPE) is crucial to many applications, but conventi...
In recent years, the popular Transformer architecture has achieved great...
We present the All-Seeing (AS) project: a large-scale data and model for...
The combination of audio and vision has long been a topic of interest in...
The Flatland Challenge, which was first held in 2019 and reported in Neu...
Diffusion models have attracted significant attention due to their remar...
Pseudo-labels are widely employed in weakly supervised 3D segmentation t...
This paper presents a novel transformer architecture for graph represent...
Large language models (LLMs) have notably accelerated progress towards
a...
Multi-Agent Path Finding (MAPF) is an important core problem for many ne...
We present an interactive visual framework named InternGPT, or iGPT for
...
Multimodal emotion recognition identifies human emotions from various da...
Detecting Resident Space Objects (RSOs) and preventing collisions with o...
We propose a simple, efficient, yet powerful framework for dense visual
...
This paper introduces a new immersed boundary (IB) method for viscous
in...
Object detection with on-board sensors (e.g., lidar, radar, and camera) ...
In this report, we present our champion solution to the WSDM2023 Toloka
...
The application of Natural Language Processing (NLP) to specialized doma...
Self-supervised facial representation has recently attracted increasing
...
In this report, we present our champion solutions to five tracks at Ego4...
Compared to the great progress of large-scale vision transformers (ViTs)...
Efficient trajectory generation in complex dynamic environment stills re...
Unmanned surface vessels (USVs) are widely used in ocean exploration and...
Existing trackers usually select a location or proposal with the maximum...
This work investigates a simple yet powerful adapter for Vision Transfor...
Point cloud segmentation is fundamental in understanding 3D environments...
Linear regression is a supervised method that has been widely used in
cl...
Whereas adversarial training can be useful against specific adversarial
...
Crucial for healthcare and biomedical applications, respiration monitori...
We propose an accurate and efficient scene text detection framework, ter...
Multi-agent Pickup and Delivery (MAPD) is a challenging industrial probl...
In recent years, radio frequency (RF) sensing has gained increasing
popu...
Given the significant amount of time people spend in vehicles, health is...
Human Activity Recognition (HAR) plays a critical role in a wide range o...
Being able to see into walls is crucial for diagnostics of building heal...
It is well-known DNNs would generate different prediction results even g...
The recent success of Transformer has provided a new direction to variou...
The Flatland competition aimed at finding novel approaches to solve the
...
We present an extremely simple Ultra-Resolution Style Transfer framework...
During Multi-Agent Path Finding (MAPF) problems, agents can be delayed b...
Action recognition, which is formulated as a task to identify various hu...
Multi-Agent Path Finding has been widely studied in the past few years d...
We introduce a novel neural network-based BRDF model and a Bayesian fram...
Fast Style Transfer is a series of Neural Style Transfer algorithms that...
Accurate knowledge of the distribution system topology and parameters is...
Modern two-stage object detectors generally require excessively large mo...