b'Tao Kong'

research

∙ 08/07/2023

MOMA-Force: Visual-Force Imitation for Real-World Mobile Manipulation

In this paper, we present a novel method for mobile manipulators to perf...

0 Taozheng Yang, et al. ∙

research

∙ 08/07/2023

Exploring Visual Pre-training for Robot Manipulation: Datasets, Models and Methods

Visual pre-training with large-scale real-world data has made great prog...

0 Ya Jing, et al. ∙

research

∙ 07/19/2023

ClickSeg: 3D Instance Segmentation with Click-Level Weak Annotations

3D instance segmentation methods often require fully-annotated dense lab...

0 Leyao Liu, et al. ∙

research

∙ 07/05/2023

What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?

Recent advancements in Large Language Models (LLMs) such as GPT4 have di...

0 Yan Zeng, et al. ∙

research

∙ 03/20/2023

Learning to Explore Informative Trajectories and Samples for Embodied Perception

We are witnessing significant progress on perception models, specificall...

0 Ya Jing, et al. ∙

research

∙ 10/24/2022

Towards Unifying Reference Expression Generation and Comprehension

Reference Expression Generation (REG) and Comprehension (REC) are two hi...

5 Duo Zheng, et al. ∙

research

∙ 10/03/2022

Generative Category-Level Shape and Pose Estimation with Semantic Primitives

Empowering autonomous agents with 3D understanding for daily objects is ...

3 Guanglin Li, et al. ∙

research

∙ 09/08/2022

Exploring Target Representations for Masked Autoencoders

Masked autoencoders have become popular training paradigms for self-supe...

0 Xingbin Liu, et al. ∙

research

∙ 07/05/2022

3D Part Assembly Generation with Instance Encoded Transformer

It is desirable to enable robots capable of automatic assembly. Structur...

0 Rufeng Zhang, et al. ∙

research

∙ 04/12/2022

Learning Design and Construction with Varying-Sized Materials via Prioritized Memory Resets

Can a robot autonomously learn to design and construct a bridge from var...

0 Yunfei Li, et al. ∙

research

∙ 02/08/2022

Navigating to Objects in Unseen Environments by Distance Prediction

Object Goal Navigation (ObjectNav) task is to navigate an agent to an ob...

0 Minzhao Zhu, et al. ∙

research

∙ 11/15/2021

iBOT: Image BERT Pre-Training with Online Tokenizer

The success of language Transformers is primarily attributed to the pret...

0 Jinghao Zhou, et al. ∙

research

∙ 10/14/2021

Self-Supervised Learning by Estimating Twin Class Distributions

We present TWIST, a novel self-supervised representation learning method...

12 Feng Wang, et al. ∙

research

∙ 08/26/2021

ICM-3D: Instantiated Category Modeling for 3D Instance Segmentation

Separating 3D point clouds into individual instances is an important tas...

0 Ruihang Chu, et al. ∙

research

∙ 08/05/2021

Learning to Design and Construct Bridge without Blueprint

Autonomous assembly has been a desired functionality of many intelligent...

0 Yunfei Li, et al. ∙

research

∙ 08/05/2021

Simultaneous Semantic and Collision Learning for 6-DoF Grasp Pose Estimation

Grasping in cluttered scenes has always been a great challenge for robot...

6 Yiming Li, et al. ∙

research

∙ 06/30/2021

SOLO: A Simple Framework for Instance Segmentation

Compared to many other dense prediction tasks, e.g., semantic segmentati...

0 Xinlong Wang, et al. ∙

research

∙ 06/10/2021

Adversarial Option-Aware Hierarchical Imitation Learning

It has been a challenge to learning skills for an agent from long-horizo...

6 Mingxuan Jing, et al. ∙

research

∙ 03/31/2021

Scale-aware Automatic Augmentation for Object Detection

We propose Scale-aware AutoAug to learn data augmentation policies for o...

0 Yukang Chen, et al. ∙

research

∙ 03/30/2021

Locate then Segment: A Strong Pipeline for Referring Image Segmentation

Referring image segmentation aims to segment the objects referred by a n...

0 Ya Jing, et al. ∙

research

∙ 12/31/2020

TransTrack: Multiple-Object Tracking with Transformer

Multiple-object tracking(MOT) is mostly dominated by complex and multi-s...

20 Peize Sun, et al. ∙

research

∙ 11/25/2020

Sparse R-CNN: End-to-End Object Detection with Learnable Proposals

We present Sparse R-CNN, a purely sparse method for object detection in ...

0 Peize Sun, et al. ∙

research

∙ 11/18/2020

Dense Contrastive Learning for Self-Supervised Visual Pre-Training

To date, most existing self-supervised learning methods are designed and...

2 Xinlong Wang, et al. ∙

research

∙ 03/23/2020

SOLOv2: Dynamic, Faster and Stronger

In this work, we aim at building a simple, direct, and fast instance seg...

24 Xinlong Wang, et al. ∙

research

∙ 12/10/2019

SOLO: Segmenting Objects by Locations

We present a new, embarrassingly simple approach to instance segmentatio...

27 Xinlong Wang, et al. ∙

research

∙ 09/17/2019

Deep Point-wise Prediction for Action Temporal Proposal

Detecting actions in videos is an important yet challenging task. Previo...

0 Luxuan Li, et al. ∙

research

∙ 09/17/2019

Task-Aware Monocular Depth Estimation for 3D Object Detection

Monocular depth estimation enables 3D perception from a single 2D image,...

9 Xinlong Wang, et al. ∙

research

∙ 04/25/2019

Attention-based Transfer Learning for Brain-computer Interface

Different functional areas of the human brain play different roles in br...

0 Chuanqi Tan, et al. ∙

research

∙ 04/08/2019

FoveaBox: Beyond Anchor-based Object Detector

We present FoveaBox, an accurate, flexible and completely anchor-free fr...

0 Tao Kong, et al. ∙

research

∙ 01/19/2019

Consistent Optimization for Single-Shot Object Detection

We present consistent optimization for single stage object detection. Pr...

0 Tao Kong, et al. ∙

research

∙ 12/04/2018

Zoom-In-to-Check: Boosting Video Interpolation via Instance-level Discrimination

We propose a light-weight video frame interpolation algorithm. Our key i...

0 Liangzhe Yuan, et al. ∙

research

∙ 08/24/2018

Deep Feature Pyramid Reconfiguration for Object Detection

State-of-the-art object detectors usually learn multi-scale representati...

0 Tao Kong, et al. ∙

research

∙ 08/06/2018

A Survey on Deep Transfer Learning

As a new classification platform, deep learning has recently received in...

0 Chuanqi Tan, et al. ∙

research

∙ 07/06/2017

RON: Reverse Connection with Objectness Prior Networks for Object Detection

We present RON, an efficient and effective framework for generic object ...

0 Tao Kong, et al. ∙

research

∙ 04/03/2016

HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection

Almost all of the current top-performing object detection networks emplo...

0 Tao Kong, et al. ∙

Tao Kong

Featured Co-authors

Sign in with Google

Consider DeepAI Pro