Yizhuo Li

research

∙ 07/13/2023

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

This paper introduces InternVid, a large-scale video-centric multimodal ...

0 Yi Wang, et al. ∙

research

∙ 05/10/2023

VideoChat: Chat-Centric Video Understanding

In this study, we initiate an exploration into video understanding by in...

0 Kunchang Li, et al. ∙

research

∙ 03/28/2023

Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Video Foundation Models (VFMs) have received limited exploration due to ...

0 Kunchang Li, et al. ∙

research

∙ 12/06/2022

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

The foundation models have recently shown excellent performance on a var...

4 Yi Wang, et al. ∙

research

∙ 11/17/2022

UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer

Learning discriminative spatiotemporal representation is the key problem...

0 Kunchang Li, et al. ∙

research

∙ 11/17/2022

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

In this report, we present our champion solutions to five tracks at Ego4...

0 Guo Chen, et al. ∙

research

∙ 02/14/2022

HAKE: A Knowledge Engine Foundation for Human Activity Understanding

Human activity understanding is of widespread interest in artificial int...

0 Yong-Lu Li, et al. ∙

research

∙ 03/21/2021

PGT: A Progressive Method for Training Models on Long Videos

Convolutional video models have an order of magnitude larger computation...

17 Bo Pang, et al. ∙

research

∙ 12/14/2020

TDAF: Top-Down Attention Framework for Vision Tasks

Human attention mechanisms often work in a top-down manner, yet it is no...

14 Bo Pang, et al. ∙

research

∙ 10/30/2020

HOI Analysis: Integrating and Decomposing Human-Object Interaction

Human-Object Interaction (HOI) consists of human, object and implicit in...

0 Yong-Lu Li, et al. ∙

research

∙ 06/10/2020

TubeTK: Adopting Tubes to Track Multi-Object in a One-Step Training Model

Multi-object tracking is a fundamental vision problem that has been stud...

0 Bo Pang, et al. ∙

Yizhuo Li

Featured Co-authors

Sign in with Google

Consider DeepAI Pro