Bo Xiong

research

∙ 01/24/2022

Box Embeddings for the Description Logic EL++

Recently, various methods for representation learning on Knowledge Bases...

0 Bo Xiong, et al. ∙

research

∙ 01/20/2022

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition

While today's video recognition systems parse snapshots or short clips a...

10 Chao-Yuan Wu, et al. ∙

research

∙ 12/02/2021

Improved Multiscale Vision Transformers for Classification and Detection

In this paper, we study Multiscale Vision Transformers (MViT) as a unifi...

21 Yanghao Li, et al. ∙

research

∙ 11/18/2021

PyTorchVideo: A Deep Learning Library for Video Understanding

We introduce PyTorchVideo, an open-source deep-learning library that pro...

295 Haoqi Fan, et al. ∙

research

∙ 06/06/2021

Semi-Riemannian Graph Convolutional Networks

Graph Convolutional Networks (GCNs) are typically studied through the le...

0 Bo Xiong, et al. ∙

research

∙ 04/29/2021

A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning

We present a large-scale study on unsupervised spatiotemporal representa...

0 Christoph Feichtenhofer, et al. ∙

research

∙ 04/22/2021

Multiscale Vision Transformers

We present Multiscale Vision Transformers (MViT) for video and image rec...

9 Haoqi Fan, et al. ∙

research

∙ 04/16/2021

Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos

We introduce an approach for pre-training egocentric video models using ...

0 Yanghao Li, et al. ∙

research

∙ 04/01/2021

Multiview Pseudo-Labeling for Semi-supervised Learning from Video

We present a multiview pseudo-labeling approach to video learning, a nov...

0 Bo Xiong, et al. ∙

research

∙ 11/18/2020

MOFA: Modular Factorial Design for Hyperparameter Optimization

Automated hyperparameter optimization (HPO) has shown great power in man...

0 Bo Xiong, et al. ∙

research

∙ 03/03/2019

Less is More: Learning Highlight Detection from Video Duration

Highlight detection has the potential to significantly ease video browsi...

0 Bo Xiong, et al. ∙

research

∙ 08/11/2018

Pixel Objectness: Learning to Segment Generic Objects Automatically in Images and Videos

We propose an end-to-end learning framework for segmenting generic objec...

0 Bo Xiong, et al. ∙

research

∙ 03/31/2018

Snap Angle Prediction for 360° Panorama

360° panoramas are a rich medium, yet notoriously difficult to visualiz...

0 Bo Xiong, et al. ∙

research

∙ 12/12/2017

Im2Flow: Motion Hallucination from Static Images for Action Recognition

Existing methods to recognize actions in static images take the images a...

0 Ruohan Gao, et al. ∙

research

∙ 01/19/2017

FusionSeg: Learning to combine motion and appearance for fully automatic segmentation of generic objects in videos

We propose an end-to-end learning framework for segmenting generic objec...

0 Suyog Dutt Jain, et al. ∙

research

∙ 01/19/2017

Pixel Objectness

We propose an end-to-end learning framework for generating foreground ob...

0 Suyog Dutt Jain, et al. ∙
