Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting

by   Benjamin Wilson, et al.

We introduce Argoverse 2 (AV2) - a collection of three datasets for perception and forecasting research in the self-driving domain. The annotated Sensor Dataset contains 1,000 sequences of multimodal data, encompassing high-resolution imagery from seven ring cameras, and two stereo cameras in addition to lidar point clouds, and 6-DOF map-aligned pose. Sequences contain 3D cuboid annotations for 26 object categories, all of which are sufficiently-sampled to support training and evaluation of 3D perception models. The Lidar Dataset contains 20,000 sequences of unlabeled lidar point clouds and map-aligned pose. This dataset is the largest ever collection of lidar sensor data and supports self-supervised learning and the emerging task of point cloud forecasting. Finally, the Motion Forecasting Dataset contains 250,000 scenarios mined for interesting and challenging interactions between the autonomous vehicle and other actors in each local scene. Models are tasked with the prediction of future motion for "scored actors" in each scenario and are provided with track histories that capture object location, heading, velocity, and category. In all three datasets, each scenario contains its own HD Map with 3D lane and crosswalk geometry - sourced from data captured in six distinct cities. We believe these datasets will support new and existing machine learning research problems in ways that existing datasets do not. All datasets are released under the CC BY-NC-SA 4.0 license.


page 4

page 9

page 15

page 22

page 23

page 24

page 25

page 26


Argoverse: 3D Tracking and Forecasting with Rich Maps

We present Argoverse – two datasets designed to support autonomous vehic...

Point Cloud Forecasting as a Proxy for 4D Occupancy Forecasting

Predicting how the world can evolve in the future is crucial for motion ...

One Thousand and One Hours: Self-driving Motion Prediction Dataset

We present the largest self-driving dataset for motion prediction to dat...

PixSet : An Opportunity for 3D Computer Vision to Go Beyond Point Clouds With a Full-Waveform LiDAR Dataset

Leddar PixSet is a new publicly available dataset (

Sequential Forecasting of 100,000 Points

Predicting the future is a crucial first step to effective control, sinc...

Unsupervised Sequence Forecasting of 100,000 Points for Unsupervised Trajectory Forecasting

Predicting the future is a crucial first step to effective control, sinc...

Slice Transformer and Self-supervised Learning for 6DoF Localization in 3D Point Cloud Maps

Precise localization is critical for autonomous vehicles. We present a s...

Please sign up or login with your details

Forgot password? Click here to reset