Language Prompt for Autonomous Driving

09/08/2023
by   Dongming Wu, et al.
0

A new trend in the computer vision community is to capture objects of interest following flexible human command represented by a natural language prompt. However, the progress of using language prompts in driving scenarios is stuck in a bottleneck due to the scarcity of paired prompt-instance data. To address this challenge, we propose the first object-centric language prompt set for driving scenes within 3D, multi-view, and multi-frame space, named NuPrompt. It expands Nuscenes dataset by constructing a total of 35,367 language descriptions, each referring to an average of 5.3 object tracks. Based on the object-text pairs from the new benchmark, we formulate a new prompt-based driving task, \ie, employing a language prompt to predict the described object trajectory across views and frames. Furthermore, we provide a simple end-to-end baseline model based on Transformer, named PromptTrack. Experiments show that our PromptTrack achieves impressive performance on NuPrompt. We hope this work can provide more new insights for the autonomous driving community. Dataset and Code will be made public at \href{https://github.com/wudongming97/Prompt4Driving}{https://github.com/wudongming97/Prompt4Driving}.

READ FULL TEXT

page 1

page 3

page 4

page 6

page 8

research
04/12/2022

DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection

Autonomous driving faces great safety challenges for a lack of global pe...
research
05/17/2023

Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenes

Modern autonomous driving systems are typically divided into three main ...
research
07/14/2023

Drive Like a Human: Rethinking Autonomous Driving with Large Language Models

In this paper, we explore the potential of using a large language model ...
research
07/14/2023

Linking vision and motion for self-supervised object-centric perception

Object-centric representations enable autonomous driving algorithms to r...
research
11/13/2018

Deep Object Centric Policies for Autonomous Driving

While learning visuomotor skills in an end-to-end manner is appealing, d...
research
04/27/2022

Self-Driving Car Steering Angle Prediction: Let Transformer Be a Car Again

Self-driving vehicles are expected to be a massive economic influence ov...
research
07/04/2023

FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation

This technical report summarizes the winning solution for the 3D Occupan...

Please sign up or login with your details

Forgot password? Click here to reset