QUICK REVIEW

[論文レビュー] Argoverse: 3D Tracking and Forecasting with Rich Maps

Ming-Fang Chang, John Lambert|arXiv (Cornell University)|Nov 6, 2019

Autonomous Vehicle Technology and Safety参考文献 53被引用数 157

ひとこと要約

Argoverse は、同期 LiDAR と 360°ステレオ画像＋豊富なHDマップを備えた3D追跡と、採掘された軌跡を用いたモーション予測の2つの大規模自動運転データセットを導入し、HDマップの文脈が3D追跡と予測性能を向上させることを示しています。

ABSTRACT

We present Argoverse -- two datasets designed to support autonomous vehicle machine learning tasks such as 3D tracking and motion forecasting. Argoverse was collected by a fleet of autonomous vehicles in Pittsburgh and Miami. The Argoverse 3D Tracking dataset includes 360 degree images from 7 cameras with overlapping fields of view, 3D point clouds from long range LiDAR, 6-DOF pose, and 3D track annotations. Notably, it is the only modern AV dataset that provides forward-facing stereo imagery. The Argoverse Motion Forecasting dataset includes more than 300,000 5-second tracked scenarios with a particular vehicle identified for trajectory forecasting. Argoverse is the first autonomous vehicle dataset to include "HD maps" with 290 km of mapped lanes with geometric and semantic metadata. All data is released under a Creative Commons license at www.argoverse.org. In our baseline experiments, we illustrate how detailed map information such as lane direction, driveable area, and ground height improves the accuracy of 3D object tracking and motion forecasting. Our tracking and forecasting experiments represent only an initial exploration of the use of rich maps in robotic perception. We hope that Argoverse will enable the research community to explore these problems in greater depth.

研究の動機と目的

自動運転における3D物体追跡とモーション予測のための大規模でモーダルなデータを提供する。
HDマップの要素（ベクター車線中心線、地盤高、走行可能領域）を導入し、それらが知覚タスクでの有用性を実証する。
さまざまなクラスにわたる真の3Dトラックアノテーションと、多様で現実的なシナリオを含む予測ベンチマークを提供する。

提案手法

同期LiDAR、360°RGB動画、前方ステレオ、6-DoF位置情報を備えた2つのデータセット（3D TrackingとMotion Forecasting）を公開する。
ベクター車線グラフ、ラスタ地盤高、ラスタ走行可能領域/ROIを含むHDマップを提供する。
走行可能領域から5 m以内の15クラスの3Dキューブトラックを注釈する。
1006時間分のデータから5 s先の未来軌道を抽出して大型予測ベンチマークを作成する（324,557シーケンス）。
マップデータとセンサーデータを結ぶAPIを提供し、マップベースの知覚と予測のベースラインを可能にする。

実験結果

リサーチクエスチョン

RQ1HDマップ情報（車線中心線、走行可能領域、地盤高）は自動運転における3D物体追跡をどう改善できるか？
RQ2複雑な走行シーン（交差点、合流、密集した交通）で、豊富な意味情報を持つ地図をどのように活用してモーション予測を改善できるか？
RQ3地図ベースの事前情報と多モーダル予測が追跡精度と予測の多様性に与える影響は？
RQ4地図文脈が地上除去、向きのスナップ、軌道のプリューニングに予測モデルへどう影響するか？

主な発見

HDマップの文脈は3D追跡精度を向上させ、特に地上除去とレーン方向への向き合わせに寄与する。
予測ベンチマークは、マップ由来の中心線と走行可能領域の事前情報が、多様で妥当な将来軌道と高いDACを実現することを示す。
社会的文脈と地図機能（中心線、車線オフセット）を組み込んだベースライン予測モデルは、複数の予測時間幅（最大3 s）でminADE/minFDEとDACを向上させる。
データセットには追跡用の113ログセグメント（11,052個の注釈オブジェクト）と5秒 forecasting sequencesが324,557あり、マルチモーダル予測の豊かな評価を可能にする。
より大規模でリッチなマップはマップ自動化研究を促進し、HD-map駆動の知覚と予測手法のベンチマークを提供する。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。