QUICK REVIEW

[論文レビュー] Research on the Application of Computer Vision Based on Deep Learning in Autonomous Driving Technology

Jingyu Zhang, Jin Xin Cao|arXiv (Cornell University)|Jun 1, 2024

Applied Advanced Technologies被引用数 5

ひとこと要約

本論文は、画像認識、リアルタイム追跡/分類、環境認識/意思決定支援、経路計画/ナビゲーションを網羅する深層学習駆動の自動運転システムを提案し、各モジュールで98%以上の精度とミリ秒単位の応答時間を達成する。

ABSTRACT

This research aims to explore the application of deep learning in autonomous driving computer vision technology and its impact on improving system performance. By using advanced technologies such as convolutional neural networks (CNN), multi-task joint learning methods, and deep reinforcement learning, this article analyzes in detail the application of deep learning in image recognition, real-time target tracking and classification, environment perception and decision support, and path planning and navigation. Application process in key areas. Research results show that the proposed system has an accuracy of over 98% in image recognition, target tracking and classification, and also demonstrates efficient performance and practicality in environmental perception and decision support, path planning and navigation. The conclusion points out that deep learning technology can significantly improve the accuracy and real-time response capabilities of autonomous driving systems. Although there are still challenges in environmental perception and decision support, with the advancement of technology, it is expected to achieve wider applications and greater capabilities in the future. potential.

研究の動機と目的

理論から実装まで、深層学習が自動運転のコンピュータビジョンをどのように向上させるかを調査する。
画像認識、追跡、環境認識、意思決定支援、経路計画の性能向上を評価する。
自動運転における深層学習のシステム設計・制限事項・将来の方向性に関する指針を提供する。

提案手法

選択的探索を用いた2段階検出フレームワーク、CNN特徴抽出、およびSVM分類を組み合わせたCNNベースの画像認識を実装する。
L = L_cls + λ L_reg となる結合損失を用いた、リアルタイムのターゲット追跡と分類のためのマルチタスク結合学習ネットワークを採用する。
環境認識と意思決定支援のため、シミュレーション環境でマルチモーダルセンサデータを用いたDouble DQNを使用する。
適応的な経路計画と障害物回避のため、グラフニューラルネットワーク（GNN）とA*探索を統合する。
ミリ秒レベルの意思決定サイクルを用いた実世界データとシミュレーションのシナリオでシステム性能を評価する。

実験結果

リサーチクエスチョン

RQ1自動運転における画像認識、ターゲット追跡、分類の精度とリアルタイム性能を深層学習はどのように向上させるか。
RQ2深層学習と強化学習を適用した場合、環境認識・意思決定支援・経路計画の性能はどの程度向上するか。
RQ3ヒューリスティックガイダンスを備えたグラフニューラルネットワークベースの経路計画が、動的交通でリアルタイムな障害物回避を実現できるか。

主な発見

Functional Module	Accuracy (%)	Response Time (ms)	Computational Efficiency
画像識別	98.5	45	高い
リアルタイムターゲット追跡と分類	98.2	50	高い
環境認識と意思決定支援	97.8	60	中程度
ルート計画とナビゲーション	98.0	55	高い

画像認識精度：98.5%、約45 msの応答時間。
リアルタイムターゲット追跡と分類の精度：98.2%、約50 msの応答時間。
環境認識と意思決定支援の精度：97.8%、約60 msの応答時間。
ルート計画とナビゲーションの精度：98.0%、約55 msの応答時間。
全体として、モジュール全体で高い精度とミリ秒レベルの応答性を示す。
シミュレーションと実世界データは、自動運転における深層学習の実用性と普及拡大の潜在性を支持する。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。