QUICK REVIEW

[論文レビュー] YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems

Mujadded Al Rabbani Alif|arXiv (Cornell University)|Oct 30, 2024

Advanced Neural Network Applications被引用数 20

ひとこと要約

この論文は車両検出の YOLO11 を分析し、YOLOv8/YOLOv10 より小型車両や遮蔽車両に対して特に mAP と速度の改善を示します。

ABSTRACT

Accurate vehicle detection is essential for the development of intelligent transportation systems, autonomous driving, and traffic monitoring. This paper presents a detailed analysis of YOLO11, the latest advancement in the YOLO series of deep learning models, focusing exclusively on vehicle detection tasks. Building upon the success of its predecessors, YOLO11 introduces architectural improvements designed to enhance detection speed, accuracy, and robustness in complex environments. Using a comprehensive dataset comprising multiple vehicle types-cars, trucks, buses, motorcycles, and bicycles we evaluate YOLO11's performance using metrics such as precision, recall, F1 score, and mean average precision (mAP). Our findings demonstrate that YOLO11 surpasses previous versions (YOLOv8 and YOLOv10) in detecting smaller and more occluded vehicles while maintaining a competitive inference time, making it well-suited for real-time applications. Comparative analysis shows significant improvements in the detection of complex vehicle geometries, further contributing to the development of efficient and scalable vehicle detection systems. This research highlights YOLO11's potential to enhance autonomous vehicle performance and traffic monitoring systems, offering insights for future developments in the field.

研究の動機と目的

車両タイプ（自動車、トラック、バス、オートバイ、自転車）および状況を跨ぐ車両検出に対する YOLO11 の性能を評価する。
精度、速度、堅牢性の観点から YOLO11 を YOLOv8 および YOLOv10 と比較する。
リアルタイム検出における小物体・遮蔽処理を改善するアーキテクチャ的イノベーションを特定する。

提案手法

416x416 解像度の車両データセット 1,321 枚を train/validation/test（70/15/15）に分割して使用する。
色相、彩度、明度、回転、平行移動、スケーリング、シア、視点、モザイクを含む広範なデータ増強を適用する。
C3k2 バックボーン、SPPF、C2PSA ブロックを備えた YOLO11 アーキテクチャを導入する；三つのスケール検出ヘッド（P3、P4、P5）を使用する。
SGD（モメンタム 0.937、ウェイト減衰 0.0005）で 300 エポック学習を行い、コサイン減衾学習率で開始値 0.01、マルチスケール訓練（320-640）を適用する。
精度、再現率、F1、mAP（IoU 0.5–0.95）、推論時間を用いて評価する；YOLOv8 および YOLOv10 と比較する。

実験結果

リサーチクエスチョン

RQ1YOLO11 は YOLOv8/YOLOv10 と比較して小型車両および遮蔽車両の検出を改善するか？
RQ2YOLO11 の精度、速度、堅牢性の特徴は車为種ごとにどうなるか？
RQ3アーキテクチャ的革新（C3k2、C2PSA）はリアルタイム車両検出の性能にどう影響するか？
RQ4YOLO11 はさまざまな環境条件下で前任機と比較してどう性能を発揮するか？

主な発見

モデル	mAP@0.5	mAP@0.75	mAP@[0.5:0.95]
YOLOv8	73.9%	64.5%	45.2%
YOLOv10	74.3%	65.2%	46.7%
YOLOv11	76.8%	68.1%	48.5%

YOLO11 は全クラスで mAP@0.5 が高く 0.743、クラス別スコアは Car 0.837、Motorcycle 0.679、Truck 0.355、Bus 0.863、Bicycle 0.982。
YOLOv8 および YOLOv10 を IoU 閾値 0.95 までの mAP で上回る（mAP@0.5:0.95 = 48.5%）。
YOLO11 は推論速度 290 FPS を達成し、YOLOv10 の 280 FPS を上回り、YOLOv8 の 260 FPS よりも速い。
YOLO11 は小型/遮蔽物体（例：自転車、オートバイ）の検出改善と、さまざまな条件での堅牢性向上を示す。
混同行列および精度-再現分析は、車とバスの性能がより強く、トラックとバスの誤分類が主に発生していることを示唆する。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。