QUICK REVIEW

[논문 리뷰] YOLOv11 for Vehicle Detection: Advancements, Performance, and Applications in Intelligent Transportation Systems

Mujadded Al Rabbani Alif|arXiv (Cornell University)|2024. 10. 30.

Advanced Neural Network Applications인용 수 20

한 줄 요약

이 논문은 차량 탐지를 위한 YOLO11을 분석하여 작은 차량과 가려진 차량에 대해 특히 YOLOv8/YOLOv10보다 향상된 mAP와 속도를 보임을 보여준다.

ABSTRACT

Accurate vehicle detection is essential for the development of intelligent transportation systems, autonomous driving, and traffic monitoring. This paper presents a detailed analysis of YOLO11, the latest advancement in the YOLO series of deep learning models, focusing exclusively on vehicle detection tasks. Building upon the success of its predecessors, YOLO11 introduces architectural improvements designed to enhance detection speed, accuracy, and robustness in complex environments. Using a comprehensive dataset comprising multiple vehicle types-cars, trucks, buses, motorcycles, and bicycles we evaluate YOLO11's performance using metrics such as precision, recall, F1 score, and mean average precision (mAP). Our findings demonstrate that YOLO11 surpasses previous versions (YOLOv8 and YOLOv10) in detecting smaller and more occluded vehicles while maintaining a competitive inference time, making it well-suited for real-time applications. Comparative analysis shows significant improvements in the detection of complex vehicle geometries, further contributing to the development of efficient and scalable vehicle detection systems. This research highlights YOLO11's potential to enhance autonomous vehicle performance and traffic monitoring systems, offering insights for future developments in the field.

연구 동기 및 목표

다양한 차량 유형(차, 트럭, 버스, 오토바이, 자전거)과 다양한 조건에서 차량 탐지를 위한 YOLO11의 성능을 평가한다.
정확도, 속도 및 견고성 면에서 YOLO11을 YOLOv8 및 YOLOv10과 비교 벤치마킹한다.
실시간 탐지에서 소형 객체와 가려짐 처리 성능을 향상시키는 아키텍처 혁신을 식별한다.

제안 방법

트레인/검증/테스트 분할(70/15/15)을 갖춘 416x416 해상도의 차량 1,321개 이미지 데이터셋을 사용한다.
색상(Hue), 채도, 밝기, 회전, 평행이동, 스케일링, 전단, 원근 및 모자이크 등을 포함한 광범위한 데이터 증강을 적용한다.
C3k2 백본, SPPF 및 C2PSA 블록을 갖춘 YOLO11 아키텍처를 도입하며 세 스케일 탐지 헤드(P3, P4, P5)를 사용한다.
모멘텀 0.937, 가중치 감소 0.0005인 SGD로 300 에포크 학습하며 코사인 어닐링 학습률(0.01에서 시작) 및 다중 스케일 학습(320-640)을 사용한다.
정밀도, 재현율, F1, mAP(IoU 0.5–0.95), 추론 시간으로 평가하고 YOLOv8 및 YOLOv10과 비교한다.

실험 결과

연구 질문

RQ1YOLO11이 YOLOv8/YOLOv10에 비해 소형 차량 및 가려진 차량의 탐지 성능을 향상시키는가?
RQ2다양한 차량 유형에 대해 YOLO11의 정확도, 속도 및 견고성 특성은 어떠한가?
RQ3아키텍처 혁신(C3k2, C2PSA)이 실시간 차량 탐지 성능에 어떤 영향을 미치는가?
RQ4다양한 환경 조건에서 YOLO11은 전임자들에 비해 어떻게 성능을 발휘하는가?

주요 결과

YOLO11은 모든 클래스에서 mAP@0.5가 0.743로 더 높으며, 클래스별 점수는 Car 0.837, Motorcycle 0.679, Truck 0.355, Bus 0.863, Bicycle 0.982이다.
YOLO11은 IoU 임계값 0.95까지의 mAP에서 YOLOv8 및 YOLOv10을 상회한다(mAP@0.5:0.95 = 48.5%).
YOLO11은 추론 속도 290 FPS를 달성하여 YOLOv10(280 FPS) 및 YOLOv8(260 FPS)보다 빠르다.
YOLO11은 더 작은 객체/가려진 객체의 탐지 및 다양한 조건에서의 견고성이 향상되었다.
혼동 및 정밀-재현 분석은 자동차와 버스에서 더 강한 성능을 나타내며, 트럭과 버스 간의 일부 오분류가 주로 발생한다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.