QUICK REVIEW

[論文レビュー] YOLOv11 Optimization for Efficient Resource Utilization

Adil Rasheed, M. Zarkoosh|arXiv (Cornell University)|Dec 19, 2024

Cloud Computing and Resource Management被引用数 11

ひとこと要約

本論文は、効率性のためにサイズ別の六つの YOLOv11 変種を開発し、データセットごとに最適なモデルを選択するオブジェクトサイズ分類器を追加し、YOLOv11および YOLOv8と比較して、サイズと推論時間を削減しつつ、精度は同等か向上を示す。

ABSTRACT

The objective of this research is to optimize the eleventh iteration of You Only Look Once (YOLOv11) by developing size-specific modified versions of the architecture. These modifications involve pruning unnecessary layers and reconfiguring the main architecture of YOLOv11. Each proposed version is tailored to detect objects of specific size ranges, from small to large. To ensure proper model selection based on dataset characteristics, we introduced an object classifier program. This program identifies the most suitable modified version for a given dataset. The proposed models were evaluated on various datasets and compared with the original YOLOv11 and YOLOv8 models. The experimental results highlight significant improvements in computational resource efficiency, with the proposed models maintaining the accuracy of the original YOLOv11. In some cases, the modified versions outperformed the original model regarding detection performance. Furthermore, the proposed models demonstrated reduced model sizes and faster inference times. Models weights and the object size classifier can be found in this repository

研究の動機と目的

YOLOv11を物体サイズ範囲に合わせて効率的な物体検出を動機づける。
小型・中型・大型・混合サイズの物体に特化した六つの改良された YOLOv11 変種を開発する。
データセットの特性に基づいて適切なモデルを自動的に選択するオブジェクトサイズ分類器を提供する。
多様なデータセットにわたって改良モデルを YOLOv11 および YOLOv8 と比較して評価する。
精度と効率のトレードオフを維持または改善しつつ、モデルサイズと推論時間の削減を示す。

提案手法

6つの YOLOv11 変種を提案する：YOLOv11-small, -medium, -large, -sm, -ml, -sl、それぞれ特定の物体サイズ範囲を対象とする。
各変種ごとにサイズと無関係なコンポーネントを削除するために、バックボーン/ネック/ヘッドブロックを体系的に剪定し再構成する。
データセットのラベルを分析して最も適切なモデルを選択するオブジェクトサイズ分類器を使用する。
標準検出指標（Recall、Precision、mAP@50）とリソース指標（モデルサイズ、GFLOPs、推論時間、電力）を用いて6つのデータセットでモデルを評価する。
改変モデルを元の YOLOv11 および YOLOv8 と比較して、精度と効率の間のトレードオフを評価する。

Figure 1: Showing the research trends for one-stage and two-stage object detection methods from 2016 to 2024.

実験結果

リサーチクエスチョン

RQ1サイズ別の剪定がリソース使用量を削減しつつ精度を維持できるか？
RQ2自動化されたオブジェクトサイズ分類器が与えられたデータセットに対して最も適切なモデルを信頼性高く選択できるか？
RQ3改変された YOLOv11 変種の実行時・サイズ指標は、YOLOv11 および YOLOv8 と比較して多様なデータセットでどうなるか？
RQ4特定のアプリケーション領域（例：農業、医療画像、航空監視）に対して、特定の物体サイズに焦点を当てた変種が優れている場合があるか？

主な発見

Dataset	Model	Recall (%)	Precision (%)	mAP@50 (%)
WeedCrop	YOLOv11	67.07	70.62	69.09
WeedCrop	YOLOv11-small	70.98	69.27	69.27
WeedCrop	YOLOv8	63.98	63.98	68.50
BCCD	YOLOv11	91.95	84.90	92.98
BCCD	YOLOv11-medium	90.56	85.34	92.67
BCCD	YOLOv8	91.62	86.93	93.19
Underwater Pipes	YOLOv11	99.15	99.01	99.43
Underwater Pipes	YOLOv11-large	98.90	99.21	99.48
Underwater Pipes	YOLOv8	99.46	98.60	99.47
Aerial Airport	YOLOv11	87.42	91.73	92.93
Aerial Airport	YOLOv11-sm	86.91	93.15	93.20
Aerial Airport	YOLOv8	87.05	90.87	92.23
Brain Tumor	YOLOv11	74.73	89.21	81.67
Brain Tumor	YOLOv11-ml	72.05	90.90	80.15
Brain Tumor	YOLOv8	72.46	90.80	80.49
Face Detection	YOLOv11	93.10	97.98	96.14
Face Detection	YOLOv11-sl	93.10	93.05	97.70
Face Detection	YOLOv8	87.93	93.63	93.05

改変された YOLOv11 変種は、ほとんどのデータセットにおいて recall、precision、mAP@50 で YOLOv8 を上回る傾向がある。
提案された全ての変種は元の YOLOv11 よりモデルサイズが小さく、GFLOPsおよび推論時間においてより（または同様に）効率的。
特定のデータセットで、改変された変種（例：YOLOv11-small、YOLOv11-large、YOLOv11-sm、YOLOv11-sl）が元の YOLOv11 より高い mAP@50 を達成する。
元の YOLOv11 が、改変されたいくつかのバリアント（例：YOLOv11-mediumおよびYOLOv11-sm）よりも優れている場合があるが、最大の性能ギャップは 2%未満である。
全モデルの推論時間は画像あたり 5 ms 未満で、改変された変種は平均して推論時間を短縮。
いくつかの改変バリアント（YOLOv11-small、-medium、-large、-ml）の電力消費は、YOLOv11および YOLOv8より低い。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。