QUICK REVIEW

[论文解读] YOLOv11 Optimization for Efficient Resource Utilization

Adil Rasheed, M. Zarkoosh|arXiv (Cornell University)|Dec 19, 2024

Cloud Computing and Resource Management被引用 11

一句话总结

本文开发六种针对尺寸的 YOLOv11 变体以提高效率，并额外设计一个对象尺寸分类器以便为每个数据集选择最佳模型，并将其与 YOLOv11 和 YOLOv8 进行比较，在保持或提高准确率的同时实现更小的模型规模和更快的推理时间。

ABSTRACT

The objective of this research is to optimize the eleventh iteration of You Only Look Once (YOLOv11) by developing size-specific modified versions of the architecture. These modifications involve pruning unnecessary layers and reconfiguring the main architecture of YOLOv11. Each proposed version is tailored to detect objects of specific size ranges, from small to large. To ensure proper model selection based on dataset characteristics, we introduced an object classifier program. This program identifies the most suitable modified version for a given dataset. The proposed models were evaluated on various datasets and compared with the original YOLOv11 and YOLOv8 models. The experimental results highlight significant improvements in computational resource efficiency, with the proposed models maintaining the accuracy of the original YOLOv11. In some cases, the modified versions outperformed the original model regarding detection performance. Furthermore, the proposed models demonstrated reduced model sizes and faster inference times. Models weights and the object size classifier can be found in this repository

研究动机与目标

通过将 YOLOv11 针对对象尺寸范围进行定制来提升高效对象检测。
开发六个改进的 YOLOv11 变体，专门针对小、中、大和混合尺寸对象。
提供一个对象尺寸分类器基于数据集特征自动选择合适的模型。
在多样数据集上对改进模型与 YOLOv11 和 YOLOv8 进行评估。
在维持或提高准确率的前提下，显示模型大小和推理时间的下降。

提出的方法

提出六个 YOLOv11 变体：YOLOv11-small、-medium、-large、-sm、-ml、-sl，每个都针对特定对象尺寸范围。
系统地对主干/颈部/头部模块进行剪枝和重新配置，去除与尺寸无关的组件。
使用对象尺寸分类器分析数据集标签并选择最合适的模型。
在六个数据集上使用标准检测指标（召回率、精确度、mAP@50）和资源指标（模型大小、GFLOPs、推理时间、功耗）进行评估。
将改进模型与原始 YOLOv11 和 YOLOv8 进行比较，以评估准确性与效率之间的权衡。

Figure 1: Showing the research trends for one-stage and two-stage object detection methods from 2016 to 2024.

实验结果

研究问题

RQ1针对 YOLOv11 的尺寸特定剪枝是否在降低资源使用的同时保持准确性？
RQ2自动对象尺寸分类器是否能可靠地为给定数据集选择最合适的模型？
RQ3在不同数据集上，修改后的 YOLOv11 变体的运行时和尺寸指标与 YOLOv11 和 YOLOv8 相比如何？
RQ4是否某些以对象尺寸为焦点的变体在特定应用领域（如农业、医学成像、航拍监控）更具优势？

主要发现

数据集	模型	召回率 (%)	精确度 (%)	mAP@50 (%)
WeedCrop	YOLOv11	67.07	70.62	69.09
WeedCrop	YOLOv11-small	70.98	69.27	69.27
WeedCrop	YOLOv8	63.98	63.98	68.50
BCCD	YOLOv11	91.95	84.90	92.98
BCCD	YOLOv11-medium	90.56	85.34	92.67
BCCD	YOLOv8	91.62	86.93	93.19
Underwater Pipes	YOLOv11	99.15	99.01	99.43
Underwater Pipes	YOLOv11-large	98.90	99.21	99.48
Underwater Pipes	YOLOv8	99.46	98.60	99.47
Aerial Airport	YOLOv11	87.42	91.73	92.93
Aerial Airport	YOLOv11-sm	86.91	93.15	93.20
Aerial Airport	YOLOv8	87.05	90.87	92.23
Brain Tumor	YOLOv11	74.73	89.21	81.67
Brain Tumor	YOLOv11-ml	72.05	90.90	80.15
Brain Tumor	YOLOv8	72.46	90.80	80.49
Face Detection	YOLOv11	93.10	97.98	96.14
Face Detection	YOLOv11-sl	93.10	93.05	97.70
Face Detection	YOLOv8	87.93	93.63	93.05

经修改的 YOLOv11 变体在大多数数据集上在召回率、精确度和 mAP@50 方面通常超越 YOLOv8。
所有提出的变体的模型大小均小于原始 YOLOv11，GFLOPs 和推理时间更高效（或同等）。
某些修改后的变体（如 YOLOv11-small、YOLOv11-large、YOLOv11-sm、YOLOv11-sl）在某些数据集上达到比原始 YOLOv11 更高的 mAP@50。
原始 YOLOv11 有时优于某些修改后的变体（如 YOLOv11-medium 和 YOLOv11-sm），但最大性能差距不足 2%。
所有模型的推理时间均在每张图像 5 ms 之内，修改后的变体平均推理时间更短。
对多种修改变体（YOLOv11-small、-medium、-large、-ml）相比 YOLOv11 和 YOLOv8，功耗更低。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。