QUICK REVIEW

[论文解读] Comparative Analysis of Multiple Deep CNN Models for Waste Classification

Dipesh Gyawali, Alok Regmi|arXiv (Cornell University)|Apr 5, 2020

Advanced Neural Network Applications参考文献 8被引用 31

一句话总结

本文比较了多种预训练的卷积神经网络（尤其是 ResNet18）用于通过迁移学习进行垃圾图像分类，在 ResNet18 下达到最佳验证准确度 87.8%。它分析特征提取、微调以及在一个4类垃圾数据集上的模型表现。

ABSTRACT

Waste is a wealth in a wrong place. Our research focuses on analyzing possibilities for automatic waste sorting and collecting in such a way that helps it for further recycling process. Various approaches are being practiced managing waste but not efficient and require human intervention. The automatic waste segregation would fit in to fill the gap. The project tested well known Deep Learning Network architectures for waste classification with dataset combined from own endeavors and Trash Net. The convolutional neural network is used for image classification. The hardware built in the form of dustbin is used to segregate those wastes into different compartments. Without the human exercise in segregating those waste products, the study would save the precious time and would introduce the automation in the area of waste management. Municipal solid waste is a huge, renewable source of energy. The situation is win-win for both government, society and industrialists. Because of fine-tuning of the ResNet18 Network, the best validation accuracy was found to be 87.8%.

研究动机与目标

促进自动垃圾分拣以帮助回收并减少人力劳动。
评估使用预训练的 CNN 进行垃圾图像分类的迁移学习。
确定表现最佳的模型并分析其学习行为与混淆模式。

提出的方法

在4类垃圾数据集（Plastic, Paper, Metal, Glass）上使用带有预训练的 CNN 的迁移学习（ResNet 和 VGG）。
对图像进行预处理：将尺寸调整为 512x512，归一化通道，并数据增强以提高泛化能力。
解冻最后的卷积层进行特征提取，然后通过添加的分类层进行微调。
将数据按 80/20 进行训练/验证划分；使用 Adam（lr=0.001）和负对数似然损失进行优化。
在 NVIDIA GTX 1050 上使用 PyTorch 进行训练，以比较模型性能和学习曲线。
在比较训练/验证损失和准确度后，选择 ResNet18 作为表现最佳的模型。

实验结果

研究问题

RQ1哪些预训练的 CNN 架构（ResNet、VGG 等）在垃圾分类任务中表现最好？
RQ2迁移学习结合微调是否能提高所提供数据集的垃圾分类准确度？
RQ3每个模型的训练与验证损失和准确度如何演变，是否出现过拟合或欠拟合？

主要发现

Glass	Metal	Paper	Plastic
83.8	3.7	3.7	8.8
1.7	89.0	2.5	6.8
0.1	1.1	90.2	8.6
7.5	4.6	5.2	82.8

最佳验证准确度在 ResNet18 上实现，约为 87.8%。
ResNet50 和 VGG16 显示出过拟合倾向，训练损失与验证损失发散。
ResNet18 在训练和验证损失均呈下降趋势，且验证精度合理，未出现显著过拟合。
混淆矩阵显示 Paper 与 Plastic 之间误分较多，Glass 常被误认成 Plastic，而 Metal 的预测相对较强。
特征图展示了从边缘到更复杂形状的逐层特征提取。
数据增强和归一化被用来减轻过拟合并提升泛化。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。