QUICK REVIEW

[論文レビュー] Comparative Analysis of Multiple Deep CNN Models for Waste Classification

Dipesh Gyawali, Alok Regmi|arXiv (Cornell University)|Apr 5, 2020

Advanced Neural Network Applications参考文献 8被引用数 31

ひとこと要約

この論文は、転移学習を用いた複数の事前学習CNN（特にResNet18）を比較し、4クラスの廃棄物データセットでの検証精度87.8%をResNet18で達成。特徴抽出、ファインチューニング、モデルの性能を分析する。

ABSTRACT

Waste is a wealth in a wrong place. Our research focuses on analyzing possibilities for automatic waste sorting and collecting in such a way that helps it for further recycling process. Various approaches are being practiced managing waste but not efficient and require human intervention. The automatic waste segregation would fit in to fill the gap. The project tested well known Deep Learning Network architectures for waste classification with dataset combined from own endeavors and Trash Net. The convolutional neural network is used for image classification. The hardware built in the form of dustbin is used to segregate those wastes into different compartments. Without the human exercise in segregating those waste products, the study would save the precious time and would introduce the automation in the area of waste management. Municipal solid waste is a huge, renewable source of energy. The situation is win-win for both government, society and industrialists. Because of fine-tuning of the ResNet18 Network, the best validation accuracy was found to be 87.8%.

研究の動機と目的

自動的な廃棄物分別を促進しリサイクルを支援し人手を削減する。
廃棄物画像分類のための事前学習CNNの転移学習を評価する。
最も性能の高いモデルを特定し、その学習挙動と混同行列パターンを分析する。

提案手法

4クラス廃棄物データセット（Plastic, Paper, Metal, Glass）上で事前学習CNN（ResNetおよびVGG）を用いた転移学習を適用する。
画像を整形：512x512にリサイズ、チャネルを正規化し、一般化を向上させるデータ拡張を行う。
特徴抽出のために最後の畳み込み層をアンフリーズし、追加の分類器層でファインチューニングする。
データを80/20で訓練/検証に分割；Adam（lr=0.001）と負の対数尤損失で最適化する。
NVIDIA GTX 1050上でPyTorchを用いてモデル性能と学習曲線を比較する。
訓練/検証損失と精度を比較した後、ResNet18を最も性能が高いモデルとして選択する。

実験結果

リサーチクエスチョン

RQ1廃棄物分類タスクで最も良い性能を示す事前学習CNNアーキテクチャは何か（ResNet、VGGなど）？
RQ2ファインチューニングを伴う転移学習は提供データセットで廃棄物分類の精度を改善するか？
RQ3各モデルについて訓練損失と検証損失・精度はどのように推移し、過学習または不足学習を示唆するか？
RQ4廃棄物カテゴリ間の誤分類パターンは何か、データセットの課題をどのように示しているか？

主な発見

ガラス	金属	紙	プラスチック
83.8	3.7	3.7	8.8
1.7	89.0	2.5	6.8
0.1	1.1	90.2	8.6
7.5	4.6	5.2	82.8

最高の検証精度はResNet18で約87.8%。
ResNet50とVGG16は訓練/検証損失が分岐する過学習傾向を示した。
ResNet18は訓練損失と検証損失が低下し、妥当な検証精度を示し過学習は顕著でない。
混同行列はPaperとPlasticの混同が高く、GlassがPlasticと誤認されることが多い一方、Metalの予測は比較的強い。
特徴マップはCNN層を通じてエッジからより複雑な形状への特徴抽出が段階的に見られる。
データ拡張と正規化を用いて過学習を抑制し一般化を向上させた。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。