QUICK REVIEW

[論文レビュー] SiMiC: Context-Aware Silicon Microstructure Characterization Using Attention-Based Convolutional Neural Networks for Field-Emission Tip Analysis

Jing Jie Tan, Rupert Schreiner|arXiv (Cornell University)|Jan 21, 2026

Machine Learning in Materials Science被引用数 0

ひとこと要約

SiMiCはSEM画像からシリコン微細構造特徴を抽出する注意機構付きCNNを用い、場発電子ビーム用チップの幅・高さ・半径の多パラメータ予測を実現。注意機構・データ拡張・バックボーン選択により性能向上を示す。

ABSTRACT

Accurate characterization of silicon microstructures is essential for advancing microscale fabrication, quality control, and device performance. Traditional analysis using Scanning Electron Microscopy (SEM) often requires labor-intensive, manual evaluation of feature geometry, limiting throughput and reproducibility. In this study, we propose SiMiC: Context-Aware Silicon Microstructure Characterization Using Attention-Based Convolutional Neural Networks for Field-Emission Tip Analysis. By leveraging deep learning, our approach efficiently extracts morphological features-such as size, shape, and apex curvature-from SEM images, significantly reducing human intervention while improving measurement consistency. A specialized dataset of silicon-based field-emitter tips was developed, and a customized CNN architecture incorporating attention mechanisms was trained for multi-class microstructure classification and dimensional prediction. Comparative analysis with classical image processing techniques demonstrates that SiMiC achieves high accuracy while maintaining interpretability. The proposed framework establishes a foundation for data-driven microstructure analysis directly linked to field-emission performance, opening avenues for correlating emitter geometry with emission behavior and guiding the design of optimized cold-cathode and SEM electron sources. The related dataset and algorithm repository that could serve as a baseline in this area can be found at https://research.jingjietan.com/?q=SIMIC

研究の動機と目的

高速で客観的なシリコン微細構造の特性評価を促進し、場発電子チップ設計を支援する。
SEMで撮像したシリコンベースの場発エミッタチップのML分析用データセットを開発する。
CNNバックボーン、オプションの構造モジュール、注目機構を統合したSiMiCアーキテクチャを提案し、幾何パラメータを予測する。
精度と解釈性のためにバックボーンオプション（ResNet、EfficientNet、MobileNet）と注意機構を評価する。
データ拡張と幅/高さの明示的な入力が半径予測と全体性能を改善することを示す。

提案手法

SEM画像を処理するCNNバックボーンがチップ幾何（幅、高さ、半径）を予測する。
任意の構造モジュールが参照寸法を提供し、埋め込み層が幅・高さを注意フローへ投影する。
探索された2つの注意機構: Bahdanau型加法的注意とMulti-Head Attention（MHA）。
CoordConvベースの構造強化と学習可能な射影が物体幾何を埋め込み、注意を誘導する。
学習にはロバスト性と滑らかな最適化のバランスをとるためにHuber損失を使用。
データ拡張の有無を含むバックボーン（ResNet、EfficientNet、MobileNet）を広範に評価して性能を比較。

実験結果

リサーチクエスチョン

RQ1SEM画像からの幾何パラメータ（幅、高さ、半径）をCNNと注意機構で信頼的に推定できるか。
RQ2最も予測精度を与えるバックボーン構造と注意機構はどれか。
RQ3データ拡張と明示的な幾何入力（幅/高さ）は半径予測と全体的なモデル性能にどう影響するか。
RQ4CoordConvを含む構造モジュールの組み込みは特徴整列と予測の解釈性を向上させるか。

主な発見

Approaches	RMSE Width	RMSE Height	RMSE Radius (Full)	RMSE Radius (Half)	R2 Width (Full)	R2 Height (Full)	R2 Radius (Full)	R2 Radius (Half)
ResNet	1.1947	1.2817	0.0443	0.0225	0.2147	0.2131	0.2227	0.2623
EfficientNet	1.2982	1.3182	0.0942	0.0313	0.2033	0.2189	0.2293	0.2895
MobileNet	1.2712	1.3276	0.0574	0.0274	0.2113	0.2221	0.2231	0.2934
Additive Attention + ResNet	1.1892	1.2801	0.0421	0.0221	0.2138	0.2112	0.2234	0.2908
Multihead Attention + ResNet	1.0097	1.2484	0.0395	0.0192	0.2259	0.2228	0.2268	0.2887
Augmentation + Multihead Attention + ResNet	0.9312	1.1158	0.0319	0.0117	0.2295	0.2330	0.2362	0.3098

ResNetバックボーンはすべての予測変数に対してRMSEの点でEfficientNetおよびMobileNetを上回った。
Multi-head attentionは加法的注意および非注意ベースのベースラインより性能を向上させた。
データ拡張は構成間でRMSEを substantial に低減し、R2を向上させた。
幅と高さを入力特徴として提供すると半径RMSEが劇的に低下（約半分）し、ResNetで半径0.0225、全体0.0443へ削減。
Augmentation + Multihead Attention + ResNetは最良の半径RMSE（全0.0319、半0.0117）と半分予測で最大のR2（0.3098）を達成。
注意マップはモデルが対象の輪郭や文脈領域に焦点を合わせ、幾何ヒントが半径推定を導くことを示している。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。