QUICK REVIEW

[논문 리뷰] Prediction of the atomistic Hubbard U interaction from moiré system STM-images using image recognition

Nachiket Anil Tanksale, Tobias Stauber|arXiv (Cornell University)|2026. 02. 21.

Graphene research and applications인용 수 0

한 줄 요약

본 논문은 평면 밴드 영역에서 시뮬레이션된 STM FT-LDOS 이미지를 통해 현장 허버드 U를 추정하기 위한 CNN 기반 회귀를 개발하고, 높은 정확성을 달성하며 U/t ≈ 1 근처에서 약한-강한 결합으로의 교차를 드러낸다.

ABSTRACT

The atomistic Hubbard interaction U, representing the on-site Coulomb repulsion, serves as a pivotal parameter in theoretical models describing of correlated systems, yet its precise experimental determination especially in moiré systems remains challenging. Scanning Tunneling Microscopy(STM) provides real-space images of the local density of states (LDOS), offering rich data sets that reflect the unique electronic structure of the material. Here, we introduce a systematic methodology for extracting the Hubbard U parameter directly from these LDOS images through the application of machine learning (ML) in the case of twisted bilayer graphene in the flat-band regime. The regression of U is highly accurate even though the image-similarity is greater than 99.98%. Subsequent data-analysis further suggest a weak crossover between the weak and strong coupling regime at Uc/t 1

연구 동기 및 목표

Infer the effective on-site Hubbard interaction U directly from STM-like FT-LDOS images of twisted bilayer graphene in the flat-band regime.
Demonstrate that CNNs can accurately regress U from FT-LDOS data despite high image similarity across U values.
Explore model interpretability to identify momentum-space features driving the U regression.
Assess interpolation and extrapolation capabilities, including fractional U values, within and beyond the training range.
Provide a data-driven framework for Hamiltonian learning in correlated moiré materials.

제안 방법

Generate a dataset by solving a Hartree–Fock model of twisted bilayer graphene under hydrostatic pressure to realize a flat-band regime at θ ≈ 3.5°, for U in [0,6] eV with ε = 10.
Compute LDOS and its Fourier transform (FT-LDOS) to mimic STM images at εF and generate 256×256 grayscale inputs after normalization.
Train two CNN regressors (a custom 4-block CNN and a ResNet-18 variant) to map FT-LDOS images to U using mean squared error loss.
Evaluate performance with MAE, RMSE, and R² on test sets and held-out fractional U values to probe interpolation and extrapolation.
Apply Grad-CAM and guided backpropagation to localize the momentum-space regions and pixels driving predictions.
Compare CNN performance to a PCA+ridge baseline to highlight nonlinear feature utilization.

Figure 1 : (A) Hartree-Fock bands at filling $\nu=3$ setting the Fermi energy $\epsilon_{F}$ to zero with $\epsilon=10$ obeying all symmetries, for $U=0$ – $5$ eV. (B) Density of states of the upper two flat bands for the same parameters.

실험 결과

연구 질문

RQ1Can machine learning regress the on-site Hubbard interaction U directly from FT-LDOS STM-like images in a moiré system?
RQ2What FT-LDOS features in momentum space correlate with changes in U, and can interpretability tools reveal these features?
RQ3How well can CNNs interpolate within a discretely trained U grid and extrapolate to fractional U values not in the training set?
RQ4Is there a detectable crossover between weak and strong coupling regimes as U varies, and where does it occur?
RQ5How does the CNN-based approach compare to linear baselines in extracting U from high-dimensional STM data?

주요 결과

모델	MAE (eV)	RMSE (eV)	R²
CNN (test)	0.14±0.03	0.21±0.02	0.984±0.003
CNN (held-out)	0.50±0.08	0.54±0.07	0.87±0.04
ResNet-18 (test)	0.12±0.02	0.21±0.04	0.985±0.006
ResNet-18 (held-out)	0.49±0.02	0.57±0.02	0.85±0.01

CNNs achieve highly accurate regression of U from FT-LDOS images with R² ≈ 0.984–0.985 on the standard test sets.
Held-out performance for fractional U values drops (R² ≈ 0.87–0.88), indicating limited extrapolation beyond trained U ranges.
PCA reveals a strongly low-dimensional data structure (PC1 56.8%, PC2 26.8%, PC3 6.1%), with PC1 showing monotonic U dependence.
Grad-CAM and guided backpropagation show the regression relies on momentum-space regions near Bragg peaks and their contrasts, evolving with U.
There is an inferred crossover around U/t ≈ 1–2.7 eV, supported by Grad-CAM patterns and PCA trends.
Compared to ridge regression on leading PCs, CNNs provide notably better regression performance (MAE ~0.12–0.14 eV vs ~0.91 R² baseline).

Figure 2 : Fourier-transformed LDOS (FT-LDOS) for $\nu=-2$ and $\epsilon=12$ in the valley-coherence symmetry broken state, as discussed in Ref. Sánchez Sánchez et al. , 2024 . The principal Bragg peaks (black arrows) are located at the reciprocal lattice vectors of the two rotated graphene layers,

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.