QUICK REVIEW

[논문 리뷰] SeisBind: Physics-Aware Tri-Modal Representation Binding for Seismic Data via Contrastive Learning

Chaohua Liang, Jun Matsushima|arXiv (Cornell University)|2026. 01. 21.

Seismic Imaging and Inversion Techniques인용 수 0

한 줄 요약

SeisBind는 시지진 데이터, 속도 모델, 물리학 설명자를 결합한 삼모달 물리 인식 임베딩 공간을 학습하여 대조 학습을 이용해 견고한 교차-모달 검색 및 해석 가능한 물리적 의미를 가능하게 한다.

ABSTRACT

This letter proposes a physics-aware multi-modal contrastive learning framework designed to transform complex seismic wavefields into human-readable physical representations. Traditional data-driven inversion methods often focus on pixel-wise mapping, which lacks physical grounding and interpretability. To address this, we introduce a novel framework that jointly aligns seismic shot gathers, subsurface velocity models, and explicit physical descriptors (e.g., mean velocity and gradients) in a shared latent space. By introducing these descriptors as a third modality, our approach encourages the learned embeddings to capture intrinsic geological semantics rather than superficial signal correlations. Experiments on the OpenFWI dataset demonstrate that the proposed method not only achieves robust seismic-to-velocity retrieval but also preserves meaningful physical semantics, enabling cross-modal inference of interpretable attributes. This representation-centric perspective provides a flexible foundation for expert-guided subsurface characterization.

연구 동기 및 목표

물리적 설명자에 기반해 표현을 물리적으로 설명 가능한 방식으로 정당화하여 해석 가능한 속도 추정을 유도한다.
공유 잠재 공간에서 시진 데이터, 속도 모델, 물리학 설명자를 정렬하는 tri-modal CLIP 스타일 프레임워크를 제안한다.
OpenFWI 데이터에서 견고한 시진→속도 검색 및 물리적 의미 보존을 입증한다.
물리 설명자가 학습 표현의 해석 가능성을 높이는 시맨틱 앵커로 작용함을 보인다.

제안 방법

입력들을 256차원 공유 임베딩 공간으로 매핑하기 위해 Seismic, Velocity, Physics/Info의 세 인코더를 사용한다.
Seismic 인코더는 다중 샷 수집을 처리하기 위해 대칭 CNN을 사용한다.
Velocity 인코더는 단일 채널 입력에 대해 수정된 ResNet-18 백본을 사용한다.
Physics 인코더는 저차원 설명자 벡터를 MLP로 인코딩하여 시맨틱 앵커를 제공한다.
Seismic–Velocity 및 Velocity–Physics 임베딩을 정렬하기 위해 대칭 CLIP 스타일 InfoNCE 목표를 학습한다.
학습의 안정성과 물리적 일관성을 강화하기 위해 손실을 λ1=1, λ2=0.1 가중치로 균형을 맞춘다.

실험 결과

연구 질문

RQ1삼모달 임베딩 공간이 시진 데이터, 속도 모델, 물리 설명자를 효과적으로 정렬하여 크로스-모달 검색을 가능하게 하는가?
RQ2명시적 물리 설명자의 포함이 학습 표현의 해석 가능성과 물리적 일관성을 개선하는가?
RQ3삼모달 대비 학습이 unimodal baselines에 비해 시진→속도 검색 성능에 어떤 영향을 주는가?

주요 결과

지표	지진 데이터 → 속도 R@1	지진 데이터 → 속도 R@5	지진 데이터 → 속도 R@10	속도 → 물리 R@1	속도 → 물리 R@5	속도 → 물리 R@10	지진-속도 평균 양의 유사도	지진-속도 평균 음의 유사도	속도-물리 평균 양의 유사도	속도-물리 평균 음의 유사도
검색(OpenFWI)	0.5780	0.8610	0.9350	0.6810	0.9150	0.9610	0.6164	−0.0070	0.6378	−0.0014

Seismic → Velocity 검색 정확도: R@1 = 0.5780, R@5 = 0.8610, R@10 = 0.9350.
Velocity → Physics 검색 정확도: R@1 = 0.6810, R@5 = 0.9150, R@10 = 0.9610.
Seismic–Velocity의 평균 양의 유사도는 0.6164이고 평균 음의 유사도는 −0.0070이다.
Velocity–Physics의 평균 양의 유사도는 0.6378이고 평균 음의 유사도는 −0.0014이다.
정성적 결과는 ground-truth에 고유 구조를 잘 보존하는 top-1 속도 모델을 보여주며, top-2/top-3도 물리적으로 일관성을 유지한다.
Cross-modal 검색을 통한 물리 설명자 추론은 선택된 설명자(k=5)에서 평균 상대 오차 9.12%를 달성한다.

Figure 2 : Qualitative retrieval results. For each query seismic shot gather (first column), the corresponding ground truth (second column) and the top-3 retrieved velocity models are displayed. Similarity scores are provided above each retrieved candidate, demonstrating the model’s precision in str

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.