QUICK REVIEW

[論文レビュー] Testing Storage-System Correctness: Challenges, Fuzzing Limitations, and AI-Augmented Opportunities

Ying Wang, Jiahui Chen|arXiv (Cornell University)|Feb 2, 2026

Software System Performance and Reliability被引用数 0

ひとこと要約

この論文はストレージシステムの正確性検証を調査し、障害モデルを概説し、層を超えた既存技術を評価し、ファuzzingの限界を検討し、意味論を意識した検証のAI活用機会を探る。

ABSTRACT

Storage systems are fundamental to modern computing infrastructures, yet ensuring their correctness remains challenging in practice. Despite decades of research on system testing, many storage-system failures (including durability, ordering, recovery, and consistency violations) remain difficult to expose systematically. This difficulty stems not primarily from insufficient testing tooling, but from intrinsic properties of storage-system execution, including nondeterministic interleavings, long-horizon state evolution, and correctness semantics that span multiple layers and execution phases. This survey adopts a storage-centric view of system testing and organizes existing techniques according to the execution properties and failure mechanisms they target. We review a broad spectrum of approaches, ranging from concurrency testing and long-running workloads to crash-consistency analysis, hardware-level semantic validation, and distributed fault injection, and analyze their fundamental strengths and limitations. Within this framework, we examine fuzzing as an automated testing paradigm, highlighting systematic mismatches between conventional fuzzing assumptions and storage-system semantics, and discuss how recent artificial intelligence advances may complement fuzzing through state-aware and semantic guidance. Overall, this survey provides a unified perspective on storage-system correctness testing and outlines key challenges

研究の動機と目的

現代のストレージシステムアーキテクチャと正確性検証を複雑にする障害モデルを特徴づける。
障害クラスと基盤となる前提条件に基づいて、既存のストレージシステム検証技法を体系化する。
ファuzzingがストレージシステムの意味論とどのように整合するかを分析し、その限界を特定する。
AI対応手法が状態認識と意味論的指針を検証に提供する可能性を検討する。
将来の意味論を意識した検証を指針づける統一的な、ストレージ中心のフレームワークを概説する。

提案手法

実行特性と対象となる障害機構ごとに検証技法を整理する。
5つの大きな障害クラスを検討する：時間的/発生源、状態進化、クラッシュ一貫性/回復、ハードウェア/永続モデル、分散調整/レプリカ一貫性。
ファ fuzzing パイプラインを分解し、ストレージシステム要件との整合性を評価し、ギャップを特定する。
AI補強検証を、状態進化・履歴・意味論的正しさを推論する機構として議論する。
ストレージシステムの正確性検証を統一的に捉え、将来の意味論意識型検証の方向性を概説する。

Figure 1 . Multi-Layer Structure of Modern Storage Systems.

実験結果

リサーチクエスチョン

RQ1階層化アーキテクチャ全体でストレージシステムの正確性検証を妨げる障害モードは何か。
RQ2長期・層を跨ぐ・回復時の障害を暴露する点で、既存技法はどれくらい効果的か。
RQ3ファuzzingはストレージシステム検証のどこに位置し、その根本的な限界は何か。
RQ4AIをどのように活用して、状態を持つ意味論的検証を強化できるか。
RQ5統一的で意味論意識型のストレージスタック検証フレームワークにはどんな将来像があるか。

主な発見

nondeterministic な相互作用、長寿命の状態進化、層間意味論の影響により、ストレージシステムの正確性検証は難しい。
既存技法は複数の障害クラスを網羅するが、意味論的指針、長期的なカバレッジ、スケーラビリティに制約がある。
ファ fuzzing はストレージ意味論と整合しない仮定に依存することが多く、障害の露出が不十分になる。
AI補強アプローチは、状態認識と意味論的指針を提供でき、深い状態や複数フェーズの障害のカバレッジ向上の可能性がある。
ストレージ中心の視点は技法を統合し、今後の意味論意識型検証の主要な課題と機会を浮き彫りにする。

Figure 2 . Four fundamental dimensions of storage-system testing complexity.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。