QUICK REVIEW

[논문 리뷰] Defusing Logic Bombs in Symbolic Execution with LLM-Generated Ghost Code

Dimitrios Stamatios Bouras, Sergey Mechtaev|arXiv (Cornell University)|2026. 01. 31.

Logic, programming, and type systems인용 수 0

한 줄 요약

본 논문은 LLM으로 생성된 고스트 코드(Ghost code)를 활용해 SMT 솔버가 솔버-적대적 코드(solver-hostile code)를 다루도록 보조하는 하이브리드 심볼릭 실행 프레임워크인 Gordian를 제시하며, 베이스라인 대비 커버리지를 크게 향상시키고 LLM 토큰 사용량을 줄인다.

ABSTRACT

Symbolic execution is a powerful program analysis technique, but its effectiveness is fundamentally limited by solver-hostile program fragments, complex numerical reasoning, and unbounded heap structures. Recent work proposed replacing constraint solvers with large language models (LLMs) to bypass these limitations, but such approaches struggle to analyze real-world codebases, where deep execution paths require globally consistent reasoning across many interacting constraints. We present Gordian, a hybrid symbolic execution framework that uses LLMs selectively to generate lightweight ghost code that aids an SMT solver in handling solver-hostile code fragments, while preserving its precise, global reasoning capability. In particular, we propose three types of ghost code: (1) inversion of difficult code fragments with iterative bidirectional constraint propagation, (2) modeling via solver-friendly surrogates while preserving relevant behavior, and (3) semantic partitioning of unbounded heap spaces. We implemented Gordian on top of the KLEE symbolic execution engine and evaluated it on synthetic "logic bombs" capturing distinct symbolic reasoning challenges, a popular mathematical library FDLibM, and three structured-input programs (libexpat, jq, and bc). Across all benchmarks, Gordian improves coverage on average by 52-84% over traditional symbolic execution baselines, and by 86-419% over LLM-based techniques, while reducing LLM token usage by an average of 90-96%. This highlights the practicality and effectiveness of this approach in real-world settings.

연구 동기 및 목표

전통적인 심볼릭 실행이 solver-hostile 코드, 복잡한 수치 추론, 그리고 무한 확장되는 힙 구조에서 가지는 한계를 제시한다.
글로벌 추론을 희생하지 않으면서 SMT 해석을 돕는 가벼운 고스트 코드를 생성하기 위해 LLM를 선택적으로 사용하는 하이브리드 프레임워크를 제안한다.
세 가지 고스트 코드 변형(역전, 대리 코드, 의미적 힙 분할)을 도입하고 이를 KLEE와 통합한다.
합성 LogicBombs, fdlibm, 그리고 실제 세계의 구조화된 입력 프로그램들에서 향상된 커버리지를 시연한다.]
methodAndDetails
["LLM으로 solver-hostile 프래그먼트를 식별하고 SMT 기반 워크플로에 연결되는 고스트 코드를 생성한다.","세 가지 고스트 코드 유형을 제안한다: 순전달/역전 제약 전파를 통한 프래그먼트 역전(fragment inversion), 솔버 친화적 대리 코드(solver-friendly surrogates), 그리고 의미적 힙 토폴로지 제약.","고스트 코드를 Gordian의 심볼릭 실행 엔진(KLEE 위에 구축)에 통합하여 제약을 전파하고 경로 폭주를 줄인다.","변환된 프래그먼트를 둘러싼 접두 경로 제약과 접미 경로 제약을 조화시키기 위한 양방향 제약 전파를 제공한다.","KLEE에서 구체적으로 구현하고 LogicBombs, fdlibm, libexpat, jq, bc와 같은 프로젝트를 대상으로 평가한다."]
research_questions':['LLM생성 고스트 코드가 SMT 솔버가 solver-hostile 프래그먼트를 전통적 심볼릭 실행만으로는 더 효과적으로 추론하도록 만들 수 있는가?','세 가지 고스트 코드 변형(역전, 대리 코드, 의미적 힙 제약)이 다양한 벤치마크에서 성능 향상에 기여하는가?','전통적 심볼릭 실행과 순수 LLM 기반 방법에 비해 Gordian의 커버리지와 LLM 토큰 사용량은 어떻게 되는가?','고스트 코드의 입력이 원래 프로그램과 대조되어 확인되는 상황에서 해결에 도움을 줄 때 건전성이 보존되는가?','복잡한 동적 데이터 구조를 가진 실제 코드베이스에 대한 Gordian의 실질적 영향은 무엇인가?]
key_findings':['Gordian은 벤치마크 전반에서 전통적 심볼릭 실행 기반 대비 52–84% 더 높은 커버리지를 달성한다.','Gordian은 최첨단 LLM 기반 심볼릭 실행 기법보다 86–419% 더 높은 커버리지를 달성한다.','Gordian은 평균적으로 LLM 토큰 사용을 90–96% 줄인다.','각 고스트 코드 유형은 제거 연구에서 보이는 바와 같이 성능 향상에 기여한다.','Gordian은 합성 LogicBombs, fdlibm 수학 라이브러리, 그리고 libexpat, jq, bc와 같은 구조화된 입력 프로그램에서 효과적임을 입증한다.]
table_headers:[]
table_rows:[]} }{

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.