QUICK REVIEW

[논문 리뷰] CRASH: Cognitive Reasoning Agent for Safety Hazards in Autonomous Driving

Erick Silva, Rehana Yasmin|arXiv (Cornell University)|2026. 03. 16.

Autonomous Vehicle Technology and Safety인용 수 0

한 줄 요약

CRASH는 2168개의 NHTSA AV 사건 보고서(2021–2025)를 분석하여 주요 원인과 AV 기여를 규명하는 LLM 기반 에이전트로, 전문가 정합성과 확장 가능한 구조화된 안전 분석을 달성합니다.

ABSTRACT

As AVs grow in complexity and diversity, identifying the root causes of operational failures has become increasingly complex. The heterogeneity of system architectures across manufacturers, ranging from end-to-end to modular designs, together with variations in algorithms and integration strategies, limits the standardization of incident investigations and hinders systematic safety analysis. This work examines real-world AV incidents reported in the NHTSA database. We curate a dataset of 2,168 cases reported between 2021 and 2025, representing more than 80 million miles driven. To process this data, we introduce CRASH, Cognitive Reasoning Agent for Safety Hazards, an LLM-based agent that automates reasoning over crash reports by leveraging both standardized fields and unstructured narrative descriptions. CRASH operates on a unified representation of each incident to generate concise summaries, attribute a primary cause, and assess whether the AV materially contributed to the event. Our findings show that (1) CRASH attributes 64% of incidents to perception or planning failures, underscoring the importance of reasoning-based analysis for accurate fault attribution; and (2) approximately 50% of reported incidents involve rear-end collisions, highlighting a persistent and unresolved challenge in autonomous driving deployment. We further validate CRASH with five domain experts, achieving 86% accuracy in attributing AV system failures. Overall, CRASH demonstrates strong potential as a scalable and interpretable tool for automated crash analysis, providing actionable insights to support safety research and the continued development of autonomous driving systems.

연구 동기 및 목표

수동 검토를 넘어 대규모 AV 사고 서사를 대상으로 구조화된 추론을 자동화한다.
사고 보고서에서 주요 원인을 규명하고 AV의 실패한 부분 시스템을 식별한다.
각 사고에 자율주행차가 실질적으로 기여했는지 평가한다.
안전 연구 및 정책 통찰을 지원하기 위한 해석 가능한 요약과 데이터 준비 출력물을 제공한다.

제안 방법

2021–2025년 NHTSA AV 사건 보고서 2,168건의 데이터세트를 선별하고 정리한다 (~80 million miles).
분석과 시뮬레이션 준비 출력물을 위한 사전처리, 처리(LLM 추론), 사후처리의 3단계 CRASH 파이프라인을 설계한다.
신뢰할 수 있는 JSON 형식 출력을 보장하기 위해 도메인 규칙과 원샷 예제를 갖춘 제약된 프롬프트 기반 LLM 접근법을 사용한다.
시스템 실패, 인간 요인, 환경 조건의 세 가지 포괄 범주로 AV 사고 원인 분류 체계(태생)를 개발한다.
사람이 개입하는 전문가 검토를 통한 출력 검증 및 두 가지 NLP 베이스라인(다수 클래스 및 키워드 규칙)과의 비교를 수행한다.

실험 결과

연구 질문

RQ1CRASH가 서로 다른 사고 서사에서 AV 책임을 신뢰성 있게 규명할 수 있는가?
RQ2주요 원인 차원에서 전문가 판단 및 기준 NLP 방법과 비교하여 CRASH의 성능은 어떠한가?
RQ3CRASH 파이프라인이 대규모 사고 데이터셋을 처리할 만큼 확장 가능하고 효율적인가?
RQ4대규모 서사 분석에서 어떤 시스템 차원의 안전 패턴(예: 지연, 인지 실패)이 도출되는가?

주요 결과

64%의 사고가 인지 또는 계획 실패(시스템 관련 추론)에 기인한다.
보고된 사고의 약 50%가 후방 추돌로 발생한다.
CRASH는 전문가 판단과 대조했을 때 AV 책임에 대한 정확도 86%, 늦은 AI 탐지에 대해 84%; 주요 원인 규명에 대한 76%; 고장난 부분 시스템 규명에 대해 46%를 달성한다.
케이스당 추론 시간은 두 개의 NVIDIA A4500 GPU에서 약 30초로 수동 검토보다 빠른 처리 속도를 가능하게 한다.
CRASH는 AV 실패, 지연된 AI, 원인, 시스템 실패 차원에서 기준선 방법(다수 클래스 및 키워드 규칙)을 능가한다(표 4 결과).

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.