QUICK REVIEW

[논문 리뷰] Dynasto: Validity-Aware Dynamic-Static Parameter Optimization for Autonomous Driving Testing

Dmytro Humeniuk, Mohammad Hamdaqa|arXiv (Cornell University)|2026. 03. 22.

Autonomous Vehicle Technology and Safety인용 수 0

한 줄 요약

Dynasto는 유효성 제약 하에서 동적 적대적 행동과 정적 초기 조건을 공동으로 최적화하여 ADS 테스트에서 현실적인 안전-중요 실패를 발견합니다. 이는 STL 기반 유효성, 초기 조건에 대한 GA, 그래프 기반 실패 군집화를 이용합니다.

ABSTRACT

Extensive simulation-based testing is important for assuring the safety of autonomous driving systems (ADS). However, generating safety-critical traffic scenarios remains challenging because failures often arise from rare, complex interactions with surrounding vehicles. Existing automatic scenario-generation approaches frequently fail to distinguish genuine ADS faults from collisions caused by implausible or invalid adversarial behaviors, and they typically optimize either scenario initialization or agent behavior in isolation. We propose Dynasto, a two-step testing approach that jointly optimizes initial scenario parameters and dynamic adversarial behaviors to uncover realistic safety-critical failures. First, we train an adversarial agent using reinforcement learning (RL) with temporal-logic-based validity criteria and a safe-distance model inspired by ISO 34502 to promote behaviorally plausible failures. Second, a genetic algorithm (GA) searches over initial conditions while replaying the adversary's failure-inducing behaviors to reveal additional failures that the RL agent alone does not uncover. Finally, a graph-based clustering pipeline groups failures into representative modes based on semantic event sequences. Our evaluation experiments in HighwayEnv across two ADS controllers show that Dynasto finds 60%-70% more valid failures than an RL-only adversary under the same evaluation budget. With clustering, we obtain about 12 interpretable failure modes per system under test, revealing valid failures driven by weaknesses in ego-controller behavior. These results indicate that coordinated dynamic-static optimization with explicit validity constraints is effective for exposing safety-relevant failures in ADS testing.

연구 동기 및 목표

ADS 테스트에서 유효하고 invalid한 실패를 구분하여 안전하고 현실적인 실패 발견을 촉진한다.
동적 적대자 최적화와 정적 시나리오 탐색을 연결하는 2단계 테스트 워크플로를 개발한다.
사후 실패 분석을 가능하게 하기 위해 실패를 해석 가능한 모드로 군집화한다.

제안 방법

유효성 기반 보상과 STL 기반 안전 기준을 갖춘 RL 적대자를 학습하여 유효한 실패를 생성한다.
적대자의 실패 유발 행동을 재생하여 정적 초기 조건을 탐색하는 GA를 사용하고 추가적인 실패를 발견한다.
ISO 34502에 맞춘 신호-시간-논리(Signal Temporal Logic) 규칙으로 유효/무효 실패를 라벨링한다.
실패 추적을 기록하고 그래프 기반 군집화 파이프라인(kNN 그래프 + LeidEN)을 적용하여 실패 모드를 식별한다.
사건 시퀀스로 실패를 표현하고 레벤슈타인 거리를 사용해 군집화에서 유사성을 측정한다.
GA 평가를 RL 주도 실패 추적과 통합하기 위한 ask-tell 인터페이스를 사용한다.

실험 결과

연구 질문

RQ1시나리오 생성 중 잘못된 적대적 행동으로 인한 크래시와 실제 ADS 오 faults를 구분할 수 있는 방법은 무엇인가?
RQ2동적(RL)과 정적(GA) 최적화를 조정하는 것이 RL만 사용하는 것보다 안전하고 관련된 실패를 더 잘 발견하게 하는가?
RQ3수집된 추적에서 후사 군집화가 해석 가능한 실패 모드를 드러낼 수 있는가?
RQ4유효성 제약이 발견된 실패의 다양성과 품질에 미치는 영향은 무엇인가?
RQ5Leiden 군집화 접근 방식이 유사한 실패 추적을 의미 있는 모드로 그룹화하는 데 얼마나 효과적인가?

주요 결과

Dynasto는 동일한 평가 예산에서 DQN만 사용하는 적대자에 비해 60%–70% 더 많은 유효 실패를 발견한다.
군집화는 시험 대상 시스템당 약 12개의 해석 가능한 실패 모드를 산출한다.
사후 분석은 자이 오-차의 행동의 약점을 바탕으로 수백 개의 실패를 응집된 모드로 묶는다.
2단계 접근 방식은 다양한 정적 초기화하에서 적대적 행동을 재생 재생함으로써 RL 전용보다 더 넓은 실패 공간을 확장한다.
유효성 의미와 명시적 안전 제약은 발견된 실패의 현실성을 크게 향상시킨다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.