QUICK REVIEW

[논문 리뷰] APISENSOR: Robust Discovery of Web API from Runtime Traffic Logs

Yaru Yang, Chenxing Zhong|arXiv (Cornell University)|2026. 03. 25.

Software System Performance and Reliability인용 수 0

한 줄 요약

APISensor는 혼합 런타임 트래픽에서 노이즈 제거, 경로 표준화, 그리고 구조 템플릿과 의미적 동작의 이중 단계 그래프 기반 클러스터링을 통해 웹 API를 강건하게 재구성하는 블랙박스 API 탐지 프레임워크이다.

ABSTRACT

Large Language Model (LLM)-based agents increasingly rely on APIs to operate complex web applications, but rapid evolution often leads to incomplete or inconsistent API documentation. Existing work falls into two categories: (1) static, white-box approaches based on source code or formal specifications, and (2) dynamic, black-box approaches that infer APIs from runtime traffic. Static approaches rely on internal artifacts, which are typically unavailable for closed-source systems, and often over-approximate API usage, resulting in high false-positive rates. Although dynamic black-box API discovery applies broadly, its robustness degrades in complex environments where shared collection points aggregate traffic from multiple applications. To improve robustness under mixed runtime traffic, we propose APISENSOR, a black-box API discovery framework that reconstructs application APIs unsupervised. APISENSOR performs structured analysis over complex traffic, combining traffic denoising and normalization with a graph-based two-stage clustering process to recover accurate APIs. We evaluated APISENSOR across six web applications using over 10,000 runtime requests with simulated mixed-traffic noise. Results demonstrate that APISENSOR significantly improves discovery accuracy, achieving an average Group Accuracy Precision of 95.92% and an F1-score of 94.91%, outperforming state-of-the-art methods. Across different applications and noise settings, APISENSOR achieves the lowest performance variance and at most an 8.11-point FGA drop, demonstrating the best robustness among 10 baselines. Ablation studies confirm that each component is essential. Furthermore, APISENSOR revealed API documentation inconsistencies in a real application, later confirmed by community developers.

연구 동기 및 목표

문서가 불완전한 상태로 진화하는 웹 서비스에서 LLM-활성화 에이전트가 작동하도록 견고한 API 탐지를 고무한다.
실세계의 혼합 트래픽 환경에서 정적인 화이트박스 및 순진한 블랙박스 접근의 한계를 다룬다.
정밀도와 강건성을 향상시키기 위해 API 템플릿을 추출하고 의미적 클러스터링으로 이를 정제하는 비지도 파이프라인을 개발한다.

제안 방법

런타임 트래픽에서 비-API 요청을 제거하기 위해 다중 신호 노이즈 필터링을 적용한다.
불필요한 구성 요소를 제거하고 형식을 표준화하여 API 경로를 정규화한다.
두 단계 클러스터링 사용: 1단계는 Drain3 접두 트리를 이용해 구조 템플릿을 채굴하고 경로를 인터페이스 레벨 템플릿으로 그룹화한다(예: /api/v1/items/*).
2단계는 각 템플릿 그룹 내에서 경로 구조, 쿼리 매개변수, 페이로드 복잡도 등의 경량 특징과 그래프 기반 클러스터링을 사용해 의미론적 정제를 수행하고(동작을 구분하기 위해) 그래프 기반 클러스터링을 수행하며(또는 K-평균으로 대체) 행동을 분리한다.
일관성 손실 L_cons = ||A − σ(ZZ^T)||_F^2 와 최종 목적 함수 L = L_cons + λ L_clus에서 클러스터링 정규화 항을 활용해 임베딩을 학습하고 일관된 의미론적 클러스터를 형성한다.

실험 결과

연구 질문

RQ1RQ1: API를 다양한 애플리케이션에서 베이스라인과 비교하여 API를 얼마나 효과적으로 발견하는가?
RQ2RQ2: 다양한 API 트래픽의 소음(혼합 다중 테넌트 게이트웨이)에도 APISensor가 얼마나 견고한가?
RQ3RQ3: 노이즈 제거, 템플릿 마이닝, 의미론적 그래프 클러스터링 각각의 구성 요소가 전체 성능에 기여하는 바는 무엇인가?

주요 결과

APISensor는 여섯 개의 오픈 소스 애플리케이션에서 높은 탐지 정확도를 달성하며 평균 PGA 95.92%, RGA 94.36%, FGA 94.91%를 기록했다.
APISensor는 다양한 노이즈 유형과 수준에서도 강건성을 보이며 FGA 최대 하락폭이 8.11포인트이고 순도 변동은 1% 이내이다.
애플리케이션 및 노이즈 설정 전반에 걸쳐 APISensor가 10개의 baselines 중 가장 낮은 성능 변동성을 보인다.
절취 연구는 각 구성 요소(노이즈 필터링, 템플릿 마이닝, 의미적 정제, 그래프 클러스터링)가 성능에 필수적임을 확인한다.
연구는 10K+ 트래픽 벤치마크를 제공하고 199개의 실제 엔드포인트를 제시하며 커뮤니티가 검증한 Dify에서 그림자 API의 발견을 언급한다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.