QUICK REVIEW

[논문 리뷰] Visual Insights into Agentic Optimization of Pervasive Stream Processing Services

Boris Sedlak, Víctor Casamayor Pujol|arXiv (Cornell University)|2026. 02. 19.

Context-Aware Activity Recognition Systems인용 수 0

한 줄 요약

본 논문은 맥락 인지 다차원 자동스케일링을 위한 MUDAP과 RASK를 제시한다. 이는 같은 위치의 엣지 스트림 처리 서비스의 SLO를 충족하기 위한 두 부분으로 구성된 플랫폼과 에이전트이다. 또한 에이전트가 어떻게 학습하고 시간이 지남에 따라 SLO 충족을 개선하는지 시각화한다.

ABSTRACT

Processing sensory data close to the data source, often involving Edge devices, promises low latency for pervasive applications, like smart cities. This commonly involves a multitude of processing services, executed with limited resources; this setup faces three problems: first, the application demand and the resource availability fluctuate, so the service execution must scale dynamically to sustain processing requirements (e.g., latency); second, each service permits different actions to adjust its operation, so they require individual scaling policies; third, without a higher-level mediator, services would cannibalize any resources of services co-located on the same device. This demo first presents a platform for context-aware autoscaling of stream processing services that allows developers to monitor and adjust the service execution across multiple service-specific parameters. We then connect a scaling agent to these interfaces that gradually builds an understanding of the processing environment by exploring each service's action space; the agent then optimizes the service execution according to this knowledge. Participants can revisit the demo contents as video summary and introductory poster, or build a custom agent by extending the artifact repository.

연구 동기 및 목표

동적 자원 제약 하에 엣지 장치에서 보편적으로 확산되는 스트림 처리의 자동 스케일링을 촉진한다.
전통적인 CPU 스케일링을 넘어 서비스별로 세밀한 매개변수 조정을 가능하게 한다.
환경 역학을 학습하고 다중 서비스 구성을 최적화하는 확장 가능한 에이전트적 접근법을 제공한다.
에이전트의 내부 모델과 SLO 충족에 미치는 영향을 시각화하는 것을 보여준다.

제안 방법

REST API를 통해 서비스 실행 및 자원의 미세한 조정을 위한 MUDAP를 도입한다.
SLO에 대한 매개변수 효과를 해석하기 위해 시계열 데이터로 회귀 기반 모델을 구축하는 RASK를 제시한다.
모델과 SLO 목표를 수치 해석기로 결합하여 글로벌 확장 조치를 도출한다.
엣지 장치에서 다중 로컬 서비스의 자율적이고 분산된 스케일링을 가능하게 한다.
에이전트가 학습한 세계 모델과 시간이 지남에 따른 SLO 충족에 미치는 영향을 시각화한다.

실험 결과

연구 질문

RQ1다차원 자동 스케일링 플랫폼(MUDAP)이 엣지 장치의 스트림 처리 서비스를 어떻게 세밀하게 제어하게 할 수 있는가?
RQ2스케일링 에이전트(RASK)가 시계열 데이터에서 매개변수 효과가 SLO 충족에 미치는 예측 모델을 학습할 수 있는가?
RQ3회귀 기반 모델과 수치 해석기를 통합하면 다중 로컬 서비스 간의 효과적이고 분산된 자동 스케일링을 얻을 수 있는가?
RQ4에이전트 주도 확장이 기준선 또는 비에이전트 접근법에 비해 SLO 충족을 얼마나 개선하는가?

주요 결과

데모는 에이전트 주도 확장을 통해 300초 이내에 SLO 충족이 56%에서 98%로 개선되는 것을 보여준다.
300초의 활용 기간 동안 에이전트는 최적 배정의 파레토 전면을 따라 구성을 선택하여 높은 SLO 충족을 유지한다.
이 방법은 Q-learning과 같은 일반적인 RL 방법에 비해 샘플 효율이 더 높아 정확한 환경 모델을 달성한다.
RASK는 엣지 장치에서 작동하여 다중 로컬 서비스의 자율적이고 견고한 확장을 가능하게 한다.
아티팩트 저장소와 시각적 데모는 연구자들이 새로운 서비스와 매개변수에 Scaling 에이전트를 확장하도록 허용한다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.