QUICK REVIEW

[论文解读] Visual Insights into Agentic Optimization of Pervasive Stream Processing Services

Boris Sedlak, Víctor Casamayor Pujol|arXiv (Cornell University)|Feb 19, 2026

Context-Aware Activity Recognition Systems被引用 0

一句话总结

本文介绍 MUDAP 和 RASK，一种两部分的平台与智能体，用于对就地边缘流处理服务进行多维、情境感知的自动扩展，以满足 SLO。它还可可视化智能体如何学习并随时间改进对 SLO 的实现。

ABSTRACT

Processing sensory data close to the data source, often involving Edge devices, promises low latency for pervasive applications, like smart cities. This commonly involves a multitude of processing services, executed with limited resources; this setup faces three problems: first, the application demand and the resource availability fluctuate, so the service execution must scale dynamically to sustain processing requirements (e.g., latency); second, each service permits different actions to adjust its operation, so they require individual scaling policies; third, without a higher-level mediator, services would cannibalize any resources of services co-located on the same device. This demo first presents a platform for context-aware autoscaling of stream processing services that allows developers to monitor and adjust the service execution across multiple service-specific parameters. We then connect a scaling agent to these interfaces that gradually builds an understanding of the processing environment by exploring each service's action space; the agent then optimizes the service execution according to this knowledge. Participants can revisit the demo contents as video summary and introductory poster, or build a custom agent by extending the artifact repository.

研究动机与目标

在动态资源约束下，利用边缘设备对普遍存在的流处理进行自动扩展的动机。
实现细粒度、面向服务的参数调整，超越传统 CPU 扩展。
提供可扩展的智能体方法，学习环境动态并优化多服务配置。
演示智能体内部模型的可视化及其对 SLO 实现的影响。

提出的方法

通过 REST API 引入 MUDAP，实现对服务执行与资源的细粒度调整。
给出 RASK，从时间序列数据建立回归模型以解释参数对 SLO 的影响。
将模型与 SLO 目标结合数值求解器，以推断全局扩展动作。
实现边缘设备上对多个就地服务的自治、分布式扩展。
可视化智能体学习到的世界模型及其对 SLO 实现的时间演化影响。

实验结果

研究问题

RQ1一个多维度自动扩展平台（MUDAP）如何实现对边缘设备上流处理服务的细粒度控制？
RQ2一个扩展智能体（RASK）是否能从时间序列数据学习参数对 SLO 实现的预测模型？
RQ3将回归模型与数值求解器整合是否能在多就地服务之间实现有效的、去中心化的自动扩展？
RQ4与基线或非智能体方法相比，智能体驱动的扩展在多大程度上提升 SLO 实现？

主要发现

演示显示在 agent 驱动扩展的前 300 秒内，SLO 实现率从 56% 提高到 98%。
在 300 秒的使用期内，智能体通过在帕累托前沿的配置选择，维持了高水平的 SLO 实现。
该方法在获得准确环境模型方面比典型的 RL 方法如 Q-learning 更具样本效率。
RASK 在边缘设备上运行，实现对多就地服务的自治、鲁棒扩展。
工件库与可视化演示使研究人员能够将扩展智能体扩展到新服务和新参数上。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。