QUICK REVIEW

[논문 리뷰] SWE-Hub: A Unified Production System for Scalable, Executable Software Engineering Tasks

Yucheng Zeng, Shupeng Li|arXiv (Cornell University)|2026. 02. 28.

Software Engineering Research인용 수 0

한 줄 요약

SWE-Hub는 환경 프로비저닝의 통합, 확장 가능한 작업 합성, 다양한 작업 생성을 하나로 묶어 대규모로 실행 가능한 소프트웨어 엔지니어링(SWE) 작업을 생성하는 엔드-투-엔드 데이터 팩토리를 소개합니다. 이는 수정 작업, 현실성 주도적 회귀 및 장기-지향 저장소 구축의 세 가지 작업 생산 라인으로 공유 실행 기반을 구축합니다.

ABSTRACT

Progress in software-engineering agents is increasingly constrained by the scarcity of executable, scalable, and realistic data for training and evaluation. This scarcity stems from three fundamental challenges in existing pipelines: environments are brittle and difficult to reproduce across languages; synthesizing realistic, system-level bugs at scale is computationally expensive; and existing data predominantly consists of short-horizon repairs, failing to capture long-horizon competencies like architectural consistency. We introduce extbf{SWE-Hub}, an end-to-end system that operationalizes the data factory abstraction by unifying environment automation, scalable synthesis, and diverse task generation into a coherent production stack. At its foundation, the extbf{Env Agent} establishes a shared execution substrate by automatically converting raw repository snapshots into reproducible, multi-language container environments with standardized interfaces. Built upon this substrate, extbf{SWE-Scale} engine addresses the need for high-throughput generation, combining cross-language code analysis with cluster-scale validation to synthesize massive volumes of localized bug-fix instances. extbf{Bug Agent} generates high-fidelity repair tasks by synthesizing system-level regressions involving cross-module dependencies, paired with user-like issue reports that describe observable symptoms rather than root causes. Finally, extbf{SWE-Architect} expands the task scope from repair to creation by translating natural-language requirements into repository-scale build-a-repo tasks. By integrating these components, SWE-Hub establishes a unified production pipeline capable of continuously delivering executable tasks across the entire software engineering lifecycle.

연구 동기 및 목표

소프트웨어 엔지니어링 에이전트를 교육하고 평가하기 위한 실행 가능하고 확장 가능한 데이터의 부족 문제를 해결한다.
환경 자동화, 확장 가능한 합성, 다양한 작업 생성을 생산 스택으로 통합한다.
재현 가능한 태스크를 위한 표준화된 인터페이스와 검증을 갖춘 공유 실행 기저를 제공한다.
수리, 회귀 현실성, 장기-지향 저장소 구축에 걸친 여러 작업 계열을 제공한다.

제안 방법

Env Agent는 원시 저장소 스냅샷을 재현 가능하고 다중 언어 컨테이너 환경으로 변환하며 표준화된 인터페이스를 갖춘다.
SWE-Scale 엔진은 다중 언어 분석과 자동 검증을 통해 대규모의 로컬화된 버그-수정 인스턴스를 생성한다.
Bug Agent는 단서의 원인 추정을 제시하지 않고 증상을 설명하는 사용자 유사 이슈 보고서를 갖춘 시스템 수준 회귀를 합성한다.
SWE-Architect는 자연어 요구사항을 저장소 규모의 처음부터 빌드하는 작업으로 변환하여 장기-지향 데이터 생성을 가능하게 한다.
모든 작업 라인은 쿠버네티스 기반 실행 기저, 통합 작업 스키마, 재현성을 위한 결정론적 검증 게이트를 공유한다.

Figure 1: SWE-Hub architecture. Starting from raw source repositories, the Env Agent provisions a deterministic execution substrate—pinned container images, a unified verification entrypoint, and deterministic artifacts—to make code runnable and testable. On top of this substrate, three task generat

실험 결과

연구 질문

RQ1다중 언어에 걸쳐 원시 저장소를 실행 가능하고 검증 가능한 작업 인스턴스로 신뢰할 수 있게 변환하는 생산 시스템은 어떻게 구현할 수 있는가?
RQ2확장 가능한 합성 및 현실적인 버그 생성이 통합 검증기 아래에서 장기-지향 저장소 구축 작업과 함께 통합될 수 있는가?
RQ3교차 언어 재현성, 결정론적 환경 프로비저닝 및 확장 가능한 작업 생성을 가능하게 하는 SWE의 아키텍처 설계는 무엇인가?

주요 결과

실행 가능한 SWE 작업을 위해 환경 설정, 후보 생성 및 검증을 표준화하는 통합 데이터 팩토리 아키텍처가 제안된다.
자동화된 재현성 기저(Env Agent)는 환경 준비를 테스트 정답성과 분리하고 표준화된 엔트리포인트를 가진 버전화된 이미지를 생성한다.
세 가지 제품 라인(SWE-Scale, Bug Agent, SWE-Architect)은 동일한 기저 위에서 수리 중심의 합성, 현실성 지향의 회귀 및 장기-지향 저장소 구축을 가능하게 한다.
검증은 무상태 워커와 다양한 생태계에서 결정론적이고 기계가 해석 가능한 출력과 함께 확장성의 1차 관심사로 다루어진다.

Figure 2: SWE-Hub Environment Layer pipeline. Phase 1 (Env Agent) performs environment provisioning and build readiness, identifying toolchains and installing dependencies to achieve environment readiness without requiring tests to pass. Phase 2 (Test Agent) establishes a unified verification interf

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.