QUICK REVIEW

[논문 리뷰] Pareto-Conditioned Diffusion Models for Offline Multi-Objective Optimization

Jatan Shrestha, Santeri Heiskanen|arXiv (Cornell University)|2026. 01. 31.

Advanced Multi-Objective Optimization Algorithms인용 수 0

한 줄 요약

PCD는 오프라인 다목적 최적화를 조건부 확산 샘플링으로 재구성하며, 재가중 전략과 참조-방향 스킴을 사용하여 surrogates 모델 없이도 목표 무역-off에 조건화합니다. 벤치마크 전반에서 경쟁력 있고 일관된 성능을 달성합니다.

ABSTRACT

Multi-objective optimization (MOO) arises in many real-world applications where trade-offs between competing objectives must be carefully balanced. In the offline setting, where only a static dataset is available, the main challenge is generalizing beyond observed data. We introduce Pareto-Conditioned Diffusion (PCD), a novel framework that formulates offline MOO as a conditional sampling problem. By conditioning directly on desired trade-offs, PCD avoids the need for explicit surrogate models. To effectively explore the Pareto front, PCD employs a reweighting strategy that focuses on high-performing samples and a reference-direction mechanism to guide sampling towards novel, promising regions beyond the training data. Experiments on standard offline MOO benchmarks show that PCD achieves highly competitive performance and, importantly, demonstrates greater consistency across diverse tasks than existing offline MOO approaches.

연구 동기 및 목표

정적 데이터셋만 이용 가능하고 대리모가 신뢰되지 않을 수 있는 상황에서 오프라인 MOO를 동기부여한다.
목표 파레토 무역-off에 조건을 둔 솔루션을 직접 샘플링하는 조건부 생성 프레임워크를 개발한다.
높은 품질의 Pareto-전면 구간을 강조하기 위한 다목적 재가중 전략을 도입한다.
학습 데이터 너머의 다양하고 새로운 조건점들을 생성하기 위한 참조 방향 메커니즘을 제안한다.

제안 방법

오프라인 데이터셋에서 샘플의 다목적 재가중과 함께 p(x|y;σ) 조건부 확산 모델을 학습한다.
목적 공간의 격자(grid)와 지배 관계 기반 가중치를 사용해 고성능 포인트를 강조하도록 재가중한다.
Classifier-free 가이던스를 사용해 샘플링을 조건화 타깃으로 향하도록 하며 γ가 가이던스 강도를 제어한다.
다양하고 고품질의 타깃을 생성하기 위해 NSGA-III에서 영감을 받은 참조 방향 메커니즘으로 조건점을 생성한다.
샘플링 중에 hat{y} 타깃에 조건화된 해 x를 생성하기 위해 CFG를 적용한다.
256개의 샘플 중 상위 P 백분위수(P=100,75,50)에서 하이퍼볼륨(HV)을 사용해 성능을 평가한다.

Figure 1: Overview of the PCD framework, which reframes offline MOO as a conditional sampling problem. Training: A conditional diffusion model is trained on a static dataset, using a novel reweighting strategy to emphasize high-quality solutions near the Pareto front. Sampling: At inference, the mod

실험 결과

연구 질문

RQ1명시적 목표 surrogate 없이 오프라인 MOO를 조건부 샘플링으로 어떻게 공식화할 수 있는가?
RQ2목표 무역-off에 조건화된 확산 모델이 오프라인 MOO에서 Pareto 전면을 효과적으로 커버할 수 있는가?
RQ3재가중과 참조 방향 조건화가 Pareto 전면의 커버리지와 품질을 향상시키는가?

주요 결과

방법	합성	MORL	RE	과학	MONAS	평균 순위
D best	5.45 0.19	1.70 0.27	2.60 0.07	9.35 0.14	11.53 0.06	7.43 0.05
MOBO	8.69 0.30	14.60 0.42	10.00 0.33	6.75 0.47	8.11 0.80	8.81 0.34
E2E + GN	7.33 0.55	5.70 2.14	7.06 0.32	5.35 1.38	9.33 0.53	7.82 0.40
E2E + PC	5.93 0.25	3.50 1.22	6.22 0.33	4.30 1.32	6.60 0.40	6.01 0.29
E2E	6.16 0.30	9.70 2.08	6.06 0.30	4.20 1.40	5.13 0.22	5.71 0.16
MH + GN	8.82 0.53	8.90 2.16	8.14 0.94	5.05 2.14	12.57 0.40	9.84 0.33
MH + PC	8.87 0.45	10.90 1.08	6.74 0.68	6.15 0.91	7.46 0.30	7.68 0.33
MH	6.18 0.53	8.00 1.41	6.14 0.29	5.80 0.89	5.88 0.49	6.10 0.22
MM + COMs	8.02 0.47	3.60 1.29	6.54 0.17	3.85 0.68	7.22 0.43	6.80 0.13
MM + ICT	6.73 0.46	9.10 1.95	5.44 0.32	5.05 0.74	8.42 0.40	7.08 0.13
MM + IOM	5.16 0.51	12.70 0.91	5.76 0.52	4.40 1.15	5.77 0.50	5.80 0.20
MM + TM	6.55 0.82	7.90 2.16	5.78 0.25	5.90 1.29	7.87 0.39	6.91 0.20
MM	6.07 0.50	9.50 0.79	5.94 0.41	6.55 0.93	4.97 0.46	5.80 0.21
ParetoFlow	2.44 0.28	8.50 1.32	1.74 0.17	9.05 0.27	11.19 0.52	6.74 0.23
PCD (ours)	3.38 0.20	5.50 3.30	1.51 0.13	4.05 0.33	7.54 0.50	4.80 0.30

PCD는 합성, MORL, RE, Scientific, MONAS 과제 카테고리에서 최상의 전체 평균 순위를 달성한다.
PCD는 ParetoFlow와 같은 생성 기반 베이스라인을 능가하고, 여러 과제에서 대리모 기반 베이스라인과 견주거나 우수하다.
제안된 재가중 및 참조 방향 메커니즘이 HV 결과를 일관되게 향상시키는 것으로 확인되었다.
γ가 약 2.5 정도인 classifier-free 가이던스는 그 수치를 넘어서면 수익이 감소한다.
MORL 과제에서는 고차원 검색 공간으로 인해 모든 방법의 성능이 도전적이며, 어떤 방법도 오프라인 데이터셋의 비지배 점을 능가하지 못한다.
PCD는 다양한 벤치마크에서 단 하나의 고정된 하이퍼파라미터 조합으로 견고함을 보인다.

Figure 2: Overview of the conditioning points generation procedure : a) The objective space is partitioned via direction vectors, and points are ranked based on non-dominated sorting. b) Each direction vector is paired with the point closest to it in perpendicular distance (black arrow). The rest of

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.