Skip to main content
QUICK REVIEW

[논문 리뷰] Don't Start Over: A Cost-Effective Framework for Migrating Personalized Prompts Between LLMs

Ziyi Zhao, Chongming Gao|arXiv (Cornell University)|2026. 01. 17.
Topic Modeling인용 수 0
한 줄 요약

PUMA은 서로 호환되지 않는 LLM 간의 사용자 특화 소프 프롬프트를 가볍게 만든 어댑터와 그룹 기반 사용자 선택을 사용하여 이식하며, 전체 재훈련에 필적하거나 이를 능가하는 성능을 유지하면서 비용을 최대 98%까지 절감합니다.

ABSTRACT

Personalization in Large Language Models (LLMs) often relies on user-specific soft prompts. However, these prompts become obsolete when the foundation model is upgraded, necessitating costly, full-scale retraining. To overcome this limitation, we propose the Prompt-level User Migration Adapter (PUMA), a lightweight framework to efficiently migrate personalized prompts across incompatible models. PUMA utilizes a parameter-efficient adapter to bridge the semantic gap, combined with a group-based user selection strategy to significantly reduce training costs. Experiments on three large-scale datasets show our method matches or even surpasses the performance of retraining from scratch, reducing computational cost by up to 98%. The framework demonstrates strong generalization across diverse model architectures and robustness in advanced scenarios like chained and aggregated migrations, offering a practical path for the sustainable evolution of personalized AI by decoupling user assets from the underlying models.

연구 동기 및 목표

  • Address the vulnerability of soft prompts to foundation model upgrades in 1+N personalization systems.
  • Develop a cost-efficient method to migrate thousands of user prompts across different models without full retraining.
  • Propose a parameter-efficient adapter and a group-based user selection strategy to ensure semantic alignment and scalability.
  • Demonstrate robustness across diverse model architectures and advanced migration topologies (chained and aggregated).
  • Show that migration can outperform retraining from scratch while decoupling user assets from models.

제안 방법

  • Introduce an adapter-based migration function Phi parametrized by theta that maps source prompts to target prompts across models with different embedding dimensions.
  • Train Phi end-to-end on a target-model task loss while keeping both the source prompts and models frozen.
  • Use a lightweight feed-forward adapter with residual connections and Layer Normalization as Phi.
  • Implement a two-stage group-based user selection: K-means clustering for diversity, followed by variance-based grouped sampling to form a small representative training subset.
  • Extend PUMA to advanced migration topologies: chained migration and aggregated migration by concatenating prompts from multiple sources.
  • Evaluate on three large-scale datasets (Amazon, MIND, Yelp) across multiple model families to show generalization and robustness.

실험 결과

연구 질문

  • RQ1RQ1: How effective is PUMA at migrating user personalization compared to full retraining?
  • RQ2RQ2: How efficiently does the group-based user selection reduce costs while preserving performance?
  • RQ3RQ3: Does PUMA generalize across diverse model architectures and families?
  • RQ4RQ4: How robust is PUMA in advanced migrations like chained and aggregated migrations?

주요 결과

  • PUMA matches or surpasses full retraining while significantly reducing computation (up to 98% cost reduction).
  • Efficiency gains are achieved via the group-based user selection strategy, outperforming random sampling within the same budget.
  • PUMA generalizes well across diverse architectures and model families, with migrations often yielding gains over retraining.
  • In chained migrations, performance remains robust across multiple successive model updates; aggregated migrations from multiple sources improve personalization by fusing knowledge.
  • The aggregated-source setting can surpass single-source migrations, demonstrating knowledge synergy from multiple foundation models.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.