QUICK REVIEW

[论文解读] Beyond Hard Writes and Rigid Preservation: Soft Recursive Least-Squares for Lifelong LLM Editing

Xinyu Wang, Sicheng Lyu|arXiv (Cornell University)|Jan 22, 2026

Digital Humanities and Scholarship被引用 0

一句话总结

RLSEdit 引入带软 adherence 与锚点保留的递归最小二乘编辑器，用于终身顺序编辑，实现可扩展、稳定的更新，同时保留通用能力。

ABSTRACT

Model editing updates a pre-trained LLM with new facts or rules without re-training, while preserving unrelated behavior. In real deployment, edits arrive as long streams, and existing editors often face a plasticity-stability dilemma: locate-then-edit "hard writes" can accumulate interference over time, while null-space-style "hard preservation" preserves only what is explicitly constrained, so past edits can be overwritten and unconstrained behaviors may deviate, degrading general capabilities in the many-edits regime. We propose RLSEdit, a recursive least-squares editor for long sequential editing. RLSEdit formulates editing as an online quadratic optimization with soft constraints, minimizing a cumulative key-value fitting objective with two regularizers that control for both deviation from the pre-trained weights and from a designated anchor mapping. The resulting update admits an efficient online recursion via the Woodbury identity, with per-edit cost independent of history length and scaling only with the current edit size. We further provide deviation bounds and an asymptotic characterization of the adherence-preservation trade-off in the many-edits regime. Experiments on multiple model families demonstrate stable scaling to 10K edits, outperforming strong baselines in both edit success and holistic stability -- crucially retaining early edits, and preserving general capabilities on GLUE and held-out reasoning/code benchmarks.

研究动机与目标

Motivate lifelong sequential model editing that balances learning new facts with preserving prior knowledge.
Formulate editing as online regularized least-squares with two deviation controls (from initial weights and from an anchor mapping).
Develop a Woodbury-based online recursion enabling per-edit cost that scales with current edit size, not history.
Provide theoretical deviation bounds and an asymptotic analysis for many-edits regimes.
Demonstrate superior edit success and stability across large edit streams on multiple model families.

提出的方法

Formulate a quadratic objective that sums past and current edit residuals plus two regularizers: deviation from the initial weights and deviation from a designated anchor mapping.
Derive a closed-form solution using normal equations and implement an online Woodbury recursion to update the inverse covariance C_t and weights W_t with per-edit cost depending on the current edit size u_t.
Use a Cholesky-based stabilization of the Woodbury update to ensure numerical stability.
Provide deviation bounds and an asymptotic characterization showing convergence to a ridge-regularized population minimizer under mild assumptions.
Compare RLSEdit against AlphaEdit, MEMIT, ROME, and FT on Llama-3-8B and Qwen2.5-7B with 10k edits, reporting multiple metrics (Efficacy, Generalization, Specificity, Fluency, Consistency).

Figure 1: The recursive workflow of our RLS-Woodbury editor. The process alternates between updating the covariance state via the Woodbury identity (Phase 1) and updating weights (Phase 2). The highlighted block shows how we reduce complexity from $O\!\left(d_{k}^{3}\right)$ to $O\!\left(d_{k}^{2}u_

实验结果

研究问题

RQ1Can lifelong sequential editing be effectively achieved with soft constraints that interpolate between hard writes and hard preservation?
RQ2What is the computational cost of online RLSEdit per edit relative to past edits, and how does it scale with edit size?
RQ3Does RLSEdit preserve general capabilities while applying many sequential edits on large language models?
RQ4How do deviation parameters lambda and mu influence parameter drift and anchor-adherence over long edit streams?
RQ5How does RLSEdit perform compared to state-of-the-art editors under long edit streams across multiple model families?

主要发现

Model	Efficacy	Generalization	Specificity	Fluency	Consistency
Llama-3-8B	89.94 ± 0.75	72.84 ± 1.21	60.56 ± 0.35	615.58 ± 4.34	26.27 ± 0.35
Qwen2.5-7B	94.45 ± 1.07	68.55 ± 0.47	73.37 ± 0.44	625.74 ± 0.71	31.62 ± 0.81

RLSEdit achieves strong edit success and stability across 10K sequential edits on Llama-3-8B and Qwen2.5-7B, outperforming baselines on multiple metrics.
On Llama-3-8B, RLSEdit attains Efficacy 89.94, Generalization 72.84, Specificity 60.56, Fluency 615.58, Consistency 26.27, all higher than baselines.
On Qwen2.5-7B, RLSEdit achieves Efficacy 94.45, Generalization 68.55, Specificity 73.37, Fluency 625.74, Consistency 31.62, with RLSEdit often leading in Efficacy and Fluency.
RLSEdit preserves early edits better than AlphaEdit, ROME, MEMIT, and FT, maintaining performance on general-language understanding and reasoning benchmarks across editing checkpoints.
The method scales with per-edit cost dependent on the current edit size u_t, not the total number of past edits, enabling stable long-stream editing.
Empirical results show RLSEdit maintains general capabilities across GLUE tasks and code/math benchmarks better than baselines over the editing trajectory.

Figure 2: Evolution of objective terms over $10\mathsf{K}$ edits. We compare RLSEdit against baselines ( AlphaEdit , MEMIT ) on three metrics: Term 1 ( $\|{\bm{K}}_{t}{\bm{W}}-{\bm{V}}_{t}\|_{F}^{2}$ ) measures the fitting error for the current edit; Term 2 ( $\|{\bm{W}}-{\bm{W}}_{0}\|_{F}^{2}$ ) me

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。