QUICK REVIEW

[论文解读] Persuasion Tokens for Editing Factual Knowledge in LLMs

Paul Youssef, Seifert, Christin|arXiv (Cornell University)|Jan 23, 2026

Topic Modeling被引用 0

一句话总结

该论文提出将说服令牌（P-Tokens）作为一种紧凑、可训练的替代方案，用于在两组数据集和三种模型上实现对事实编辑的高效性，性能与 IKE 相当甚至更好。

ABSTRACT

In-context knowledge editing (IKE) is a promising technique for updating Large Language Models (LLMs) with new information. However, IKE relies on lengthy, fact-specific demonstrations which are costly to create and consume significant context window space. In this paper, we introduce persuasion tokens (P-Tokens) -- special tokens trained to replicate the effect of IKE demonstrations, enabling efficient knowledge editing without requiring fact-specific demonstrations. We evaluate P-Tokens across two editing datasets and three LLMs, demonstrating performance comparable to, and often exceeding, IKE. We further find that editing performance is robust to distractors with small negative effects to neighboring facts, and that increasing the number of P-Tokens improves performance. Our work addresses key limitations of IKE and provides a more practical and scalable alternative for editing LLMs.

研究动机与目标

推动减少对冗长、针对事实的 IKE 演示来更新大语言模型中的事实知识的依赖。
提出将 P-Tokens 作为可训练的特殊令牌，在更短的提示中再现 IKE 的效果。
证明 P-Tokens 在多个数据集和模型上的有效性，并分析对干扰因素的鲁棒性。
评估与 IKE 相比在令牌和推理时间上的效率提升，并探索使用更多 P-Tokens 的可扩展性。

提出的方法

引入 BEGIN_EDIT 和 END_EDIT 作为特殊的 P-Tokens，用于封装编辑并优化它们的嵌入以最小化 KL(P_PT || P_IKE)。
扩展优化至改述与邻近提示场景，以保留目标编辑并限制附带效应。
在训练中引入干扰因素以增强鲁棒性，并评估它们对编辑和邻近事实的影响。
在 CounterFact 和 zsRE 上与多种大模型（GPT-J-6B、Qwen2.5-7B/14B、Llama3-8B）比较 P-Tokens 与 IKE 及基线。
通过 CounterFact 的编辑效果指标 ES、PS、NS，以及 zsRE 的有效性/改述/特异性，外加推理时间指标和令牌数量来衡量编辑效果。

实验结果

研究问题

RQ1P-Tokens 能否在标准知识编辑任务上达到或超越 IKE 的性能？
RQ2增加 P-Tokens 的数量会如何影响编辑效果与效率？
RQ3干扰因素对编辑性能及邻近事实有何影响？
RQ4在模型之间，P-Tokens 是否比 IKE 在推理时间和提示长度上更高效？
RQ5P-Tokens 是否在提示、改述和邻近事实方面具有良好的泛化能力而不降解无关知识？

主要发现

P-Tokens 在 CounterFact 和 zsRE 上对比多种模型和指标时，优于 IKE。
通常增加 P-Tokens 的数量可以提升编辑性能。
干扰因素对编辑/改述提示的有效性影响较小，但对邻近事实可能有负面作用。
与 IKE 相比，P-Tokens 大幅减少提示长度和推理时间（在某些设置下多五倍以上）。
带干扰因素的训练提升鲁棒性；训练中移除干扰因素会降低性能。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。