QUICK REVIEW

[논문 리뷰] Towards Anytime-Valid Statistical Watermarking

Baihe Huang, Zhongcong Xu|arXiv (Cornell University)|2026. 02. 19.

Advanced Steganography and Watermarking Techniques인용 수 0

한 줄 요약

The paper introduces Anchored E-Watermarking, showing that e-value-based watermarking improves detection efficiency over baselines while maintaining text quality and enabling anytime-valid inferences with faster detection.

ABSTRACT

The proliferation of Large Language Models (LLMs) necessitates efficient mechanisms to distinguish machine-generated content from human text. While statistical watermarking has emerged as a promising solution, existing methods suffer from two critical limitations: the lack of a principled approach for selecting sampling distributions and the reliance on fixed-horizon hypothesis testing, which precludes valid early stopping. In this paper, we bridge this gap by developing the first e-value-based watermarking framework, Anchored E-Watermarking, that unifies optimal sampling with anytime-valid inference. Unlike traditional approaches where optional stopping invalidates Type-I error guarantees, our framework enables valid, anytime-inference by constructing a test supermartingale for the detection process. By leveraging an anchor distribution to approximate the target model, we characterize the optimal e-value with respect to the worst-case log-growth rate and derive the optimal expected stopping time. Our theoretical claims are substantiated by simulations and evaluations on established benchmarks, showing that our framework can significantly enhance sample efficiency, reducing the average token budget required for detection by 13-15% relative to state-of-the-art baselines.

연구 동기 및 목표

지속적이면서도 임의의 중지를 허용하는 워터마킹 방법의 필요성을 제시한다.
생성 품질을 보존하면서 탐지 효율성을 개선하기 위해 Anchored E-Watermarking을 제안한다.
최적 e-값 프레임워크의 이론적 정당성과 중지 시간에 대한 시사점을 제시한다.
실제 세계의 언어 모델에 대해 baselines와 비교하고 효율성 향상을 보고한다.

제안 방법

P-값 대신 e-값을 사용한 워터마킹 탐지 프레임워크인 Anchored E-Watermarking을 제안한다.
Worst-case 로그 증가율에 대한 최적 e-값을 도출하고 특성화한다.
워터마킹 설계를 정당화하기 위해 분포와 운송 계획에 대한 최악의 경우 최적화(worst-case optimization)를 형식화한다.
perturbations와 개인 키에 걸쳐 baselines(예: SEAL)와 비교하여 탐지 효율성과 생성 품질을 실험적으로 평가한다.
이 접근법이 유용성을 해치지 않으면서도 언제든지 유효한 추론이 가능하다는 점을 강조한다.

실험 결과

연구 질문

RQ1e-값이 워터마킹 탐지에 대해 Type-I 오류를 증가시키지 않으면서 유효한 선택적 중지를 보장할 수 있는가?
RQ2Anchored E-Watermarking이 baselines에 비해 탐지 효율성을 개선하고 생성 품질을 유지하는가?
RQ3Worst-case 조건에서 이론적으로 최적의 e-값과 그에 대응하는 중지 시간이 무엇인가?
RQ4제안된 방법이 실제 세계의 언어 모델에서 다양한 perturbation 및 개인 키에 대해 어떻게 작동하는가?

주요 결과

Scheme	Quality (↑)	Size (↓)
Exponential	0.907	∞
Inverse Transform	0.917	734.0
Binary	0.919	∞
Distribution Shift	0.912	145.0
SEAL	0.901	84.5
LABEL:thm:log-growth	0.919	72.0

Anchored E-Watermarking은 탐지 효율성에서 baselines를 현저히 능가하며 생성 품질도 높게 유지된다.
가장 최적화된 비앵커드 기준선과 비교할 때 속도가 거의 2배 향상(145 토큰에서 72.0 토큰으로)되었다.
제안 방법의 품질 점수는 baselines와 경쟁력을 유지하여 탐지 가능성과 유용성 간의 우호적 균형을 나타낸다.
프레임워크는 언제든지 유효한 추론을 가능하게 하여 전통적인 p-값 기반 테스트의 고정 수평 한계를 해결한다.
실제 언어 모델에 대한 실험 결과, 텍스트 품질을 보존하면서 상당한 효율성 향상을 보인다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.