UP-TO-DATE LLM Watermark paper. 🔥🔥🔥
This repo collects papers on watermarking for text and images.
Topic-based Watermarks for LLM-Generated Text. Preprint.
Alexander Nemecek, Yuzhou Jiang, Erman Ayday
A Statistical Framework of Watermarks for Large Language Models: Pivot, Detection Efficiency and Optimal Rules. Preprint.
Xiang Li, Feng Ruan, Huiyuan Wang, Qi Long, Weijie J. Su
WaterJudge: Quality-Detection Trade-off when Watermarking Large Language Models. Preprint.
Piotr Molenda, Adian Liusie, Mark J. F. Gales
Duwak: Dual Watermarks in Large Language Models. Preprint.
Chaoyi Zhu, Jeroen Galjaard, Pin-Yu Chen, Lydia Y. Chen
Lost in Overlap: Exploring Watermark Collision in LLMs. Preprint.
Yiyang Luo, Ke Lin, Chao Gu
WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off. Preprint.
Eva Giboulot, Teddy Furon
WARDEN: Multi-Directional Backdoor Watermarks for Embedding-as-a-Service Copyright Protection. Preprint.
Anudeex Shetty, Yue Teng, Ke He, Qiongkai Xu
EmMark: Robust Watermarks for IP Protection of Embedded Quantized Large Language Models. Preprint.
Ruisi Zhang, Farinaz Koushanfar
Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models. Preprint.
Mingjia Huo, Sai Ashish Somayajula, Youwei Liang, Ruisi Zhang, Farinaz Koushanfar, Pengtao Xie
Attacking LLM Watermarks by Exploiting Their Strengths. Preprint.
Qi Pang, Shengyuan Hu, Wenting Zheng, Virginia Smith
Multi-Bit Distortion-Free Watermarking for Large Language Models. Preprint.
Watermarking Makes Language Models Radioactive. Preprint.
Tom Sander, Pierre Fernandez, Alain Durmus, Matthijs Douze, Teddy Furon
Can Watermarks Survive Translation? On the Cross-lingual Consistency of Text Watermark for Large Language Models. Preprint.
Zhiwei He, Binglin Zhou, Hongkun Hao, Aiwei Liu, Xing Wang, Zhaopeng Tu, Zhuosheng Zhang, Rui Wang
GumbelSoft: Diversified Language Model Watermarking via the GumbelMax-trick. Preprint.
Jiayi Fu, Xuandong Zhao, Ruihan Yang, Yuansen Zhang, Jiangjie Chen, Yanghua Xiao
k-SemStamp: A Clustering-Based Semantic Watermark for Detection of Machine-Generated Text. Preprint.
Abe Bohan Hou, Jingyu Zhang, Yichen Wang, Daniel Khashabi, Tianxing He
Proving membership in LLM pretraining data via data watermarks. Preprint.
Johnny Tian-Zheng Wei, Ryan Yixiang Wang, Robin Jia
Permute-and-Flip: An optimally robust and watermarkable decoder for LLMs. Preprint.
Provably Robust Multi-bit Watermarking for AI-generated Text via Error Correction Code. Preprint.
Instructional Fingerprinting of Large Language Models. Preprint.
Adaptive Text Watermark for Large Language Models. Preprint.
Excuse me, sir? Your language model is leaking (information). Preprint.
Or Zamir
Cross-Attention Watermarking of Large Language Models. ICASSP 2024.
Folco Bertini Baldassini, Huy H. Nguyen, Ching-Chung Chang, Isao Echizen
Optimizing watermarks for large language models. Preprint.
Bram Wouters
Towards Optimal Statistical Watermarking. Preprint.
Baihe Huang, Banghua Zhu, Hanlin Zhu, Jason D. Lee, Jiantao Jiao, Michael I. Jordan
A Survey of Text Watermarking in the Era of Large Language Models. Preprint. Survey paper.
Aiwei Liu, Leyi Pan, Yijian Lu, Jingjing Li, Xuming Hu, Lijie Wen, Irwin King, Philip S. Yu
On the Learnability of Watermarks for Language Models. Preprint.
Chenchen Gu, Xiang Lisa Li, Percy Liang, Tatsunori Hashimoto
New Evaluation Metrics Capture Quality Degradation due to LLM Watermarking. Preprint.
Karanpartap Singh, James Zou
Mark My Words: Analyzing and Evaluating Language Model Watermarks. Preprint.
Julien Piet, Chawin Sitawarin, Vivian Fang, Norman Mu, David Wagner
I Know You Did Not Write That! A Sampling Based Watermarking Method for Identifying Machine Generated Text. Preprint.
Kaan Efe Keleş, Ömer Kaan Gürbüz, Mucahid Kutlu
Improving the Generation Quality of Watermarked Large Language Models via Word Importance Scoring. Preprint.
Performance Trade-offs of Watermarking Large Language Models. Preprint.
X-Mark: Towards Lossless Watermarking Through Lexical Redundancy. Preprint.
WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models. Preprint.
Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models. Preprint.
Hanlin Zhang, Benjamin L. Edelman, Danilo Francati, Daniele Venturi, Giuseppe Ateniese, Boaz Barak
REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models. Preprint.
Embarrassingly Simple Text Watermarks. Preprint.
Necessary and Sufficient Watermark for Large Language Models. Preprint.
Functional Invariants to Watermark Large Transformers. Preprint.
Watermarking LLMs with Weight Quantization. EMNLP 2023 Findings.
DiPmark: A Stealthy, Efficient and Resilient Watermark for Large Language Models. Preprint.
A Semantic Invariant Robust Watermark for Large Language Models. Preprint.
SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation. Preprint.
Advancing Beyond Identification: Multi-bit Watermark for Language Models. Preprint.
Three Bricks to Consolidate Watermarks for Large Language Models. Preprint.
Towards Codable Text Watermarking for Large Language Models. Preprint.
A Private Watermark for Large Language Models. Preprint.
Robust Distortion-free Watermarks for Language Models. Preprint.
Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy. Preprint.
Provable Robust Watermarking for AI-Generated Text. Preprint.
On the Reliability of Watermarks for Large Language Models. Preprint.
Undetectable Watermarks for Language Models. Preprint.
Watermarking Text Data on Large Language Models for Dataset Copyright Protection. Preprint.
Baselines for Identifying Watermarked Large Language Models. Preprint.
Who Wrote this Code? Watermarking for Code Generation. Preprint.
Robust Multi-bit Natural Language Watermarking through Invariant Features. ACL 2023.
Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark. ACL 2023.
Watermarking Text Generated by Black-Box Language Models. Preprint.
Protecting Language Generation Models via Invisible Watermarking. ICML 2023.
A Watermark for Large Language Models. ICML 2023. Outstanding Paper Award.
Distillation-Resistant Watermarking for Model Protection in NLP. EMNLP 2022.
CATER: Intellectual Property Protection on Text Generation APIs via Conditional Watermarks. NeurIPS 2022.
Adversarial Watermarking Transformer: Towards Tracing Text Provenance with Data Hiding. IEEE S&P 2021.
Watermarking GPT Outputs. Slides 2023.
Watermarking the Outputs of Structured Prediction with an Application in Statistical Machine Translation. EMNLP 2011.
First, think about which category the work should belong to.
Second, use the same format as the others to describe the work.