Scalable Extraction of Training Data from (Production) Language ModelsResearch Paper
Nasr et al. · Google DeepMind, Cornell, UW, CMU, UC Berkeley, ETH Zurich · arXiv:2311.17035 · Disclosed to OpenAI August 30, 2023 · Published November 28, 2023
Cited for: Repeated-token attack, sustained generation causing behavioral divergence, disproportionate token consumption per query, slides 12, 25. The "poem poem poem" technique extracted 10,000+ training examples for ~$200.
arXiv:2311.17035 →
AVID-2023-V009 — Proof Pudding (moohax & monoxgas)Vulnerability Report
AI Vulnerability Database · Will Pearce & Nick Landers · DerbyCon 2019 presentation: "42: The answer to life, the universe, and everything offensive security"
Cited for: Proof Pudding model extraction mechanics, systematic probe query approach, slide 13
AVID database →