PART 2Types
Slides 9–13 · 4 patterns, each a real incident
Slide 9 · Types Overview
OWASP identifies five vulnerability categories. We cover four attacks.
Each anchored to a real, verified incident or research demonstration.
☠️
Retrieval Poisoning
Malicious documents crafted so their vectors outrank legitimate ones for target queries — controlling what the AI is told to read.
🔍
Embedding Inversion
Reconstructing source text from stored vector representations — reading private documents from a vector store without direct document access.
🚪
Cross-Context Leakage
Embeddings from restricted users bleeding into responses for other users in shared vector databases — bypassing organizational access controls.
🎭
Context Manipulation
Injecting hidden instructions into documents before embedding — making the AI follow attacker-supplied directives whenever that document is retrieved.