LLM09:2025 — Misinformation

Slide 24 · Mitigation 6 of 6

Build automated checks that catch false output before it reaches users.

📄 OWASP LLM Top 10:2025 · LLM09 Prevention — Output Validation

OWASP — Output Validation

Implement Automated Fact-Checking and Output Filtering Pipelines

What OWASP Says

“Implement automated post-generation checking or filtering to catch harmful outputs.” For misinformation specifically, this means validation layers that check factual claims against authoritative sources before returning responses to users.

How Missing This Made a Real Incident Worse

No output validation existed in the Air Canada chatbot or the legal research tool in Mata. In both cases, raw LLM output reached the user or the downstream action (the tribunal, the court filing) without any automated check. A simple URL-resolution check on cited cases, or a policy-database lookup confirming whether an answer matches the actual policy record, would have flagged both before harm occurred.

How to Do This Right

→ For domain-specific deployments: build a validation layer that checks key claims against authoritative data before returning the response
→ Fail closed: if validation cannot confirm a claim, return “I cannot verify this — please consult a specialist” rather than passing unverified output through
→ For code generation: run SAST automatically on generated code before displaying it
→ For package recommendations: resolve package names against the registry before surfacing them to developers

How to Validate

Deliberately introduce a false claim into the system — wrong policy text, a fabricated case name, a non-existent package — and see if it reaches the user unchecked. If it passes through, the output validation pipeline is not working. Test every claim type your application handles.

← Back Next → The Matrix