LLM03:2025 — Supply Chain

Slide 26 · Mitigation 8 of 9 — Anomaly & Robustness Testing

Hunt the backdoor that benchmarks can't see.

📄 OWASP LLM Top 10:2025 · LLM03 Prevention #8

OWASP — Anomaly Detection & Robustness Tests

Run anomaly detection and adversarial robustness tests on supplied models and data

What OWASP Says

“Implement anomaly detection and adversarial robustness tests on supplied models and data to help detect tampering and poisoning.”

The Gap This Closes

This is the layer aimed squarely at the stealthy attacks: the LoRA backdoor (Slide 13) and PoisonGPT (Slide 15) both pass normal tests. Specialized tooling — like weight-space backdoor detectors (PEFTGuard) — inspects the artifact itself rather than trusting its everyday behavior.

How to Do This Right

→ Scan model/adapter weights for backdoor signatures, not just run prompts
→ Probe with adversarial and trigger-style inputs; watch for behavior that flips on odd cues
→ Flag statistical anomalies in supplied datasets before fine-tuning on them

How to Validate

Plant a known trigger in a test adapter and run your detection. If your pipeline only checks accuracy, it will pass the poisoned adapter — proving the gap is still there.

← Back Next → Defend the device edge