Know how the guardrails fail.

A practitioner reference for LLM jailbreak techniques: working bypasses, the model behaviors they exploit, and the patches that did and didn't fix them. Written for AI red teamers who need to know what's still landing today.

// index · 17 entries

Why trust us

Trusted by researchers across the AI security community

Jailbreaks FYI is part of a 26-site editorial network covering adversarial ML, AI governance, defensive tooling, and ops engineering — all open access.

26 sites in network, across 6 topic clusters

400+ expert articles, and growing daily

New content daily, automated + editorial

Always free to read, newsletter included

Jailbreaks FYI — in your inbox

Working LLM jailbreak techniques, sourced and dated, delivered when there's something worth your inbox.

No spam. Unsubscribe anytime.

Latest entries

Best LLM Guardrail Tools 2026: A Practitioner's Comparison

How LLM Jailbreaks Work: Techniques, Success Rates, and Defender Responses

DAN Prompt Jailbreak Explained: How 'Do Anything Now' Attacks Work

Prompt Injection vs. Jailbreak: The Distinction and the Defender's Stack

Why Jailbreaks Work: Competing Objectives and Mismatched Generalization

ArtPrompt Post-Mortem: Why ASCII-Art Bypasses Worked

Garak in 2026: What It's Actually Good For, What It Isn't

Indirect Prompt Injection in LLM Agents: Shipped Failures

Model Behavior Fingerprinting: Identifying a Wrapped LLM

Multi-Turn Role-Play Attacks: Why One Safe Turn Gets Unsafe
