The field of Artificial Intelligence (AI) safety is locked in a continuous game of cat-and-mouse. As developers build stronger guardrails to prevent Large Language Models (LLMs) from generating harmful content, researchers and hackers find new ways to bypass them. For years, these vulnerabilities—known as "jailbreaks"—relied on structural manipulation, complex logic puzzles, or adversarial code injections.