An algorithm designed to defend Large Language Models (LLMs) against jailbreaking attacks that significantly reduces attack success rates.
[%brief%]