Search Results - eric+wong

1 Results
Sort By:

SMOOTHLLM: Defending Large Language Models Against Jailbreaking Attacks

An algorithm designed to defend Large Language Models (LLMs) against jailbreaking attacks that significantly reduces attack success rates.

Published: 1/28/2025