Logit-Gap Steering: Efficient Short-Suffix Jailbreaks for Aligned Large Language Models | Cybersec Research