Attack via Overfitting: 10-shot Benign Fine-tuning to Jailbreak LLMs | Cybersec Research