SEC-bench: Automated Benchmarking of LLM Agents on Real-World Software Security Tasks | Cybersec Research