Javad Rafiei Asl
A Framework for Adaptive Multi-Turn Jailbreak Attacks on Large Language Models (video, pdf)
Speaker: Javad Rafiei Asl
Author(s): Javad Rafiei Asl; Sidhant Narula; Mohammad Ghasemigol; Eduardo Blanco ; Daniel Takabi
Abstract: This paper introduces HarmNet, a modular framework designed to systematically construct, refine, and execute multi-turn jailbreak queries against LLMs, demonstrating significantly higher attack success rates compared to prior methods.
