TwinBreak: Jailbreaking LLM Security Alignments based on Twin Prompts

Authors: Torsten Krauß, Hamid Dashtbani, and Alexandra Dmitrienko, University of Würzburg