Exploiting Task-Level Vulnerabilities: An Automatic Jailbreak Attack and Defense Benchmarking for LLMs

Authors: 

Lan Zhang and Xinben Gao, University of Science and Technology of China; Liuyi Yao, unaffiliated; Jinke Song, The Hong Kong University of Science and Technology; Yaliang Li