Bridging the Gap in Vision Language Models in Identifying Unsafe Concepts Across Modalities

Authors: 

Yiting Qu, Michael Backes, and Yang Zhang, CISPA Helmholtz Center for Information Security