Table of Links
-
Methodology and 3.1 Preliminary
-
A. Character Generation Detail
C. Effect of Text Moderator on Text-based Jailbreak Attack
6 Limitation
One potential limitation of our work, despite its strong performance on state-of-the-art MLLMs, lies in its effectiveness against poorly performing MLLMs. These models may lack adequate instructionfollowing and image understanding capabilities, rendering them ineffective in role-playing tasks. Another limitation is our approach for generating character prompts for the diffusion model, which relies on direct generation by a LLM. This method, while effective and straightforward, may be constrained by the LLM’s ability to produce effective diffusion model prompts. Additionally, the diffusion model’s capability to generate character images from these may further limit the efficacy of our approach.
Authors:
(1) Siyuan Ma, University of Wisconsin–Madison ([email protected]);
(2) Weidi Luo, The Ohio State University ([email protected]);
(3) Yu Wang, Peking University ([email protected]);
(4) Xiaogeng Liu, University of Wisconsin-Madison ([email protected]).
This paper is