zchin31415[AT]gmail.com
Hi! I am a PhD student at CISPA Helmholtz Center for Information Security under the supervision of Mario Fritz. My research focuses on AI safety and interpretability for large generative models, with a particular emphasis on text-to-image diffusion models and LLMs.
My work develops methods to understand and control generative AI systems. P4D introduces a red-teaming framework that automatically discovers problematic prompts in text-to-image models, helping developers identify safety vulnerabilities before deployment. Building on this, I develop ICER, which generates fluent adversarial prompts to red-team these models. Currently, I am also working on attribution and interpretation methods for understanding model misbehavior in both text-to-image models and LLMs.
I work closely with Pin-Yu Chen (IBM Research) and Ping-Chun Hsieh (NYCU). I received my M.S. in Computer Science from National Yang Ming Chiao Tung University (NYCU). For more details, please see my CV and Google Scholar. Feel free to reach out at zchin31415[AT]gmail.com for potential collaborations or discussions.
For a comprehensive list of my publications, please refer to my Google Scholar.
(† indicates equal contribution)
Besides research, I am an opera and classical crossover singer who performs both soprano and alto pieces. I spend most of my free time running, and I have completed three half marathons and one full marathon. I also enjoy reading and exploring dessert and coffee shops.