zchin31415[AT]gmail.com

Zhi-Yi Chin

Hi! I am a PhD student at CISPA Helmholtz Center for Information Security under the supervision of Mario Fritz. My research focuses on AI safety and interpretability for large generative models, with a particular emphasis on text-to-image diffusion models and LLMs.

My work develops methods to understand and control generative AI systems. P4D introduces a red-teaming framework that automatically discovers problematic prompts in text-to-image models, helping developers identify safety vulnerabilities before deployment. Building on this, I develop ICER, which generates fleunt adversarial prompts to red-team these models. Currently, I am also working on attribution/ interpretation methods for understanding model misbehavior in both text-to-image models and LLMs.

I work closely with Pin-Yu Chen (IBM Research) and Ping-Chun Hsieh (NYCU). I received my M.S. in Computer Science from National Yang Ming Chiao Tung University (NYCU). For more details, please see my CV and Google Scholar. Feel free to reach out at zchin31415[AT]gmail.com for potential collaborations or discussions.

Links: CV / Twitter / Github / Google Scholar / Linkedin / Instagram / Threads / Facebook

Publications

For a comprehensive list of my publications, please refer to my Google Scholar.
(† indicates equal contribution)

Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts
Zhi-Yi Chin^†, Chieh-Ming Jiang^†, Pin-Yu Chen, Ching-Chun Huang, Wei-Chen Chiu
ICML 2024
Project Page / Code / Dataset
We introduce P4D a white-box red-teaming method for T2I model by correspondence model guidance and token-level prompt optimization technique.
In-Context Experience Replay Facilitates Safety Red-Teaming of Text-to-Image Diffusion Models
Zhi-Yi Chin, Mario Fritz, Pin-Yu Chen, Wei-Chen Chiu
arXiv 2024

We propose ICER, a jailbreaking framework for T2I models that stores past red-teaming attempts as experience replay, employs bandit sampling from this replay buffer to construct LLM priors, and guides LLM generation of fluent jailbreaking prompts through Bayesian optimization, creating a self-improving attack cycle.
Masking Improves Contrastive Self-Supervised Learning for ConvNets, and Saliency Tells You Where
Zhi-Yi Chin^†, Chieh-Ming Jiang^†, Pin-Yu Chen, Ching-Chun Huang, Wei-Chen Chiu
WACV 2024
Code
We propose a saliency-aware masking strategy for SSL in ConvNets that balances mask distribution between foreground and background regions while introducing hard negatives through strategic salient patch masking.
Multi-Camera Tracking by Candidate Intersection Ratio Tracklet Matching
Yun-Lun Li, Zhi-Yi Chin, Ming-Ching Chang, Chen-Kuo Chiang
CVPR 2021 AI City Challenge Workshop

Projects

3D Point Cloud Data Augmentation via Scene Representation Network
Pei-Tse Chiang, Meng-Hsun Tsai, Zhi-Yi Chin, Chieh-Ming Jiang
pdf / Code / Slides
We develop a 3D point cloud augmentation pipeline that leverages SRN and image features to generate new 3D shapes through latent space interpolation, demonstrating success on ModelNet10. in 2021 MediaTek Research Project
RSNA Pneumonia Detection
Zhi-Yi Chin, Chieh-Ming Jiang
pdf / Code / Slides
We develop a high-performing pneumonia detection system for the RSNA Kaggle challenge by combining EfficientNet for classification and YOLOR for detection, with optimized prediction thresholds and bounding box refinements that boosted accuracy by 2%. Final project in Setected Topics in Visual Recognition Using Deep Learning 2021 Fall @ NYCU
Generative Models as a Data Augmentation for Classification
Zhi-Yi Chin, Chieh-Ming Jiang
Code / Video / Slides
We explore image transformation through latent space manipulation in GAN steer, demonstrating its superiority over traditional data-space transformations for data augmentation. Final poject in Deep Learning and Practice 2021 Summer @ NYCU
Reimplemenatation Challenge -- Maximum a Posteriori Policy Optimisation
Zhi-Yi Chin,Yi-Hsin Chen, Yu-Hsuan Li, Yu-Jie Chen
pdf / Code / Slides
We extend and enhance the MPO paper's algorithm implementation by incorporating numerical stabilization techniques and exploring E-step modifications for improved performance. Reimplementation project in Reinforcement Learning 2021 Spring @ NYCU
Calendar Helper
Zhi-Yi Chin, Mi Li, Jhong-Yu Huang, JC Chen
Code
We build a comprehensive task management platform that seamlessly blends calendars and to-do lists, featuring an innovative tagging system for smarter organization of events and tasks in 2019 CodeU @ Google
Lane Detection
Zhi-Yi Chin, Shao-Yu Weng, Bo-Yu Cheng
pdf / Code / Video / Slides
We develop an enhanced multi-lane detection system achieving over 70% accuracy by combining hourglass network with double hinge loss. Final project in Computer Vision 2021 Spring @ NYCU
Mango Classification
Zhi-Yi Chin, Tzu-Cheng Lin, Kung-Hao Chang, Yu-Chang Chen
pdf / Code
Achieve accuracy 82.31% on the testing data and rank 8 in the public board in AICUP Mango Image Recognition Challenge: Grade Classification and Defective Classification
Face Morphing and Warping
Code / Videos
Face swapping from my face to another person's face smoothly without ghost effect by morphing and warping technique. Final project in Introduction to Multimedia Technology in Fall 2019 @ CCU

Blog Posts

Honors

Dean's list (6 times), Computer Science and Information Engineering Dept. at CCU, Fall '17, Spring '18, Fall '18, Spring '19, Fall '19, Spring '20
College Student Research Scholarship, get NT$ 48,000 from Ministry of Science and Technology, 2020
Google Student Travel Scholarship, scholarship to attend Grace Hopper Celebration, 2019

Services

Reviewer:
ICLR 2025
CVPR 2025 (Outstanding Reviewer)
ICCV 2025

Miscellany

Besides research, I am an opera and classical crossover singer who performs both soprano and alto pieces. The majority of my free time is spent running and have completed three half marathons and one full marathon. I also enjoy reading and exploring dessert and coffee shops in my free time.

Books I enjoy: The Ride of a Lifetime (Robert Iger), Becoming (Michelle Obama), What We Owe the Future (William MacAskill), Atomic Habits (James Clear), Make Time (Jake Knapp and John Zeratsky), Show Your Work (Austin Kleon)
TV shows I enjoy: Grey's Anatomy, Lessons in Chemistry, The Morning Show, Hospital Playlist

This website is built from the source code of Nelson F. Liu's awesome website (https://nelsonliu.me ).