I am a third-year(2023 Spring-now) Ph.D student in Computer Science at University of Notre Dame, advised by Prof. Xiangliang Zhang. Before this, I received my B.E. degree in Computer Science and Engineering at the University of Electronic Science and Technology of China (UESTC) and my M.S degree in Computer Science at King Abdullah University of Science and Technology (KAUST).
I am deeply interested in trustworthy large language models (LLMs) and adversarial machine learning. My work focuses on developing innovative methodologies in adversarial learning and LLMs to ensure the robustness and reliability of AI systems. This involves creating algorithms to detect and mitigate adversarial attacks and designing frameworks to enhance the safety of LLMs. My goal is to address critical challenges in AI by making machine learning models more resilient and trustworthy in real-world applications. Currently, I am engaged in the ND-IBM Tech Ethics Lab Collaborative Project, where I aim to extend the trustworthiness of LLMs to real-world safety-critical applications, such as lab safety, beyond the traditional focus areas.
I am seeking potential research collaborations and the position of industry research intern. If you are interested, please contact me.
🔥 News
- 2025.01: 🎉🎉 Thrilled to be awarded OpenAI’s Researcher Access Program.
- 2024.09: 🎉🎉 One first-author paper has been accepted by EMNLP 2024!
- 2024.09: 🎉🎉 One paper has been accepted by NeurIPS 2024 Dataset and Benchmark Track as a spotlight!
- 2024.05: 🎉🎉 One Paper is accepted by ACL 2024!
- 2024.05: 🎉🎉 One first-author paper has been accepted by ICML 2024!
- 2022.12: 🎉🎉 One Paper is accepted by AAAI 2023!
- 2022.10: 🎉🎉 One Paper is accepted by BigData 2022!
- 2022.01: 🎉🎉 One paper is accepted by ICLR 2022!
📝 Publications
See more publications in my Google Scholar

Attack-free Evaluating and Enhancing Adversarial Robustness on Categorical Data
Yujun Zhou, Yufei Han, Haomin Zhuang, Hongyan Bao, Xiangliang Zhang

Defending Jailbreak Prompts via In-Context Adversarial Game
Yujun Zhou, Yufei Han, Haomin Zhuang, Taicheng Guo, Kehan Guo, Zhenwen Liang, Hongyan Bao, and Xiangliang Zhang

LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs, Yujun Zhou, Jingdong Yang, Yue Huang, Kehan Guo, Zoe Emory, Bikram Ghosh, Amita Bedar, Sujay Shekar, Pin-Yu Chen, Tian Gao, Werner Geyer, Nuno Moniz, Nitesh V Chawla, Xiangliang Zhang
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective, Yue Huang, Chujie Gao, Siyuan Wu, Haoran Wang, Xiangqi Wang, Yujun Zhou, Yanbo Wang, Jiayi Ye, Jiawen Shi, Qihui Zhang, Yuan Li, Han Bao, Zhaoyi Liu, Tianrui Guan, Dongping Chen, Ruoxi Chen, Kehan Guo, Andy Zou, Bryan Hooi Kuen-Yew, Caiming Xiong, Elias Stengel-Eskin, Hongyang Zhang, Hongzhi Yin, Huan Zhang, Huaxiu Yao, Jaehong Yoon, Jieyu Zhang, Kai Shu, Kaijie Zhu, Ranjay Krishna, Swabha Swayamdipta, Taiwei Shi, Weijia Shi, Xiang Li, Yiwei Li, Yuexing Hao, Zhihao Jia, Zhize Li, Xiuying Chen, Zhengzhong Tu, Xiyang Hu, Tianyi Zhou, Jieyu Zhao, Lichao Sun, Furong Huang, Or Cohen Sasson, Prasanna Sattigeri, Anka Reuel, Max Lamparth, Yue Zhao, Nouha Dziri, Yu Su, Huan Sun, Heng Ji, Chaowei Xiao, Mohit Bansal, Nitesh V Chawla, Jian Pei, Jianfeng Gao, Michael Backes, Philip S Yu, Neil Zhenqiang Gong, Pin-Yu Chen, Bo Li, Xiangliang Zhang
Can LLMs Solve Molecule Puzzles? A Multimodal Benchmark for Molecular Structure Elucidation, Kehan Guo, Bozhao Nan, Yujun Zhou, Taicheng Guo, Zhichun Guo, Mihir Surve, Zhenwen Liang, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang
Social Science Meets LLMs: How Reliable Are Large Language Models in Social Simulations?, Yue Huang, Zhengqing Yuan, Yujun Zhou*, Kehan Guo, Xiangqi Wang, Haomin Zhuang, Weixiang Sun, Lichao Sun, Jindong Wang, Yanfang Ye, Xiangliang Zhang
Position: We Need An Adaptive Interpretation of Helpful, Honest, and Harmless Principles, Yue Huang, Chujie Gao, Yujun Zhou, Kehan Guo, Xiangqi Wang, Or Cohen-Sasson, Max Lamparth, Xiangliang Zhang
Beyond Single-Value Metrics: Evaluating and Enhancing LLM Unlearning with Cognitive Diagnosis, Yicheng Lang, Kehan Guo, Yue Huang, Yujun Zhou, Haomin Zhuang, Tianyu Yang, Yao Su, Xiangliang Zhang
Artificial Intelligence in Spectroscopy: Advancing Chemistry from Prediction to Generation and Beyond, Kehan Guo, Yili Shen, Gisela Abigail Gonzalez-Montiel, Yue Huang, Yujun Zhou, Mihir Surve, Zhichun Guo, Prayel Das, Nitesh V Chawla, Olaf Wiest, Xiangliang Zhang
SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark, Zhenwen Liang, Kehan Guo, Gang Liu, Taicheng Guo, Yujun Zhou, Tianyu Yang, Jiajun Jiao, Renjie Pi, Jipeng Zhang, Xiangliang Zhang
Towards efficient and domain-agnostic evasion attack with high-dimensional categorical inputs, Hongyan Bao, Yufei Han, Yujun Zhou, Xin Gao, Xiangliang Zhang
Towards understanding the robustness against evasion attack on categorical data, Hongyan Bao, Yufei Han, Yujun Zhou, Yun Shen, Xiangliang Zhang
📖 Educations
- 2023.01 - Present, Ph.D,
University of Notre Dame
- 2021.09 - 2022.12, M.S,
King Abdullah University of Science and Technology
- 2017.09 - 2021.06, B.Eng,
University of Electronic Science and Technology of China