I am a 2nd-year(2023 Spring-now) Ph.D student in Computer Science at University of Notre Dame, advised by Prof. Xiangliang Zhang. Before this, I received my B.E. degree in Computer Science and Engineering at the University of Electronic Science and Technology of China (UESTC) and my M.S degree in Computer Science at King Abdullah University of Science and Technology (KAUST).

I am deeply interested in trustworthy large language models (LLMs) and adversarial machine learning. My work focuses on developing innovative methodologies in adversarial learning and LLMs to ensure the robustness and reliability of AI systems. This involves creating algorithms to detect and mitigate adversarial attacks and designing frameworks to enhance the safety of LLMs. My goal is to address critical challenges in AI by making machine learning models more resilient and trustworthy in real-world applications. Currently, I am engaged in the ND-IBM Tech Ethics Lab Collaborative Project, where I aim to extend the trustworthiness of LLMs to real-world safety-critical applications, such as lab safety, beyond the traditional focus areas.

I am seeking potential research collaborations and the position of industry research intern. If you are interested, please contact me.

🔥 News

  • 2024.09:  🎉🎉 One first-author paper has been accepted by EMNLP 2024!
  • 2024.09:  🎉🎉 One paper has been accepted by NeurIPS 2024 Dataset and Benchmark Track as a spotlight!
  • 2024.05:  🎉🎉 One Paper is accepted by ACL 2024!
  • 2024.05:  🎉🎉 One first-author paper has been accepted by ICML 2024!
  • 2022.12:  🎉🎉 One Paper is accepted by AAAI 2023!
  • 2022.10:  🎉🎉 One Paper is accepted by BigData 2022!
  • 2022.01:  🎉🎉 One paper is accepted by ICLR 2022!

📝 Publications

See more publications in my Google Scholar

ICML 2024
sym

ICML 2024 Attack-free Evaluating and Enhancing Adversarial Robustness on Categorical Data

Yujun Zhou, Yufei Han, Haomin Zhuang, Hongyan Bao, Xiangliang Zhang

Code

EMNLP 2024
sym

EMNLP 2024 Defending Jailbreak Prompts via In-Context Adversarial Game

Yujun Zhou, Yufei Han, Haomin Zhuang, Taicheng Guo, Kehan Guo, Zhenwen Liang, Hongyan Bao, and Xiangliang Zhang

Code

📖 Educations