I am a third-year(2023 Spring-now) Ph.D student in Mine Lab in Computer Science at University of Notre Dame, advised by Prof. Xiangliang Zhang. I’m also a graduate student in Foundation Models and Applications Lab (FMAL) at Lucy Family Institute for Data & Society. Before this, I received my B.E. degree in Computer Science and Engineering at the University of Electronic Science and Technology of China (UESTC) and my M.S degree in Computer Science at King Abdullah University of Science and Technology (KAUST).

I am deeply interested in Large Language Model (LLM) Reasoning and trustworthy LLMs. I aim to understand how LLMs reason, where their reasoning fails, and how to enhance their ability to generalize beyond seen problems. In parallel, I study how to make LLMs more reliable, safe, and aligned when applied to safety-critical contexts. Also, I am engaged in ND-IBM Tech Ethics Lab Collaborative Project, where I explore ways to extend the trustworthiness of LLMs to practical domains such as laboratory safety and investigate the misalignment behaviors.

I am seeking potential research collaborations and the position of industry research intern. If you are interested, please contact me.

🔥 News

  • 2024.08:   One first-author paper has been accepted by EMNLP Findings 2025!
  • 2025.08:   I’m excited to share that I have extended my internship at Tencent AI Lab!
  • 2025.05:   I joined Tencent AI Lab as a research intern this summer! See you in Seattle!
  • 2025.05:   One Paper is accepted by ACL Findings 2025!
  • 2025.04:   One Paper is accepted by IJCAI 2025 Survey Track!
  • 2025.01:   Thrilled to be awarded OpenAI’s Researcher Access Program.
  • 2024.09:   One first-author paper has been accepted by EMNLP 2024!
  • 2024.09:   One paper has been accepted by NeurIPS 2024 Dataset and Benchmark Track as a spotlight!
  • 2024.05:   One Paper is accepted by ACL 2024!
  • 2024.05:   One first-author paper has been accepted by ICML 2024!
  • 2022.12:   One Paper is accepted by AAAI 2023!
  • 2022.10:   One Paper is accepted by BigData 2022!
  • 2022.01:   One paper is accepted by ICLR 2022!

📝 Selected Publications

See more publications in my Google Scholar

Arxiv Preprint
sym

Arxiv Preprint Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation

Yujun Zhou*, Zhenwen Liang*, Haolin Liu, Wenhao Yu, Kishan Panaganti, Linfeng Song, Dian Yu, Xiangliang Zhang, Haitao Mi, Dong Yu

Code

Huggingface Models

EMNLP Findings 2025
sym

EMNLP Findings 2025 Dissecting Logical Reasoning in LLMs: A Fine-Grained Evaluation and Supervision Study

Yujun Zhou*, Jiayi Ye*, Zipeng Ling*, Yufei Han, Yue Huang, Haomin Zhuang, Zhenwen Liang, Kehan Guo, Taicheng Guo, Xiangqi Wang, Xiangliang Zhang

Code

EMNLP 2024
sym

EMNLP 2024 Defending Jailbreak Prompts via In-Context Adversarial Game

Yujun Zhou, Yufei Han, Haomin Zhuang, Taicheng Guo, Kehan Guo, Zhenwen Liang, Hongyan Bao, and Xiangliang Zhang

Code

ICML 2024
sym

ICML 2024 Attack-free Evaluating and Enhancing Adversarial Robustness on Categorical Data

Yujun Zhou, Yufei Han, Haomin Zhuang, Hongyan Bao, Xiangliang Zhang

Code

Arxiv Preprint
sym

Arxiv Preprint LabSafety Bench: Benchmarking LLMs on Safety Issues in Scientific Labs

Yujun Zhou, Jingdong Yang, Yue Huang, Kehan Guo, Zoe Emory, Bikram Ghosh, Amita Bedar, Sujay Shekar, Pin-Yu Chen, Tian Gao, Werner Geyer, Nuno Moniz, Nitesh V Chawla, Xiangliang Zhang

Code

Dataset

📖 Educations

Map Widget