Jialun CAO received her PhD degree from the Department of Computer Science and Engineering at The Hong Kong University of Science and Technology (HKUST), under the supervision of Prof. Shing-Chi Cheung in the CASTLE lab. She is now a Research Assistant Professor in HKUST.

Her research interests lie in the intersection of Software Engineering (SE) and Large Language Models (LLMs), with an emphasis on LLM4SE, and LLM Evaluation. She has published more than 20 papers at the top conferences and journals, including ICSE, FSE, ASE, TOSEM, CAV, Usenix Security, AAAI, etc. She serves as a program committee member in top conferences such as ICSE, FSE, and ASE, SANER, Internetware, APSEC, etc; and is a reviewer for top journals including TOSEM, TSE, EmSE, etc.

I am on the job market now, seeking a tenure-track Assistant Professor position. Contact me if it’s suitable😊

🔥 News

2025.05.15: 🎉🎉 Two papers 📑 “From Informal to Formal” and 📑 “CruxEval-X” have been accepted by ACL 2025 to the main conference! Congrats!
2025.05.01: 🎉🎉 Our paper 📑 “Enhancing Differential Testing With LLMs For Testing Deep Learning Libraries” has been accepted by TOSEM. Congrats to Ziniu!
2025.03.31: 🎉🎉 Our paper 📑 “CodeCleaner: Elevating Standards with A Robust Data Contamination Mitigation Toolkit” has been accepted by Internetware 2025. Congrats!
2025.03.08: Our 🤗Huggingface repo reached 1.6k+ downloads. Contributions are welcomed 👉 [Link] The Chinese article has reached 37k+ reads👀 and 2k+ forwards↗️. Full paper 👉 [Paper]
2025.02.12: 🎉🎉 Our paper 📑 “SemBIC: Semantic-aware Identification of Bug-inducing Commits” has been accepted to FSE 2025. Congrats to Xiao!
2025.02.02: 🎉🎉 Our paper 📑 “A study on Prompt Design, Advantages and Limitations of ChatGPT for Deep Learning Program Repair” has been accepted to JASE 2025. Finally!
2025.01.23: 🎉🎉 I am honored to receive the prestigious 🏆ACM SIGSOFT 2025 Outstanding Dissertation Award! Only 1 or 2 award receivers worldwide per year 🎉 🔗[News]
2024.12.10: 🎉🎉 Our paper 📑 “DomainEval: An Auto-Constructed Benchmark for Multi-Domain Code Generation” has been accepted to AAAI 2025. Congrats to Qiming!

🌍 Visiting & Program Experience

2024.12. I am honored to be mentored by Prof. Paola Ricaurte Quijano at Harvard University in the 5th cohort of the Asia Pacific Women in Leadership (APWiL) Mentoring Program.
2024.10. I am honored to visit Prof. Michael Pradel at the University of Stuttgart.
2024.03. I am honored to visit Prof. Pinjia He at the Chinese University of Hong Kong, Shenzhen.

📝 Publications

2025

[C19] 📄 Jialun Cao, Songqiang Chen, Wuqi Zhang, Hau Ching Lo, Yeting Li, Shing-Chi Cheung. CodeCleaner: Elevating Standards with A Robust Data Contamination Mitigation Toolkit. In Internetware 2025. 🔗[Paper] 💻 [Github]
[J3] 📚 Jialun Cao, Meiziniu Li, Ming Wen, Shing-chi Cheung. A study on prompt design, advantages and limitations of chatgpt for deep learning program repair. In Journal of Automated Software Engineering (ASEJ). 🔗[arxiv] 🔗[Official]
[C20] 📝 Jialun Cao, Yaojie Lu, Meiziniu Li, Haoyang Ma, Haokun Li, Mengda He, Cheng Wen, Le Sun, Hongyu Zhang, Shengchao Qin, Shing-Chi Cheung, Cong Tian. From Informal to Formal – Incorporating and Evaluating LLMs on Natural Language Requirements to Verifiable Formal Proofs. In ACL Main 2025. 🔗[Paper] 🤗 [Huggingface] 🇨🇳[Chinese article]
[C21] 📄 Xiao Chen, Hengcheng Zhu, Jialun Cao (Corresponding), Ming Wen, Shing-Chi Cheung (Corresponding). SemBIC: Semantic-aware Identification of Bug-inducing Commits. In the ACM International Conference on the Foundations of Software Engineering (FSE 2025). 🔗[paper]
[C22] 📄 Qiming Zhu, Jialun Cao (Co-1st), Yaojie Lu, Hongyu Lin, Xianpei Han, Ben He, Le Sun, Shing-Chi Cheung. DomainEval: An Auto-Constructed Benchmark for Multi-Domain Code Generation. In 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025). 🔗[Paper] 🎯[Leaderboard] 💻 [Github]
[C23] 📝 Ruiyang Xu, Jialun Cao (Co-1st), Yaojie Lu, Ming Wen, Hongyu Lin, Xianpei Han, Ben He, Shing-Chi Cheung, Le Sun. CruxEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution. In ACL Main 2025. 🔗[Paper] 🎯[Leaderboard] 💻 [Github]
[C24] 📄 Mengyang Wu, Yuzhi Zhao, Jialun Cao, Mingjie Xu, zhongming Jiang, Xuehui Wang, Qinbin Li, Guangneng Hu, Shengchao Qin, Chi-Wing Fu. ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation. In 39th Annual AAAI Conference on Artificial Intelligence (AAAI 2025). 🔗[Paper]
[J4] 📝 Meiziniu Li, Dongze Li, Jianmeng Liu, Jialun Cao, Yongqiang Tian, Shing-Chi Cheung. Enhancing Differential Testing With LLMs For Testing Deep Learning Libraries. In TOSEM 2025. 🔗[Paper]
[Pre1] 📝 Jialun Cao, Yuk-Kit Chan, Zixuan Ling, Wenxuan Wang†, Shuqing Li, Mingwei Liu, Ruixi Qiao, Yuting Han, Chaozheng Wang, Boxi Yu, Pinjia He, Shuai Wang, Zibin Zheng, Michael R. Lyu, Shing-Chi Cheung. How Should We Build A Benchmark? Revisiting 274 Code-Related Benchmarks For LLMs. In arXiv 2025. 🔗[Paper] 🇨🇳[Chinese article]
[Pre2] 📝 Jialun Cao, Wuqi Zhang, Shing-Chi Cheung. Concerned with Data Contamination? Assessing Countermeasures in Code Language Model. In arXiv. 🔗[Paper]
[Pre3] 📝 Jiarong Wu, Songqiang Chen, Jialun Cao (Corresponding), Hau Ching Lo, Shing-Chi Cheung (Corresponding). Isolating Language-Coding from Problem-Solving: Benchmarking LLMs with PseudoEval. In arXiv. 🔗[Paper]
[Pre4] 📝 Jingyi Chen, Songqiang Chen, Jialun Cao (Corresponding), Jiasi Shen (Corresponding), Shing-Chi Cheung. When LLMs Meet API Documentation: Can Retrieval Augmentation Aid Code Generation Just as It Helps Developers? In arXiv. 🔗[Paper]
[Pre5] 📝 Dekun Dai, MingWei Liu, Anji Li, Jialun Cao, Yanlin Wang, Chong Wang, Xin Peng, Zibin Zheng. FeedbackEval: A Benchmark for Evaluating Large Language Models in Feedback-Driven Code Repair Tasks. In arXiv. 🔗[Paper]
[Pre6] 📝 Xiaolei Li, Jialun Cao, Yepang Liu, Shing-Chi Cheung, Hailong Wang. ReuseDroid: A VLM-empowered Android UI Test Migrator Boosted by Active Feedback. In arXiv. 🔗[Paper]
[Pre7] 📝 Ruiyang Xu*, Jialun Cao (Co-1st), Mingyuan Wu, Wenliang Zhong, Yaojie Lu, Ben He, Xianpei Han, Shing-Chi Cheung, Le Sun. EmbedAgent: Benchmarking Large Language Models in Embedded System Development. In arXiv. 🔗[Paper].
[Pre8] 📝 Qiming Zhu, Jialun Cao, Xuanang Chen, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun, Shing-Chi Cheung. Across Programming Language Silos: A Study on Cross-Lingual Retrieval-augmented Code Generation. In arXiv. 🔗[Paper]

2024

[C13] 📄 Jialun Cao, Zhiyong Chen*, Jiarong Wu, Shing-chi Cheung, Chang Xu. JavaBench: A Benchmark of Object-Oriented Code Generation for Evaluating Large Language Models. In the 39th IEEE/ACM International Conference on Automated Software Engineering (ASE 2024). 🔗[Paper] 🎯[Leaderboard] 💻 [Github]
🏆 [C14] 📄 Distinguished paper award. Zongze Jiang, Ming Wen, Jialun Cao, Xuanhua Shi and Hai Jin. Towards Understanding the Effectiveness of Large Language Models on Directed Test Input Generation. In the 39th IEEE/ACM International Conference on Automated Software Engineering (ASE 2024). 🔗[Paper] 💻 [Github]
[C15] 📄 Congying Xu, Songqiang Chen, Jiarong Wu, Valerio Terragni, Shing-chi Cheung (Corresponding), Hengcheng Zhu, Jialun Cao (Corresponding). MR-Adopt: Automatic Deduction of Input Transformation Function for Metamorphic Testing. In the 39th IEEE/ACM International Conference on Automated Software Engineering (ASE 2024). 🔗[Paper]
[C16] 📄 Cheng Wen, Jialun Cao (Corresponding), Jie Su, Zhiwu Xu, Shengchao Qin (Corresponding), Mengda He, Haokun Li, Shing-Chi Cheung, Cong Tian. Enchanting Program Specification Synthesis by Large Language Models using Static Analysis and Program Verification. In 37th International Conference on Computer Aided Verification (CAV 2024) 🔗[Paper] 💻 [Homepage]
[C17] 📄 Bo Yang, Jiawei Hu, Jialun Cao (Corresponding) SDEFL: A Lightweight Fault Detection and Localization Method for Deep Neural Networks. In 31st Asia-Pacific Software Engineering Conference (APSEC 2024)
[C18] 📄 Kunpeng Jian, Yanyan Zou, Yeting Li, Jialun Cao, Menghao Li, Jian Sun, Jingyi Shi and Wei Huo. Fuzzing for Stateful Protocol Implementations: Are We There Yet? In The 18th Theoretical Aspects of Software Engineering Conference (TASE 2024)

2023

[C11] 📄 Jialun Cao, Yaojie Lu, Ming Wen, Shing-Chi Cheung. Testing Coreference Resolution Systems without Labeled Test Sets. In The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023). 🔗[Paper] 💻 [Github]
[C12] 📄 Xiaohu Du, Xiao Chen, Jialun Cao, Ming Wen, Shing-Chi Cheung, Hai Jin. Understanding the Bug Characteristics and Fix Strategies of Federated Learning Systems. In The ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE 2023). 🔗[Paper]
[J2] 📚 Meiziniu Li, Jialun Cao, Yongqiang Tian, Tsz On Li, Ming Wen, Shing-Chi Cheung. COMET: Coverage-guided Model Generation For Deep Learning Library Testing. In ACM Transactions on Software Engineering and Methodology (TOSEM) 🔗[Paper] 💻 [Github]

2022

[J1] 📚 Jialun Cao, Meiziniu Li, Yeting Li, Ming Wen, Shing-chi Cheung. SemMT: A Semantic-Based Testing Approach for Machine Translation Systems. In ACM Transactions on Software Engineering and Methodology (TOSEM). 🔗[Paper] 💻 [Github]
[C9] 📄 Jialun Cao, Meiziniu Li, Xiao Chen, Ming Wen, Yongqiang Tian, Bo Wu, Shing-chi Cheung. DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs. In Proceedings of the 44th International Conference on Software Engineering (ICSE 2022). 🔗[Paper] 💻 [Github]
[C10] 📄 Yeting Li, Yecheng Sun, Zhiwu Xu, Jialun Cao, Yuekang Li, Rongchen Li, Haiming Chen, Shing-Chi Cheung, Yang Liu, Yang Xiao. RegexScalpel: Regular Expression Denial of Service (ReDoS) Defense by Localize-and-Fix. In the 31st USENIX Security Symposium. 🔗[Paper]

2021

[C7] 📄 Yeting Li, Zixuan Chen, Jialun Cao, Zhiwu Xu, Qiancheng Peng, Haiming Chen, Liyuan Chen, Shing-Chi Cheung. ReDoSHunter: A Combined Static and Dynamic Approach for Regular Expression DoS Detection. In the 30th USENIX Security Symposium. 🔗[Paper]
[C8] 📄 Yeting Li, Shuaimin Li, Zhiwu Xu, Jialun Cao, Zixuan Chen, Yun Hu, Haiming Chen, Shing-Chi Cheung. TransRegex: Multi-modal Regular Expression Synthesis by Generate-and-Repair. In the 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE 2021). 🔗[Paper]

2020

[C5] 📄 Yeting Li, Jialun Cao, Haiming Chen, Tingjian Ge, Zhiwu Xu, Qiancheng Peng. FlashSchema: Achieving High Quality XML Schemas with Powerful Inference Algorithms and Large-scale Schema Data. In the IEEE 36th International Conference on Data Engineering (ICDE 2020). 🔗[Paper]
[C6] 📄 Yeting Li, Zhiwu Xu, Jialun Cao, Haiming Chen, Tingjian Ge, Shing-Chi Cheung. FlashRegex: Deducing Anti-ReDoS Regexes from Examples. In the Proceedings of the 35th IEEE/ACM International Conference on Automated Software Engineering (ASE 2020). 🔗[Paper]

2019

[C3] 📄 Yongjian Li, Jialun Cao (1st student author), Jun Pang. A Learning-Based Framework for Automatic Parameterized Verification. In 2019 IEEE 37th International Conference on Computer Design (ICCD 2019). 🔗[Paper]
[C4] 📄 Yeting Li, Xiaolan Zhang, Jialun Cao, Haiming Chen, Chong Gao. Learning k-Occurrence Regular Expressions with Interleaving. In the International Conference on Database Systems for Advanced Applications (DASFAA 2019) 🔗[Paper]

2018

[C1] 📄 Jialun Cao, Yongjian Li, Jun Pang. L-CMP: an automatic learning-based parameterized verification tool. In Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering (ASE 2018 Demo). 🔗[Paper] 💻 [Github] 🎬 [Video]
[C2] 📄 Yongjian Li, Jialun Cao (1st student author), Kaiqiang Duan. An automatic parameterized verification of FLASH cache coherence protocol. In 2018 IEEE International Conference on Software Quality, Reliability and Security (QRS 2018) 🔗[Paper]

💯 Teaching

2025 Spring, Instructor in COMP 1021 - Introduction to Computer Science. Course materials ↗️ [Link]
2023 Fall, Teaching Assistant in COMP 1021 - Introduction to Computer Science.
2020 Fall, Teaching Assistant in COMP 3021 - Java Programming.
2020 Spring, Teaching Assistant in COMP 3021 - Java Programming.

🎖 Honors and Awards

2025 ACM SIGSOFT Outstanding Doctoral Dissertation Award (Only 1~2 award receivers worldwide per year)
2024 Shortlisted Participant for the Rising Stars Women in Engineering Workshop at Asian Deans’ Forum
2024 Hong Kong Postgraduate Studentship
2024 ACM SIGSOFT CAPS Travel Grant (ASE 2024)
2023 ACM SIGSOFT CAPS Travel Grant (ESEC/FSE 2023)
2019 - 2023: Huawei Fellowship Scholarship
2017: China National Scholarship (Postgraduate, Rank 1/106, Top 1%)
2014: China National Scholarship (Undergraduate, Rank 1/52, Top 2%)

🎓 Educations

2019.09 - 2024.03, Ph.D, The Hong Kong University of Science and Technology.
2016.09 - 2019.06, M.S., State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences.
2012.09 - 2016.06, B.S., Shandong University.

👩🏻‍💻 Working Experience

2024.08 - now, Research Assistant Professor at the Department of Computer Science and Engineering at The Hong Kong University of Science and Technology
2024.04 - 2024.07, Postdoctoral Fellow at HKUST, working with Prof. Shing-Chi Cheung
2023.07 - 2023.09, Intern at huawei (Hong Kong) - Fermat Lab.
2022.09 - 2023.01, Intern at Huawei - Trusted Software Engineering and Open Source Laboratory.
2021.09 - 2022.08, Intern at Huawei - Theory Lab.

🪴 Service

Award committee member:

ACM SIGSOFT Outstanding Research Award 2025
ACM SIGSOFT Distinguished Service Award 2025
ACM SIGSOFT Influential Educator Award 2025

Program committee member

ICSE 2026 research track
ICSE 2025 research track
ASE 2025 research track
FSE 2025 research track
ISSRE 2024 research track
Workshop on Responsible AI Engineering 2025 (RAIE)
APSEC 2025 Technical Track
Internetware 2025 Research Track
CAIN 2025 Research and Experience Papers-track
Forge 2025 Research track and Data and Benchmarking-track
SANER 2024 Short Papers and Posters Track track
APSEC 2024 Technical Track
Internetware 2024 Research track
AIware 2024 Main track
Forge 2024 Research track

Session chair

ASE 2024 - Session Chair of Code generation 3
ASE 2024 - Session Chair of Testing 1
Internetware 2024 - Session Chair of Session 6: Code Generation and Transformation

Reviewer

ACM Transactions on Software Engineering and Methodology (TOSEM)
IEEE Transactions on Software Engineering (TSE)
Empirical Software Engineering (EMSE)
Journal of Automated Software Engineering (JASE)
Journal of Software: Evolution and Process (JSME)
ACM Transactions on Knowledge Discovery from Data (TKDD)
IEEE Transactions on Mobile Computing (TMC)

💬 Invited Talks

2025.04, Exploring Code Generation and Reasoning Capabilities of LLMs. In the Applied Mathematics Seminar at Peking University. 🔗 [Link].
2025.01, From Benchmarks to Practice: A Preliminary Study on the Code Capabilities of Large Language Models. In the Next Era of AI-assisted R&D Seminar (2025华为AI辅助研发Next研讨会) by Huawei, Hong Kong. 📽️[Recording]
2025.01, From Requirement to Formal Specification andModel Generation via Large Language Models. In Zhejiang University (Online).
2024.11, Is Large Language Model a Rescue for Code Generation & Code Reasoning? In Trusted Large Language Model Evaluation and Open-Source Technology Forum by CCF China Open Source Conference.
2024.10, Exploring Automatic Testing and Verification for Software Programs using Large Language Models. In High Trust Software Engineering Technology Laboratory, Guangzhou Research Institute, Xidian University.
2024.08, Trusted Architecture of Intelligent CPS Systems. Micro-Forum of Intelligent Software Development hosted by Fudan University. 🔗 [Link]
2024.08, Can AI be a Panacea for Software Reliability? Exploring Automatic Testing and Verification for Software Programs. In the Software Systems Engineering Group at University College London.
2024.07, Concerned with Data Contamination? Assessing Countermeasures in Code Language Model. In the IEEE World Congress on Service - Cloud & AI Symposium
2024.05, From Requirement to Formal Specification and Model Generation via Large Language Models In the Formal Methods Committee Strategic Seminar (2024 CCF形式化方法专委会战略研讨会——形式化方法与人工智能的交叉融合：机遇与挑战) hosted in China Computer Federation (CCF). 📽️[Recording] 🪧[News]
2023.12, Crafting Future: A Dancer’s Leap into Computer Science. The 1st Forum for Women Scholars in Software Engineering hosted by ChinaSoft.
2023.12, A Study on the Problem of Data Contamination in the Era of Large Language Models. In the Forum on the new paradigm of software engineering under AIGC hosted by Chinasoft.

Jialun Cao