
Assistant Professor
AI Thrust, HKUST (GZ)
[email protected]
https://scholar.google.com/citations?hl=en&user=qb9STggAAAAJ
https://www.xiaohongshu.com/user/profile/63e87c990000000027028efa
:linkedin: Linkedin
Dr. Yao SHU (舒瑶) is now a tenure-track assistant professor of Artificial Intelligence Thrust at The Hong Kong University of Science and Technology (Guangzhou), starting from April 8, 2025. He is also affiliated with The Hong Kong University of Science and Technology (HKUST). Dr. SHU was previously a researcher at the Guangdong Lab of AI and Digital Economy (SZ) under the Genius Nova Programme. Before that, he was a senior researcher (Tencent Technical Fellow) at Tencent in 2023 and a Research Fellow (PostDoc.) at the National University of Singapore (NUS) in 2022. He received his PhD from the NUS School of Computing in 2022 under the esteemed guidance of Prof. Bryan Low (Associate Vice President (AI) of NUS and Deputy Director of NUS AI Institute), where he was awarded the IMDA Excellence in Computing Prize for the best PhD thesis in the school, as well as the Dean's Graduate Research Excellence Award for significant research achievements during his doctoral studies. He earned his B.Eng. from Huazhong University of Science and Technology (HUST) in 2017, advised by Prof. Kun He (何琨). Dr. SHU's research focuses on developing advanced learning and optimization algorithms that are both theoretically sound and practically useful. He is publishing actively in top AI conferences, including ICML, NeurIPS, ICLR, AAAI, ACL, EMNLP, and UAI. He also serves as a reviewer for leading AI conferences such as ICML, NeurIPS, ICLR, AAAI, IJCAI, UAI, AISTATS, AAMAS, etc., and was recognized as a NeurIPS & UAI 2025 Top Reviewer and ICLR 2025 Notable Reviewer. Furthermore, he is invited to serve as an Area Chair for ICLR.
Experience
Publication
Blogs
XPLOR Lab
Research Vision
Theory-First Learning & Optimization for Artificial Intelligence
We aim to develop foundational principles and practically useful algorithms that reshape how AI systems are understood and optimized. By building unifying theoretical frameworks and translating them into more efficient, robust, and adaptive algorithms, we seek to advance the scalability and reliability of large-scale AI systems, especially large language models and intelligent agents.
Research Pillars
1. Unified and Improved Black-Box Optimization and Decision-Making
We establish a coherent oracle framework covering ZOO, BO, online optimization, bandits, and RL, and leverage it to design improved optimization algorithms that outperform classical techniques in efficiency and reliability.
2. Learning Theory and Algorithmic Principles
We advance general learning theory to explain generalization, stability, robustness, and adaptation in complex environments. Guided by these insights, we develop algorithms that are provably sound and practically beneficial for large-scale learning under non-ideal conditions.
3. Algorithmic Impact via LLMs and Agents
We validate and extend theoretical advances by applying them to LLMs and AI agents. This creates a feedback loop where theoretical principles inspire new algorithms, and real-world applications reveal new theoretical questions—ultimately improving efficiency, effectiveness, personalization, and safety in deployed AI.
Research Topics
Research Interests
Recent News
- 2025.11: We have 1 paper OR-MIA accepted to AAAI’26!
- 2025.11: Our paper PAFT wins the SAC Highlights Award at EMNLP 2025!
- 2025.10: I am honored to be listed as a TOP reviewer for NeurIPS’25!
- 2025.09: We have 2 conference paper and 2 workshop paper accepted to NeurIPS’25!
- 2025.09: Our XPLOR Lab proudly secured the Second Prize in the Medical Large Model Fine-tuning and Privacy Protection track at the Second SecretFlow Cup Data Challenge (Reference)!
- 2025.09: I am honored to be awarded the 2025 CCF-Huawei Populus Euphratica Forest Fund (Special Fund for Theoretical Computer Science and Computational Economics).
- 2025.08: I am honored and excited to be invited as an area chair (AC) for ICLR’26! I look forward to contributing my best to the community.
- 2025.08: We have 1 paper (PAFT) accepted to EMNLP’25 Main as ORAL presentation (released in 2025.09)!
- 2025.07: I am honored to be listed as a TOP reviewer with free registration for UAI’25!
- 2025.06: We have 3 paper (ZoAR, ReDit, OR-MIA) accepted to the workshops @ ICML’25!
- 2025.05: We have 4 paper (R-AdaZO, Ferret, Multinoulli-SCG, WMarkGPT) accepted to ICML’25, and 1 paper (Flexora) accepted to ACL’25! I am honored to be listed as a notable reviewer for ICLR’25!
- 2025.04: I am joining AI Thrust at HKUST (GZ) officially!
- 2025.03: Our EXPO paper is accepted to Reasoning and Planning for LLMs @ ICLR’25!
- 2024.10: Our Ferret paper is accepted to FL@FM-NeurIPS’24 as an oral paper and our Flexora is accepted to FITML-NeurIPS’24!
- 2024.09: Our two papers on “Prompt Optimization” (ZOPO as spotlight!) and one paper on “Parallelized First-Order Optimization” are accepted to NeurIPS 2024!
- 2024.09: Our position paper on “Data-Centric AI in LLMs” is accepted to EMNLP 2024 Findings!
- 2024.09: Our Ferret paper is now available! Come and see how it significantly enhances the federated full-parameter tuning of Large Language Models!
- 2024.06: Our one paper on “Heterogeneous Federated Zeroth-Order Optimization” is accepted to Differentiable Almost Everything workshop @ ICML 2024 and two paper on “Prompt Optimization” are accepted to In-Context Learning workshop @ ICML 2024!
- 2024.05: Our paper on “Prompt Optimization” is accepted to ICML 2024!
- 2024.01: Our paper on “Training-free NAS” is accepted to ICLR 2024!
Presentations
Presentations
Honors & Awards
- Top Reviewer, NeurIPS & UAI, 2025
- Notable Reviewer, ICLR, 2025 (reference)
- Valedictorian for the Class of Ph.D. Graduates, School of Computing, NUS, 2023 (script)
- IMDA Excellence in Computing Prize (Best Ph.D. Thesis), School of Computing, NUS, 2023 (reference)
- Dean’s Graduate Research Excellence Award, School of Computing, NUS, 2022 (reference)
- 2nd prize of 5th AutoML Challenge (AutoML for Temporal Relational Data) in the KDD Cup 2019 provided by 4Paradigm, ChaLearn and Microsoft, June 2019 (our solution, reference)
- Honor of Outstanding Student, HUST, 2015
- 2nd prize for The Chinese Mathematics Competitions (Non-professional), 2013
Funding Support
- Principal Investigator, CCF-Huawei Populus Euphratica Forest Fund (Special Fund for Theoretical Computer Science and Computational Economics), 2025
Academic Services
Reviewer
Area Chair