PAFT: Prompt-Agnostic Fine-Tuning. Chenxing Wei*, Yao Shu*, Mingwen Ou, Ying Tiffany He, Fei Richard Yu Key Words: Prompt-Agnostic, Fine-Tuning, Robustness [arXiv, HuggingFace, code]
Refining Adaptive Zeroth-Order Optimization at Ease. Yao Shu, Qixin Zhang, Kun He, Zhongxiang Dai Key Words: Zeroth-Order Optimization, Adaptive Method, Variance Reduction [arXiv]
Meta-Prompt Optimization for LLM-Based Sequential Decision Making. Mingze Kong, Zhiyong Wang, Yao Shu, Zhongxiang Dai Key Words: Large Language Models, Decision-Making Workshop on Reasoning and Planning for Large Language Models @ ICLR, 2025 [arXiv]
FSL-Rectifier: Rectify Outliers in Few-Shot Learning via Test-Time Augmentation. Yunwei Bai, Ying Kiat Tan, Shiming Chen, Yao Shu, Tsuhan Chen Key Words: Test-Time Augmentation, Few-Shot Learning In The 39th Annual AAAI Conference on Artificial Intelligence (AAAI), 2025 [arXiv, pdf]
Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models. Yao Shu*, Wenyang Hu*, See-Kiong Ng, Bryan Kian Hsiang Low, Fei Richard Yu Key Words: Large Language Models, Federated Full-Parameter Tuning Workshop on Federated Foundation Models @ NeurIPS (Oral), 2024 [HuggingFace, arXiv, code]
Flexora: Flexible Low Rank Adaptation for Large Language Models. Chenxing Wei*, Yao Shu*, Ying Tiffany He, Fei Richard Yu Key Words: Layer Selection, Parameter-Efficient Fine-Tuning, Large Language Models Workshop on Fine-Tuning in Modern Machine Learning @ NeurIPS, 2024 [arXiv, code]
Localized Zeroth-Order Prompt Optimization. Wenyang Hu*, Yao Shu*, Zongmin Yu, Zhaoxuan Wu, Xiangqiang Lin, Zhongxiang Dai, See-Kiong Ng, Bryan Kian Hsiang Low Key Words: Prompt Optimization, Zeroth-Order Optimization, Neural Tangent Kernel, Large Language Models Workshop on In-Context Learning @ ICML, 2024 In The 38th Conference on Neural Information Processing Systems (NeurIPS Spotlight), 2024 [arXiv, pdf, code]
Prompt Optimization with EASE? Efficient Ordering-aware Automated Selection of Exemplars. Zhaoxuan Wu, Xiaoqiang Lin, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Bryan Kian Hsiang Low Key Words: Prompt Optimization, Neural Bandit, Large Language Models Workshop on In-Context Learning @ ICML, 2024 In The 38th Conference on Neural Information Processing Systems (NeurIPS), 2024 [arXiv, pdf, code]
OptEx: Expediting First-Order Optimization with Approximately Parallelized Iterations. Yao Shu#, Jiongfeng Fang, Ying Tiffany He, Fei Richard Yu Key Words: First-Order Optimization, Parallelization In The 38th Conference on Neural Information Processing Systems (NeurIPS), 2024 [arXiv, pdf, code]
Data-Centric AI in the Age of Large Language Models. Xinyi Xu, Zhaoxuan Wu, Rui Qiao, Arun Verma, Yao Shu, Jingtan Wang, Xinyuan Niu, Zhenfeng He, Jiangwei Chen, Zijian Zhou, Gregory Kang Ruey Lau, Hieu Dao, Lucas Agussurja, Rachael Hwee Ling Sim, Xiaoqiang Lin, Wenyang Hu, Zhongxiang Dai, Pang Wei Koh, Bryan Kian Hsiang Low Key Words: Data-Centric AI, Large Language Models In The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP Findings), 2024 [arXiv]
Heterogeneous Federated Zeroth-Order Optimization using Gradient Surrogates. Yao Shu, Xiaoqiang Lin, Zhongxiang Dai, Bryan Kian Hsiang Low Key Words: Zeroth-Order Optimization, Federated Optimization, Heterogeneity Workshop on Differentiable Almost Everything @ ICML, 2024 [arXiv, pdf, code]
Use Your INSTINCT: INSTruction optimization usIng Neural bandits Coupled with Transformers. Xiaoqiang Lin*, Zhaoxuan Wu*, Zhongxiang Dai, Wenyang Hu, Yao Shu, See-Kiong Ng, Patrick Jaillet, Kian Hsiang Low Key Words: Prompt Optimization, Neural Bandit, Large Language Models Workshop on Instruction @ NeurIPS, 2023 In The 41st International Conference on Machine Learning (ICML), 2024 [arXiv, pdf, code]
Data valuation in federated learning. Zhaoxuan Wu, Xinyi Xu, Rachael Hwee Ling Sim, Yao Shu, Xiaoqiang Lin, Lucas Agussurja, Zhongxiang Dai, See-Kiong Ng, Chuan-Sheng Foo, Patrick Jaillet, Trong Nghia Hoang, Kian Hsiang Low Key Words: Data Valuation, Federated Learning Chapter 15 of Federated Learning: Theory and Practice, pages 281-296, Academic Press, 2024
Robustifying and Boosting Training-Free Neural Architecture Search. Zhenfeng He, Yao Shu#, Zhongxiang Dai, Bryan Kian Hsiang Low Key Words: Training-Free Neural Architecture Search In The 12th International Conference on Learning Representations (ICLR), 2024 [arXiv, pdf, code]
Exploiting Correlated Auxiliary Feedback in Parameterized Bandits. Arun Verma, Zhongxiang Dai, Yao Shu, ****Kian Hsiang Low Key Words: Parameterized Bandit In The 37th Conference on Neural Information Processing Systems (NeurIPS), 2023 [arXiv, pdf]