Working Experience and Education: I am currently a Ph.D. student in computer science at the School of Data Science, The Chinese University of Hong Kong, Shenzhen (CUHK-SZ), supervised by Prof. Guiliang Liu. I received my Bachelor's degree from the School of Computer Science and Technology, Beijing University of Posts and Telecommunications (BUPT).
Research Interests: Reinforcement learning and related topics, including:
- Real-World Reinforcement Learning
- Vision-Language-Action models for robotic manipulation
- AI Scientist agent systems
My work has been published in top-tier international AI conferences such as ICLR, ICML, and NeurIPS. Feel free to reach out via email or WeChat to discuss ideas or collaborate.
Education
- The Chinese University of Hong Kong, Shenzhen, China. Ph.D. in Computer Science, Aug. 2023 – Present. Supervisor: Prof. Guiliang Liu
- Beijing University of Posts and Telecommunications, China. Bachelor in Computer Science and Technology, Aug. 2019 – Jun. 2023
News
- 2026.06: Two papers about Sim2Real locomotion and robotic skill reuse accepted by IROS 2026.
- 2026.05: One paper about Human-in-the-loop Real-World Reinforcement Learning accepted by ICML 2026.
- 2026.04: One paper about Agent Error Recovery for Robotic Manipulation accepted by RSS 2026.
- 2026.01: One paper about Sim-to-Real human-robot interaction accepted by ICRA 2026.
- 2026.01: One paper about Sim-to-Real reinforcement learning for humanoid locomotion accepted by ICLR 2026.
- 2025.12: Top 1.3% (6/463) in the Tencent AI Arena Global Open Competition (Reinforcement Learning Embodied-AI Track).
- 2025.12: Core contributor to the agent project aiXiv: A Next-Generation Open Access Ecosystem for Scientific Discovery Generated by AI Scientists, which was covered by Science News.
- 2024.07: One paper about multi-vehicle interaction with game theory accepted by ECCV 2024.
- 2023.09: One paper about inverse constrained reinforcement learning accepted by NeurIPS 2023.
Selected Publications
$*$ denotes equal contribution, $\dagger$ denotes corresponding author






Honors
- 2025.12: Top 1.3% (6/463) in the Tencent AI Arena Global Open Competition (Reinforcement Learning Embodied-AI Track) (CNY 15000)
- 2024.11: Duan Yong Ping Meritorious Travel Award (CNY 10000)
- 2024.11: Duan Yong Ping Meritorious Research Award (CNY 5000)
- 2022.12: The Chinese University of Hong Kong Shenzhen Scholarship (CNY 6000)
- 2021.10: Gold Prize, China Internet Plus Innovation and Entrepreneurship National Competition (CNY 20000)
- 2020-2022: Beijing University of Posts and Telecommunications Merit Student (CNY 500)
- 2020-2022: Beijing University of Posts and Telecommunications Scholarship (CNY 3000)
Internships
- DexForce Technology, China. Embodied-AI Research Intern, Aug. 2025 – Present
- Wangxuan Institute of Computer Technology, Peking University, China. Research Assistant, Mar. 2022 – Nov. 2022. Supervisor: Prof. Xiaoqing Lyu
Service
- Conference Reviewer:
  - AAAI (2024), ICLR (2025), ICCV (2025), NeurIPS (2025, 2026)
- Journal Reviewer:
  - IEEE Transactions on Artificial Intelligence (T-AI)
  - Robotics and Computer-Integrated Manufacturing