Yuxin Yao

Welcome to my website. I am Yuxin Yao, a 3rd Year PhD student in Information Engineering at University of Cambridge, supervised by Prof. Joan Lasenby. I am currently work with Elliott(Shangzhe) Wu. I obtained my Master of Engineering degree integrated with my bachelor at University College London.

My current research focus on world modelling and dynamical 3D reconstruction for simulation, specifically involving prior from large foudation models into embodied AI agents.

I am currently seeking for collaborations on project in these fields. Please contact me if you are interested!

Project Experience

Particulate: Feed-Forward 3D Object Articulation

Ruining Li*, Yuxin Yao*, Chuanxia Zheng, Christian Rupprecht, Joan Lasenby, Shangzhe Wu, Andrea Vedaldi

CVPR 2026 In Proceeding

Project Page / Paper / GitHub / HuggingFace

TLDR: Particulate is a feed-forward approach that, given a single static 3D mesh of an everyday object, directly infers all attributes of the underlying articulated structure, including its 3D parts, kinematic structure, and motion constraints.

Gaussian Splatting based Camera Pose Estimation

Yuxin Yao, Yan Zhang, Zhening Huang, Joan Lasenby

CVPR workshop 2025

Project Page / Paper / GitHub

TLDR: SmallGS processes a dynamic video with small baseline to obtain the camera poses. It leverages Gaussian splatting to optimize camera poses while mitigating dynamic object interference via predicted semantic masks. It does not rely on 3D alignments or triangulation, alleviating the instability in camera pose estimation caused by limited parallax and weak geometric constraints.

AutoKeyframe: Autoregressive Keyframe Generation for Human Motion Synthesis and Editing

Bowen Zheng, Ke Chen, Yuxin Yao, Zijiao Zeng, Xinwei Jiang, He Wang, Joan Lasenby, Xiaogang Jin

ACM SIGGRAPH 2025

Paper / GitHub

TLDR: AutoKeyFrame presents a novel framework designed to automate human motion generation while retaining precise artistic control. Unlike methods that generate full sequences at once, this approach generates keyframes (critical poses) sequentially based on a user-defined path (root trajectory) and action labels, which are then connected using motion in-betweening methods.

Simplifying and Generalising Equivariant Geometric Algebra Networks

Yuxin Yao, Christian Hockey, Joan Lasenby

9th Conference on Applied Geometric Algebras in Computer Science and Engineering (Amsterdam, NL).

Paper

TLDR: Developed CGATr, a simplified and generalised equivariant Geometric Algebra Transformer with a generalised signature. Applied it to protein structure prediction, N-body dynamics, and camera pose estimation, demonstrating strong potential for geometric deep learning.

Unsupervised Visual Relocalization

Final year project · Supervised by Prof. Simon Julier at UCL

Paper / GitHub

TLDR: Implemented unsupervised metric relocalization using transform consistency loss. Used direct image alignment and Gauss-Newton optimization on feature maps, a U-Net for feature and saliency maps, and generated training data with CARLA.

Human Motion Prediction on Egocentric Dataset

Supervised by Prof. Siyu Tang at ETH Zurich Computer Vision and Learning Group

GitHub

TLDR: Trained a motion prior on the egocentric EgoBody dataset to predict 8–9 future frames from 1–2 initial frames. Used SMPL-X/SMPL and the GAMMA model (conditional VAE with DLow and GRU) on AMASS and EgoBody.

For more previous projects and detailed description, please check my CV

Publication

Hockey, C., Yao, Y., Lasenby, J. Simplifying and Generalising Equivariant Geometric Algebra Networks. The 9th conference on Applied Geometric Algebras in Computer Science and Engineering (Abstract accepted )
Chen, H., Li, Z., Yao, Y. (2022, November). Multi-agent reinforcement learning for fleet management: a survey. In 2nd International Conference on Artificial Intelligence, Automation, and High-Performance Computing (AIAHPC 2022) (Vol. 12348, pp. 611-624). SPIE.
Yan, Y., Schaffter, T., Bergquist, T., …Yao, Y..,… DREAM Challenge Consortium. (2021). A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization. JAMA network open, 4(10), e2124946-e2124946.