I'm Mutian Tong, an incoming CS PhD student at CIS, University of Pennsylvania. I'm currently a research assistant under the supervision of Professor Jiatao Gu. Previously, I completed my undergraduate studies at Columbia College, Columbia University in New York, with a degree in Mathematics and Computer Science. I was fortunate to be advised by Professor Changxi Zheng in the Columbia Computer Graphics Group, where I also learnt a lot from Dr. Rundi Wu. My research interests include 3D/4D world modeling, video generation, and vision for embodied tasks. You are welcome to contact me by email.
Research
PointAction: 3D Points as Universal Action Representations for Robot Control
Mutian Tong*, Han Jiang*, Qiao Feng, Lingjie Liu, Jiatao Gu
In Submission, 2026
PointAction bridges the gap between generative video models and robust robot control by jointly predicting RGB frames and dynamic 3D pointmaps, which serve as a universal, geometry-aware action interface.