Science Cast

HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos

Yinhuai WangFebruary 3, 2026 10:06am

Views (207)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos

arXivPDFFebruary 2, 2026 12:00am

Authors

Yinhuai Wang, Qihan Zhao, Yuen Fui Lau, Runyi Yu, Hok Wai Tsui, Qifeng Chen, Jingbo Wang, Jiangmiao Pang, Ping Tan

Abstract

Enabling humanoid robots to perform agile and adaptive interactive tasks has long been a core challenge in robotics. Current approaches are bottlenecked by either the scarcity of realistic interaction data or the need for meticulous, task-specific reward engineering, which limits their scalability. To narrow this gap, we present HumanX, a full-stack framework that compiles human video into generalizable, real-world interaction skills for humanoids, without task-specific rewards. HumanX integrates two co-designed components: XGen, a data generation pipeline that synthesizes diverse and physically plausible robot interaction data from video while supporting scalable data augmentation; and XMimic, a unified imitation learning framework that learns generalizable interaction skills. Evaluated across five distinct domains--basketball, football, badminton, cargo pickup, and reactive fighting--HumanX successfully acquires 10 different skills and transfers them zero-shot to a physical Unitree G1 humanoid. The learned capabilities include complex maneuvers such as pump-fake turnaround fadeaway jumpshots without any external perception, as well as interactive tasks like sustained human-robot passing sequences over 10 consecutive cycles--learned from a single video demonstration. Our experiments show that HumanX achieves over 8 times higher generalization success than prior methods, demonstrating a scalable and task-agnostic pathway for learning versatile, real-world robot interactive skills.

TwitterandLinkedIn

0 comments

Add comment

HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos

HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments