News·Unclaimed·

ACE-Ego-0: Unifying Egocentric Human and Robotic Data for VLA Pretraining

arXiv:2606.17200v1 Announce Type: new Abstract: Vision-Language-Action (VLA) models benefit from large-scale and diverse embodied data, yet scaling robot trajectory collection is costly and labor-intensive. Recent advances show that large-scale egocentric human videos provide complementary real-wor

via RSS