Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

EgoZero: Robot Learning from Smart Glasses (2505.20290v2)

Published 26 May 2025 in cs.RO and cs.AI

Abstract: Despite recent progress in general purpose robotics, robot policies still lag far behind basic human capabilities in the real world. Humans interact constantly with the physical world, yet this rich data resource remains largely untapped in robot learning. We propose EgoZero, a minimal system that learns robust manipulation policies from human demonstrations captured with Project Aria smart glasses, $\textbf{and zero robot data}$. EgoZero enables: (1) extraction of complete, robot-executable actions from in-the-wild, egocentric, human demonstrations, (2) compression of human visual observations into morphology-agnostic state representations, and (3) closed-loop policy learning that generalizes morphologically, spatially, and semantically. We deploy EgoZero policies on a gripper Franka Panda robot and demonstrate zero-shot transfer with 70% success rate over 7 manipulation tasks and only 20 minutes of data collection per task. Our results suggest that in-the-wild human data can serve as a scalable foundation for real-world robot learning - paving the way toward a future of abundant, diverse, and naturalistic training data for robots. Code and videos are available at https://egozero-robot.github.io.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Vincent Liu (33 papers)
  2. Ademi Adeniji (6 papers)
  3. Haotian Zhan (1 paper)
  4. Raunaq Bhirangi (10 papers)
  5. Pieter Abbeel (372 papers)
  6. Lerrel Pinto (81 papers)
  7. Siddhant Haldar (15 papers)

Summary

Analysis of Paper (Liu et al., 26 May 2025 )v1

The paper (Liu et al., 26 May 2025 )v1, though currently inaccessible due to a technical issue in the automated PDF conversion system, has been recognized as a substantial contribution by receiving support from the Simons Foundation and affiliated institutions. The inability to obtain the PDF format necessitates reliance on alternative sources or direct author contact for full comprehension of the paper's content. Nonetheless, the significance of this paper can be gauged from its association with reputable entities.

Overview and Contributions

While the complete abstract or full text of the paper is unavailable, identifying a paper number underlines that a structured investigation or theoretical proposition was presented, aimed at contributing to an existing body of knowledge. Such support often signifies that the paper addresses complex mathematical or scientific inquiries with potential implications for ongoing research in its respective field.

Implications for Future Research

incapacity to access the document suggests the impetus for further development in digital archive accessibility. Researchers might explore alternative methods for disseminating work to circumvent technical issues, ensuring rigorous peer engagement regardless of technological setbacks. Moreover, understanding the challenges encountered by the automated PDF conversion system may inspire developments in archiving technologies, optimizing them for better reliability across platforms.

Conclusion

In light of current constraints, exact insights into paper (Liu et al., 26 May 2025 )v1's contributions remain speculative until access is restored. However, its supported status allows for conjecture regarding its influence on theoretical advancements and computational methodologies. The paper's existence in the archive, awaiting technical resolution, reminds the researcher community of the persistent necessity for meticulous management and dissemination of scholarly work in digital repositories.

Github Logo Streamline Icon: https://streamlinehq.com

GitHub

Youtube Logo Streamline Icon: https://streamlinehq.com