Amplifying robotics capacities with a human touch: An immersive low-latency panoramic remote system (2401.03398v2)

Published 7 Jan 2024 in cs.CY and cs.RO

Abstract: AI and robotics technologies have witnessed remarkable advancements in the past decade, revolutionizing work patterns and opportunities in various domains. The application of these technologies has propelled society towards an era of symbiosis between humans and machines. To facilitate efficient communication between humans and intelligent robots, we propose the "Avatar" system, an immersive low-latency panoramic human-robot interaction platform. We have designed and tested a prototype of a rugged mobile platform integrated with edge computing units, panoramic video capture devices, power batteries, robot arms, and network communication equipment. Under favorable network conditions, we achieved a low-latency high-definition panoramic visual experience with a delay of 357 ms. Operators can utilize VR headsets and controllers for real-time immersive control of robots and devices. The system enables remote control over vast physical distances, spanning campuses, provinces, countries, and even continents (New York to Shenzhen). Additionally, the system incorporates visual SLAM technology for map and trajectory recording, providing autonomous navigation capabilities. We believe that this intuitive system platform can enhance efficiency and situational experience in human-robot collaboration, and with further advancements in related technologies, it will become a versatile tool for efficient and symbiotic cooperation between AI and humans.
