IKEA Furniture Assembly Environment for Long-Horizon Complex Manipulation Tasks (1911.07246v1)

Published 17 Nov 2019 in cs.RO, cs.AI, cs.CV, and cs.LG

Abstract: The IKEA Furniture Assembly Environment is one of the first benchmarks for testing and accelerating the automation of complex manipulation tasks. The environment is designed to advance reinforcement learning from simple toy tasks to complex tasks requiring both long-term planning and sophisticated low-level control. Our environment supports over 80 different furniture models, Sawyer and Baxter robot simulation, and domain randomization. The IKEA Furniture Assembly Environment is a testbed for methods aiming to solve complex manipulation tasks. The environment is publicly available at https://clvrai.com/furniture

Citations (106)

View on Semantic Scholar

Summary

The paper presents a novel simulation framework that benchmarks reinforcement learning in intricate furniture assembly tasks.
It integrates MuJoCo physics and Unity3D rendering to support diverse furniture models and widely-used robotic platforms like Sawyer and Baxter.
The paper demonstrates practical advances in perception, control, and planning, offering a path to improve sim-to-real transfer in robotics.

Overview of the IKEA Furniture Assembly Environment

The paper presents the IKEA Furniture Assembly Environment, a comprehensive benchmark designed to advance reinforcement learning (RL) in complex manipulation tasks. By simulating the challenges associated with assembling IKEA furniture, this environment introduces a suite of sophisticated tasks requiring advanced perception, planning, and control.

Key Features

The environment is notable for its simulation of over 80 diverse furniture models and the inclusion of widely used robotic platforms such as Sawyer and Baxter. Its architecture supports domain randomization, essential for sim-to-real transfer, by allowing variations in physics, lighting, and textures.

Research Applications

The environment facilitates exploration in several research areas:

Perception: Researchers can tackle problems such as 3D object detection, pose estimation, and scene graph generation. The environment provides synthetic data, bolstering projects in computer vision.
Control: The complex manipulation required for furniture assembly provides a rigorous test for multi-agent RL, imitation learning, and model-based RL approaches. The environment's support for domain randomization helps in training robust policies.
Planning: The tasks simulate long-horizon challenges, where hierarchical RL methodologies can be tested to optimize task segmentation and completion strategies.

Environment Implementation

Developed using MuJoCo for physics simulation and Unity3D for rendering, the environment combines accurate physics with visual realism. This dual-framework approach enables simultaneous control of robotic arms and realistic graphics, crucial for training transferable RL models.

Experimental Setup

The assembly process is modularly designed: selecting parts, grasping, aligning connectors, and attaching. This framework is iterated until the furniture is complete. The simulation adheres to the OpenAI Gym protocol, supporting diverse observations and configurable reward functions.

Future Prospects

The paper highlights avenues for future research, such as multi-agent collaboration, realistic part attachment, and integrating domain knowledge through instruction manuals or demonstrations. Advanced tasks like tool use and real-world robot support are potential expansions.

Limitations and Considerations

Current limitations include non-physical simulation of attachment processes and constraints preventing interchangeable use of identical parts. Addressing these could enhance realism and applicability in real-world scenarios.

Conclusions and Implications

The IKEA Furniture Assembly Environment positions itself as a critical resource for developing and benchmarking RL algorithms in complex domains. It lays the foundation for significant advancements in robotic perception, planning, and control, potentially impacting areas like autonomous manufacturing and adaptive robotic systems. Future research could focus on overcoming the outlined limitations and exploring high-level planning and control strategies.

PDF Markdown