Papers

Topics

Authors

Recent

View all

Assistant

AI Research Assistant

Well-researched responses based on relevant abstracts and paper content.

Custom Instructions Pro

Preferences or requirements that you'd like Emergent Mind to consider when generating responses.

Gemini 2.5 Flash

Gemini 2.5 Flash 82 tok/s

Gemini 2.5 Pro 45 tok/s Pro

GPT-5 Medium 25 tok/s Pro

GPT-5 High 36 tok/s Pro

GPT-4o 110 tok/s Pro

Kimi K2 207 tok/s Pro

GPT OSS 120B 469 tok/s Pro

Claude Sonnet 4.5 36 tok/s Pro

2000 character limit reached

BiAssemble: Learning Collaborative Affordance for Bimanual Geometric Assembly (2506.06221v2)

Published 6 Jun 2025 in cs.RO and cs.LG

Abstract: Shape assembly, the process of combining parts into a complete whole, is a crucial robotic skill with broad real-world applications. Among various assembly tasks, geometric assembly--where broken parts are reassembled into their original form (e.g., reconstructing a shattered bowl)--is particularly challenging. This requires the robot to recognize geometric cues for grasping, assembly, and subsequent bimanual collaborative manipulation on varied fragments. In this paper, we exploit the geometric generalization of point-level affordance, learning affordance aware of bimanual collaboration in geometric assembly with long-horizon action sequences. To address the evaluation ambiguity caused by geometry diversity of broken parts, we introduce a real-world benchmark featuring geometric variety and global reproducibility. Extensive experiments demonstrate the superiority of our approach over both previous affordance-based and imitation-based methods. Project page: https://sites.google.com/view/biassembly/.

Summary

The paper introduces a novel framework for bimanual geometric assembly using a three-phase process that integrates pick-up, alignment, and assembly.
It employs point-level affordance modeling with SO(3)-equivariant representations to predict long-horizon actions, surpassing prior methods.
Experimental benchmarks demonstrate its superior generalization and reproducibility across diverse geometric configurations in robotic tasks.

BiAssemble: Learning Collaborative Affordance for Bimanual Geometric Assembly

This paper introduces BiAssemble, a novel framework aimed at addressing the intricate task of bimanual geometric assembly through the learning of collaborative affordance. Geometric assembly, which involves reconstructing objects from fractured parts, poses significant challenges in robotics due to the complexity of geometric cues required for successful manipulation. BiAssemble leverages point-level affordance modeling, enhancing it with awareness of long-horizon action sequences to achieve effective bimanual collaboration.

Overview and Methodology

BiAssemble’s approach is structured into three primary phases: pick-up, alignment, and assembly. The pick-up phase focuses on identifying optimal grasping points that facilitate subsequent manipulation steps. The alignment phase is critical for positioning fractured parts in a way that allows for seamless reassembly, avoiding part collisions. Finally, the assembly phase executes the rejoining of parts, ensuring precise alignment and contact.

A key innovation of this framework is its sophisticated affordance prediction model, designed to understand local geometry while simultaneously accounting for potential actions that follow. This is achieved through a simulation environment that allows the framework to derive point-level affordance predictions, tuned for collaborative bimanual manipulation. The authors incorporate an SO(3)-equivariant representation method to disentangle the geometric properties from pose variations, thereby enhancing the transferability of learned affordances across varied shapes and configurations.

Benchmarking and Results

To evaluate the efficacy of BiAssemble, the authors introduce a real-world benchmark characterized by diverse geometric configurations and global reproducibility. The benchmark enables consistent assessment of policy performance, bridging the gap between simulation and practical application. Experimental results demonstrate that BiAssemble surpasses previous affordance-based and imitation-based methodologies, particularly in its ability to generalize across a wide spectrum of object geometries.

Implications and Future Direction

The implications of this research are notable both practically and theoretically. Practically, BiAssemble's framework can contribute to advanced robotic systems capable of handling complex assembly tasks in real-world scenarios such as household repairs, archaeological reconstruction, and industrial applications involving irregularly shaped objects. Theoretically, the integration of long-horizon action prediction with point-level affordance modeling provides a robust foundation for developing more sophisticated robot manipulation strategies.

Future developments in AI could see the extension of these methodologies to more generalized environments, potentially incorporating machine learning techniques that allow robots to autonomously learn strategies for handling unseen object categories. Further exploration into reinforcement learning and transfer learning paradigms might also yield models with enhanced adaptability and efficiency in dynamic assembly contexts.

Overall, BiAssemble represents a significant stride in robotic geometric assembly, offering insights and tools crucial for advancing collaborative manipulation systems in complex environments.