Prehensile whole-body throwing
- Prehensile whole-body throwing is a robotics capability in which a robot coordinates dynamic motion of its entire body to propel objects, maintaining a secure grasp until a controlled release and hitting specific targets.
- This field utilizes hierarchical control architectures, combining learned policies with model-based techniques like MPC and trajectory optimization, to achieve complex dynamic tasks.
- Key achievements include significantly extended throwing range, improved accuracy, success rates that surpass human performance in controlled tests, and successful sim-to-real transfer across diverse robotic platforms and applications.
Prehensile whole-body throwing is a robotics capability that involves dynamically propelling an object using coordinated motion of a robot's entire body—including limbs, base, and manipulators—in a way that maintains a secure grasp (“prehension”) until a controlled release and achieves specified landing targets. This ability extends the robot’s effective operational reach, supports robust interaction with dynamic environments, and presents unique challenges in modeling, control, learning, and transferability to physical systems.
1. Core Principles and Problem Definition
Prehensile whole-body throwing requires robots to integrate manipulation and whole-body kinematic/dynamic coordination. Unlike arm-only throwing, these approaches exploit body mass, momentum, and contact to maximize throwing range, achieve higher accuracy, and retain stability post-release. Typical goals include:
- Extending object delivery beyond the robot’s kinematic reach.
- Maintaining prehensile (secure) grasp until controlled release.
- Adapting to diverse object properties, landing targets, and real-world disturbances.
The field addresses both the dynamic planning problem—generating timed, coordinated motions to achieve target release conditions—and the uncertainty problem—ensuring robust behavior in the presence of actuation, sensing, and model imperfections.
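To make the release-condition planning concrete, the following minimal sketch computes the release speed needed to reach a ground target under a drag-free point-mass model. The function names, fixed release angle, and bisection bounds are illustrative assumptions rather than the formulation used in any of the cited systems.

```python
import numpy as np

G = 9.81  # gravitational acceleration [m/s^2]

def landing_distance(release_height, speed, angle_rad):
    """Horizontal distance travelled by a point-mass projectile released
    `release_height` [m] above the landing plane at `speed` [m/s] and
    elevation `angle_rad` [rad], ignoring drag and spin (an assumption)."""
    vx = speed * np.cos(angle_rad)
    vz = speed * np.sin(angle_rad)
    # Flight time: positive root of z(t) = h + vz*t - 0.5*G*t^2 = 0.
    t_flight = (vz + np.sqrt(vz**2 + 2.0 * G * release_height)) / G
    return vx * t_flight

def required_release_speed(release_height, angle_rad, target_distance, tol=1e-4):
    """Bisection on release speed so the ballistic arc lands on the target.
    A real planner must additionally respect joint velocity limits,
    whole-body momentum, and gripper release timing."""
    lo, hi = 0.0, 50.0  # search bracket [m/s], illustrative
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if landing_distance(release_height, mid, angle_rad) < target_distance:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    v = required_release_speed(1.2, np.radians(40.0), target_distance=6.0)
    print(f"release speed for a 6 m target: {v:.2f} m/s")
```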
2. Hierarchical Control Architectures and Representations
Prehensile whole-body throwing primarily uses hierarchical architectures that separate high-level task reasoning from low-level motor execution:
- Hierarchical Motor Primitives:
- Low-level controllers are often formulated as probabilistic motor primitives derived from demonstrations (e.g., motion capture of throwing and catching) using encoder–decoder architectures. The Neural Probabilistic Motor Primitive (NPMP) model encodes trajectory intentions as latent variables and decodes actions from proprioceptive inputs, enabling reuse and transfer across tasks and environments (1911.06636).
- Task-Conditioned High-Level Policies:
- High-level policies process egocentric sensory streams (vision, proprioception), task instructions, and environment context, producing commands in a compressed latent skill space. LSTMs and specialized preprocessors aggregate temporal and sensory data, issuing goal-directed commands to the motor primitive modules; a minimal sketch of this two-level structure appears below.
- End-Effector Trajectory Interfaces:
- Cross-embodiment and cross-domain skill transfer are facilitated by manipulation policies that produce end-effector trajectories in task/world frames. These are tracked by whole-body controllers that realize the manipulation goal through full-body motion, regardless of the particular robot kinematics (2407.10353).
These architectural principles facilitate adaptability to diverse tasks (beyond original demonstration data), modularity for transfer learning, and robustness to unknown object or scene configurations.
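As a minimal sketch of the two-level decomposition described above, the snippet below pairs a task-conditioned recurrent high-level policy, which emits a command in a latent skill space, with an NPMP-style low-level decoder that maps that command plus proprioception to joint actions. All dimensions, layer sizes, and observation encodings are placeholders, not values from the cited work.

```python
import torch
import torch.nn as nn

LATENT_DIM = 16  # size of the learned skill space (illustrative)

class LowLevelDecoder(nn.Module):
    """Decodes a latent skill command plus proprioception into joint targets,
    in the spirit of NPMP-style motor primitives (1911.06636)."""
    def __init__(self, proprio_dim, action_dim):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(LATENT_DIM + proprio_dim, 256), nn.ELU(),
            nn.Linear(256, action_dim),
        )

    def forward(self, latent, proprio):
        return self.net(torch.cat([latent, proprio], dim=-1))

class HighLevelPolicy(nn.Module):
    """Task-conditioned recurrent policy emitting latent skill commands from
    a window of egocentric observations and a task descriptor."""
    def __init__(self, obs_dim, task_dim):
        super().__init__()
        self.lstm = nn.LSTM(obs_dim + task_dim, 128, batch_first=True)
        self.head = nn.Linear(128, LATENT_DIM)

    def forward(self, obs_seq, task, hidden=None):
        task_seq = task.unsqueeze(1).expand(-1, obs_seq.shape[1], -1)
        out, hidden = self.lstm(torch.cat([obs_seq, task_seq], dim=-1), hidden)
        return self.head(out[:, -1]), hidden

# One control step: the high level issues a latent command, the (typically
# frozen, demonstration-trained) low level decodes it into an action.
high = HighLevelPolicy(obs_dim=64, task_dim=8)
low = LowLevelDecoder(proprio_dim=32, action_dim=12)
obs_seq = torch.randn(1, 10, 64)   # 10 recent egocentric observations
task = torch.randn(1, 8)           # e.g. encoded target location
proprio = torch.randn(1, 32)
latent, _ = high(obs_seq, task)
action = low(latent, proprio)      # shape: (1, 12) joint targets
```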
3. Learning and Control Methodologies
A wide spectrum of control methodologies is employed:
- Model Predictive Control (MPC) and Trajectory Optimization:
- MPC schemes solve receding-horizon optimal control problems that combine nonlinear robot/object dynamics, collision avoidance (via soft barrier functions over signed distance fields), and contact constraints. MPC enables real-time planning for tasks such as weight throwing on mobile quadruped platforms, maintaining safety and achieving dynamic coordination between base and manipulator (2202.12385); a soft-barrier cost sketch appears after this list.
- Deep Reinforcement Learning (RL):
- RL-based methods model the task as a Markov Decision Process, using curriculum learning (progressive task difficulty and procedural variation) and reward shaping (distance-to-target, alignment, and grasping success). These policies are trained to generate and fine-tune dynamic, momentum-amplified whole-body throws (2410.05681). Domain randomization and staged RL frameworks address sim-to-real transfer and sample efficiency (2409.10319).
- Hybrid Learning-Model-Based Control:
- Some frameworks combine learning-based policies (deep RL, high-frequency residuals) with model-based trajectory or acceleration optimization, improving tracking accuracy and handling release uncertainty. A residual policy corrects nominal RL outputs at high rates (400 Hz), while an optimization-based module (“pullback tube acceleration”) ensures the end-effector passes through a robust manifold of release states (2506.16986).
- Probabilistic Model-Based Reinforcement Learning (MBRL):
- MBRL approaches, such as MC-PILOT, use Gaussian Process models to predict the probabilistic dynamics of the object post-release, incorporating explicit modeling of release errors and system delays. Policies are optimized through Monte Carlo rollouts, minimizing expected landing miss distance and rapidly adapting to new target configurations (2502.05595); a Monte Carlo release-uncertainty sketch appears after this list.
- Skill-Level Planning with Imitation Distillation:
- Long-horizon frameworks like SPIN plan at the skill/primitive level and use connectors (transition policies) to ensure smooth skill chaining. High-quality plans are generated and distilled into multi-modal diffusion policies for closed-loop real-time execution and zero-shot sim-to-real transfer (2502.18015).
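The fragment below sketches, under stated assumptions, the kind of soft-barrier stage cost such an MPC formulation might minimize: a quadratic tracking term toward a desired release state, an effort penalty, and a differentiable penalty that activates when sampled body points approach obstacles in a signed distance field. The weights, the softplus barrier form, and the sdf callable are illustrative choices, not the formulation of the cited controller (2202.12385).

```python
import numpy as np

def soft_barrier(sdf_value, margin=0.10, sharpness=50.0):
    """Smooth penalty that grows once the signed distance to the nearest
    obstacle drops below `margin` [m]; near zero when well clear. The
    numerically stable softplus keeps the cost differentiable."""
    return np.logaddexp(0.0, -sharpness * (sdf_value - margin)) / sharpness

def stage_cost(state, control, sdf, target_release,
               w_track=1.0, w_ctrl=1e-3, w_obs=10.0):
    """Illustrative receding-horizon stage cost: track a desired end-effector
    release state, penalize effort, and keep sampled body points outside
    obstacles. `sdf` maps a 3D point to its signed distance [m] and is
    assumed to come from a precomputed field."""
    track_err = np.concatenate([
        state["ee_pos"] - target_release["pos"],
        state["ee_vel"] - target_release["vel"],
    ])
    cost = w_track * track_err @ track_err + w_ctrl * control @ control
    for p in state["body_points"]:  # sampled collision points on base and arm
        cost += w_obs * soft_barrier(sdf(p))
    return cost

# Example evaluation with a spherical obstacle of radius 0.3 m at the origin.
sdf = lambda p: np.linalg.norm(p) - 0.3
state = {"ee_pos": np.array([0.5, 0.0, 1.0]),
         "ee_vel": np.array([2.0, 0.0, 3.0]),
         "body_points": [np.array([0.4, 0.0, 0.6])]}
target = {"pos": np.array([0.6, 0.0, 1.2]), "vel": np.array([4.0, 0.0, 4.0])}
print(stage_cost(state, np.zeros(6), sdf, target))
```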
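As a simplified stand-in for the probabilistic release modeling above, the sketch below estimates the expected landing miss distance by Monte Carlo sampling of release-time and release-velocity errors around a nominal release state, under a drag-free ballistic model. The noise magnitudes and the error model are illustrative assumptions; the cited method learns post-release dynamics with Gaussian Processes rather than assuming them (2502.05595).

```python
import numpy as np

G = 9.81
rng = np.random.default_rng(0)

def landing_point(release_pos, release_vel):
    """Point-mass ballistic landing position on the z = 0 plane (no drag)."""
    px, py, pz = release_pos
    vx, vy, vz = release_vel
    t = (vz + np.sqrt(vz**2 + 2.0 * G * pz)) / G
    return np.array([px + vx * t, py + vy * t])

def expected_miss(nominal_pos, nominal_vel, target_xy,
                  delay_std=0.01, speed_std=0.05, n_samples=500):
    """Monte Carlo estimate of the mean landing miss distance when the
    gripper opens with a random timing error (shifting the release position
    along the nominal velocity) and the release speed itself is noisy."""
    misses = []
    for _ in range(n_samples):
        dt = rng.normal(0.0, delay_std)                     # release-time error [s]
        vel = nominal_vel * (1.0 + rng.normal(0.0, speed_std))
        pos = nominal_pos + nominal_vel * dt                # early/late release
        misses.append(np.linalg.norm(landing_point(pos, vel) - target_xy))
    return float(np.mean(misses))

print(expected_miss(np.array([0.0, 0.0, 1.2]),
                    np.array([5.0, 0.0, 4.0]),
                    target_xy=np.array([5.3, 0.0])))
```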
4. Robustness, Generalization, and Sim-to-Real Transfer
Robust, generalizable performance is a central objective:
- Procedural Task Variation: Training with wide randomization of object properties (mass, size), trajectories, scene layouts, and initial robot states enables generalization beyond demonstration datasets (1911.06636, 2410.05681, 2409.10319); a sampling sketch follows this list.
- Modular Design and Object-Agnostic Controllers: Policies that rely only on proprioception and egocentric vision, without explicit object state at runtime, transfer more readily to real-world deployments (1911.06636, 2407.10353).
- Explicit Uncertainty and Delay Handling: Robust sim-to-real transfer requires explicit modeling of actuation lags, sensing noise, and model errors, e.g., via online estimation of release time distributions or domain randomization (2502.05595, 2506.16986).
- Cross-Embodiment Skill Transfer: Policies designed around end-effector/task-frame outputs can be “plugged” into new robot morphologies (arms to legged platforms), achieving zero-shot deployment (2407.10353).
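A typical procedural-variation setup resamples object, scene, and actuation parameters at every training episode; the sketch below shows the general pattern, with parameter names and ranges chosen as placeholders rather than values reported in the cited papers.

```python
import numpy as np

rng = np.random.default_rng()

def sample_episode_params():
    """Per-episode randomization for sim-to-real throwing: drawing object,
    target, and actuation properties from broad ranges discourages the
    policy from overfitting one simulator configuration. All ranges below
    are illustrative placeholders."""
    return {
        "object_mass_kg":    rng.uniform(0.05, 1.5),
        "object_radius_m":   rng.uniform(0.02, 0.08),
        "friction_coeff":    rng.uniform(0.4, 1.2),
        "target_xy_m":       rng.uniform([-2.0, 2.0], [2.0, 8.0]),
        "actuation_delay_s": rng.uniform(0.0, 0.03),
        "base_yaw_rad":      rng.uniform(-np.pi, np.pi),
    }

# Each training episode resets the simulator with a fresh draw:
params = sample_episode_params()
print(params)
```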
Empirical results show success rates exceeding 70% on real mobile quadrupeds (tossing), error reductions of over 40% in distance targeting with full-body versus arm-only approaches (2410.05681, 2506.16986), and rapid policy adaptation to new targets with high (>95%) accuracy in MBRL frameworks (2502.05595).
5. Key Performance Metrics and Comparative Results
Whole-body throwing performance is assessed by:
- Landing Error: Mean deviation between object landing and target location; e.g., ~0.28 m error at 6 m range with high-frequency residual control (2506.16986).
- Throwing Distance: Full-body policies achieve throws up to 13.26 m on real humanoids (compared to 6.75 m with arm-only), nearly doubling operational workspace (2410.05681).
- Velocity Tracking: Achieved end-effector velocity errors as low as 0.398 m/s at high throw speeds (2506.16986).
- Success Rate: In controlled studies, robot policies can outperform human participants, achieving success rates of 56.8% versus 15.2% for humans when hitting small, random targets at 3–5 m (2506.16986).
- Sample/Data Efficiency: Model-based approaches achieve near-100% accuracy on arbitrary targets with as few as 5–10 trials, greatly reducing data requirements compared to purely model-free RL (2502.05595).
A summary table of improvement metrics from recent experimental studies:
Metric | Baseline (arm-only / human) | Whole-body robot | Improvement
---|---|---|---
Max throw distance | ~6.75 m (arm-only) | 13.26 m | ~96% increase
Mean landing error | 0.64 m (arm-only) | 0.36 m | ~44% reduction
Success rate (small targets, 3–5 m) | 15.2% (human) | 56.8% | ~3.7× improvement
6. Applications and Future Research Directions
Prehensile whole-body throwing expands the application range of robots for:
- Warehouse and Logistics: Delivering objects across obstacles, extending manipulation range in cluttered or dynamic environments (2402.16045).
- Human–Robot and Robot–Robot Collaboration: Dynamic handover, catching, and passing in shared spaces (2309.05655).
- Rescue and Construction: Rapid supply or tool delivery, rubble clearing, and dynamic placement in unstructured environments (2410.05681).
- Sports and Dexterous Agility: Agile catching and throwing with humanoid or mobile platforms, matching human-level performance (2410.05681, 2409.10319).
Continued advances are focusing on:
- Enhanced sim-to-real robustness and online adaptation (e.g., tactile integration, 3D acceleration tracking, object inertial property estimation).
- Task decomposition for multi-step delivery and collaborative throwing/catching.
- Broader morphological transfer—porting skills between different robots and embodiments using unified policy architectures (2407.10353).
- Flexible integration of model-based and model-free elements for improved robustness, multimodality, and data efficiency (2306.08205, 2502.05595).
7. Biological Inspiration and Analysis
Research draws on principles from mammalian motor neuroscience:
- Hierarchical, modular control architectures are analogous to the organization of biological motor systems (1911.06636).
- Emergent behaviors such as gaze direction, coordinated gaze–limb movement, and compositionality (multipotent skills) reflect strategies observed in animal motor control.
- Robustness to perturbation and context-dependent adaptation are informed by active sensing and memory-equipped policy modules.
Comparison to human studies shows that recent robotic approaches can match or even exceed human consistency and accuracy in dynamic throwing tasks under controlled conditions, pointing toward robots taking on demanding dynamic manipulation roles in open environments.
In sum, prehensile whole-body throwing blends advanced hierarchical learning, trajectory optimization, model-based and residual control, robust object interaction, and biological inspiration. It enables robots to achieve general, accurate, and robust dynamic object delivery to arbitrary targets, laying a foundation for scalable, adaptive, and embodiment-agnostic manipulation capabilities in real-world settings.