Papers
Topics
Authors
Recent
Search
2000 character limit reached

Augmented HQP for Collaborative Robotics

Updated 9 April 2026
  • Augmented HQP is a real-time control framework for human-robot collaboration that hierarchically integrates vision-based action recognition, adaptive workspace constraints, and ergonomic optimization.
  • The framework simultaneously optimizes robot joint velocities and end-effector targets, ensuring precise, posture-friendly, and intuitive task execution.
  • Empirical results demonstrate improved operator usability, reduced workload, and enhanced task performance in tool handover and cooperative manipulation scenarios.

Augmented Hierarchical Quadratic Programming (AHQP) is a real-time, lexicographic multi-task control framework for human-robot collaboration that hierarchically integrates vision-based human action recognition, adaptive workspace soft constraints, and ergonomics optimization. AHQP adapts the standard "stack-of-tasks" hierarchical quadratic programming (HQP) paradigm by simultaneously optimizing both robot joint velocities and desired end-effector (EE) velocities, enabling fine-grained, ergonomic, and sociable robot behavior during collaborative tasks. This formulation allows intuitive human guidance of robotic systems, seamless online adaptation of shared workspaces, and direct incorporation of posture-friendly motion, as demonstrated in tasks such as tool handover, cooperative manipulation, and human-following mobile manipulation (Tassi et al., 2022).

1. Mathematical Structure and Optimization Hierarchy

The AHQP framework is grounded in the HQP formalism, which solves a stack of pp prioritized quadratic programs for a generic decision variable χRs\chi \in \mathbb{R}^s, subject at each priority level k=1,pk=1\,\ldots,p to distinct costs and constraints:

minχ  12Akχbk2 s.t.C1χd1,  ,  Ckχdk, s.t.E1χ=f1,  ,  Ekχ=fk,\begin{aligned} &\min_{\chi}\; \tfrac12\|\mathbf{A}_k\,\chi - \mathbf{b}_k\|^2\ &\text{s.t.}\quad \mathbf{C}_1\chi\le d_1,\;\dots,\;\mathbf{C}_k\chi\le d_k,\ &\phantom{\text{s.t.}\quad}\mathbf{E}_1\chi= f_1,\;\dots,\;\mathbf{E}_k\chi= f_k, \end{aligned}

with Ak\mathbf{A}_k, bk\mathbf{b}_k defining the cost for the kk-th task, Ci\mathbf{C}_i, did_i the inequality constraints, and Ei\mathbf{E}_i, χRs\chi \in \mathbb{R}^s0 as equality constraints. Enforcing strict prioritization requires lower-priority solutions to lie in the nullspace of higher-priority tasks, operationalized by stacking equality constraints from superior levels.

A central innovation in AHQP lies in the design of the augmented decision variable:

χRs\chi \in \mathbb{R}^s1

where χRs\chi \in \mathbb{R}^s2 are the χRs\chi \in \mathbb{R}^s3-DoF robot joint velocities and χRs\chi \in \mathbb{R}^s4 are the desired end-effector velocities simultaneously optimized online. This enables a tight coupling of robot kinematics with workspace and ergonomics objectives (Tassi et al., 2022).

2. Task Levels and Cost Functions

AHQP organizes control objectives into a lexicographically ordered stack of three core levels:

  1. Primary Task: Closed-Loop Inverse Kinematics (CLIK) The top-priority QP enforces a closed-loop inverse-kinematics objective over χRs\chi \in \mathbb{R}^s5. With χRs\chi \in \mathbb{R}^s6 the robot Jacobian, χRs\chi \in \mathbb{R}^s7 the actual EE pose, and χRs\chi \in \mathbb{R}^s8 the previous desired pose, the cost is

χRs\chi \in \mathbb{R}^s9

where k=1,pk=1\,\ldots,p0 is the proportional gain and k=1,pk=1\,\ldots,p1 the integration step.

  1. Secondary Task: Soft Constraint on Shared Workspace (HRSW) To allow dynamic deformation of the human-robot shared workspace, the EE goal k=1,pk=1\,\ldots,p2 is constrained via a slackened box constraint

k=1,pk=1\,\ldots,p3

where k=1,pk=1\,\ldots,p4 is a slack vector penalized in a second-level QP

k=1,pk=1\,\ldots,p5

  1. Tertiary Task: Ergonomics Optimization Ergonomics is addressed by fitting a lightweight Cartesian map k=1,pk=1\,\ldots,p6 to approximate REBA-based human comfort scores from hand data, resulting in a quadratic tertiary cost

k=1,pk=1\,\ldots,p7

with k=1,pk=1\,\ldots,p8 and k=1,pk=1\,\ldots,p9 obtained offline from human demonstrations.

The solver proceeds lexicographically, at each level projecting the candidate solution into the nullspace of higher-priority tasks to ensure strict task ordering (Tassi et al., 2022).

3. Augmentation Relative to Standard HQP

Standard HQP for inverse kinematics fixes minχ  12Akχbk2 s.t.C1χd1,  ,  Ckχdk, s.t.E1χ=f1,  ,  Ekχ=fk,\begin{aligned} &\min_{\chi}\; \tfrac12\|\mathbf{A}_k\,\chi - \mathbf{b}_k\|^2\ &\text{s.t.}\quad \mathbf{C}_1\chi\le d_1,\;\dots,\;\mathbf{C}_k\chi\le d_k,\ &\phantom{\text{s.t.}\quad}\mathbf{E}_1\chi= f_1,\;\dots,\;\mathbf{E}_k\chi= f_k, \end{aligned}0 a priori from a trajectory generator, optimizing minχ  12Akχbk2 s.t.C1χd1,  ,  Ckχdk, s.t.E1χ=f1,  ,  Ekχ=fk,\begin{aligned} &\min_{\chi}\; \tfrac12\|\mathbf{A}_k\,\chi - \mathbf{b}_k\|^2\ &\text{s.t.}\quad \mathbf{C}_1\chi\le d_1,\;\dots,\;\mathbf{C}_k\chi\le d_k,\ &\phantom{\text{s.t.}\quad}\mathbf{E}_1\chi= f_1,\;\dots,\;\mathbf{E}_k\chi= f_k, \end{aligned}1 exclusively. AHQP augments this structure by treating minχ  12Akχbk2 s.t.C1χd1,  ,  Ckχdk, s.t.E1χ=f1,  ,  Ekχ=fk,\begin{aligned} &\min_{\chi}\; \tfrac12\|\mathbf{A}_k\,\chi - \mathbf{b}_k\|^2\ &\text{s.t.}\quad \mathbf{C}_1\chi\le d_1,\;\dots,\;\mathbf{C}_k\chi\le d_k,\ &\phantom{\text{s.t.}\quad}\mathbf{E}_1\chi= f_1,\;\dots,\;\mathbf{E}_k\chi= f_k, \end{aligned}2 as an optimization variable, yielding several key modifications:

  • The CLIK cost incorporates minχ  12Akχbk2 s.t.C1χd1,  ,  Ckχdk, s.t.E1χ=f1,  ,  Ekχ=fk,\begin{aligned} &\min_{\chi}\; \tfrac12\|\mathbf{A}_k\,\chi - \mathbf{b}_k\|^2\ &\text{s.t.}\quad \mathbf{C}_1\chi\le d_1,\;\dots,\;\mathbf{C}_k\chi\le d_k,\ &\phantom{\text{s.t.}\quad}\mathbf{E}_1\chi= f_1,\;\dots,\;\mathbf{E}_k\chi= f_k, \end{aligned}3 directly in the first-level QP.
  • The slack cost on minχ  12Akχbk2 s.t.C1χd1,  ,  Ckχdk, s.t.E1χ=f1,  ,  Ekχ=fk,\begin{aligned} &\min_{\chi}\; \tfrac12\|\mathbf{A}_k\,\chi - \mathbf{b}_k\|^2\ &\text{s.t.}\quad \mathbf{C}_1\chi\le d_1,\;\dots,\;\mathbf{C}_k\chi\le d_k,\ &\phantom{\text{s.t.}\quad}\mathbf{E}_1\chi= f_1,\;\dots,\;\mathbf{E}_k\chi= f_k, \end{aligned}4 explicitly softens workspace boundaries, allowing adaptive shaping of the shared human-robot region in real time.
  • The ergonomics cost is handled directly within the same hierarchical stack, removing the need for external planners.

This unification enables real-time integration of kinematic consistency, workspace adaptation, and ergonomic optimization within a single lexicographic QP loop (Tassi et al., 2022).

4. Vision-Driven Task Adaptation and Human-Robot Interaction

AHQP tightly couples external vision modules with hierarchical control via:

  • Object-Surface Classification:

A ResNet+SVM classifier on RGB images provides a binary output minχ  12Akχbk2 s.t.C1χd1,  ,  Ckχdk, s.t.E1χ=f1,  ,  Ekχ=fk,\begin{aligned} &\min_{\chi}\; \tfrac12\|\mathbf{A}_k\,\chi - \mathbf{b}_k\|^2\ &\text{s.t.}\quad \mathbf{C}_1\chi\le d_1,\;\dots,\;\mathbf{C}_k\chi\le d_k,\ &\phantom{\text{s.t.}\quad}\mathbf{E}_1\chi= f_1,\;\dots,\;\mathbf{E}_k\chi= f_k, \end{aligned}5; during collaborative states (recognized via action detection), this constraint dynamically restricts EE orientation by updating minχ  12Akχbk2 s.t.C1χd1,  ,  Ckχdk, s.t.E1χ=f1,  ,  Ekχ=fk,\begin{aligned} &\min_{\chi}\; \tfrac12\|\mathbf{A}_k\,\chi - \mathbf{b}_k\|^2\ &\text{s.t.}\quad \mathbf{C}_1\chi\le d_1,\;\dots,\;\mathbf{C}_k\chi\le d_k,\ &\phantom{\text{s.t.}\quad}\mathbf{E}_1\chi= f_1,\;\dots,\;\mathbf{E}_k\chi= f_k, \end{aligned}6.

  • Action Recognition:

SlowOnly@ResNet50, pretrained on Kinetics-400, infers a class confidence vector minχ  12Akχbk2 s.t.C1χd1,  ,  Ckχdk, s.t.E1χ=f1,  ,  Ekχ=fk,\begin{aligned} &\min_{\chi}\; \tfrac12\|\mathbf{A}_k\,\chi - \mathbf{b}_k\|^2\ &\text{s.t.}\quad \mathbf{C}_1\chi\le d_1,\;\dots,\;\mathbf{C}_k\chi\le d_k,\ &\phantom{\text{s.t.}\quad}\mathbf{E}_1\chi= f_1,\;\dots,\;\mathbf{E}_k\chi= f_k, \end{aligned}7. If the "start walk" action is top-ranked, the HRSW window shifts adaptively along direction minχ  12Akχbk2 s.t.C1χd1,  ,  Ckχdk, s.t.E1χ=f1,  ,  Ekχ=fk,\begin{aligned} &\min_{\chi}\; \tfrac12\|\mathbf{A}_k\,\chi - \mathbf{b}_k\|^2\ &\text{s.t.}\quad \mathbf{C}_1\chi\le d_1,\;\dots,\;\mathbf{C}_k\chi\le d_k,\ &\phantom{\text{s.t.}\quad}\mathbf{E}_1\chi= f_1,\;\dots,\;\mathbf{E}_k\chi= f_k, \end{aligned}8, modifying workspace bounds as

minχ  12Akχbk2 s.t.C1χd1,  ,  Ckχdk, s.t.E1χ=f1,  ,  Ekχ=fk,\begin{aligned} &\min_{\chi}\; \tfrac12\|\mathbf{A}_k\,\chi - \mathbf{b}_k\|^2\ &\text{s.t.}\quad \mathbf{C}_1\chi\le d_1,\;\dots,\;\mathbf{C}_k\chi\le d_k,\ &\phantom{\text{s.t.}\quad}\mathbf{E}_1\chi= f_1,\;\dots,\;\mathbf{E}_k\chi= f_k, \end{aligned}9

where Ak\mathbf{A}_k0 is a step parameter.

  • 3D Human-Hand Tracking:

OpenPose processes RGB-D frames to estimate human hand positions Ak\mathbf{A}_k1. Assuming Ak\mathbf{A}_k2, the ergonomics cost Ak\mathbf{A}_k3 can be evaluated directly online.

This perception pipeline allows for instantaneous, vision-driven adaptation of both workspace and ergonomic constraints, enabling sociable human-commanded robot behavior (Tassi et al., 2022).

5. End-to-End Algorithmic Workflow

AHQP's real-time control stack, as implemented on the MOCA platform (Franka Panda with mobile base), follows this loop at approximately 1 kHz:

  1. Acquire camera streams (RGB and RGB-D).
  2. Classify object surfaces via ResNet+SVM to determine Ak\mathbf{A}_k4.
  3. Extract hand positions Ak\mathbf{A}_k5 using OpenPose keypoints.
  4. Recognize human actions using SlowOnly@ResNet50, yielding Ak\mathbf{A}_k6.
  5. Decode collaborative state and direction; update shared workspace constraints and EE goals accordingly.
  6. Assemble Level 1 cost (Ak\mathbf{A}_k7, Ak\mathbf{A}_k8) with joint and EE limits.
  7. Define Level 2 cost (Ak\mathbf{A}_k9) and constraints (bk\mathbf{b}_k0, bk\mathbf{b}_k1).
  8. Add Level 3 ergonomics cost (bk\mathbf{b}_k2).
  9. Solve the lexicographic QP for optimal bk\mathbf{b}_k3 and bk\mathbf{b}_k4. 10. Issue bk\mathbf{b}_k5 to robot control (impedance law) and update bk\mathbf{b}_k6 with bk\mathbf{b}_k7.

This pipeline enables fluid, closed-loop adaptation to human intent and environment while prioritizing both task performance and user comfort (Tassi et al., 2022).

6. Empirical Validation and Performance

The AHQP framework demonstrates the following quantitative results on the MOCA platform:

  • Object-surface classification: 100% accuracy with ResNet50+SVM over a 2,000-image benchmark.
  • Action recognition: 86.55% accuracy on the HRI30 dataset with SlowOnly@ResNet50 pretrained on Kinetics-400.
  • Ergonomics in tool-handover: AHQP maintains a human ergonomics score bk\mathbf{b}_k8 versus 4.2 without ergonomics optimization.
  • Iterative workspace adaptation: During “follow-the-human” trials, cumulative REBA-based scores remain below 1.0 with AHQP and rise above 3.5 without ergonomics.
  • Operator usability: NASA-TLX surveys reveal reduced mental and physical load, and improved perceived performance, for users assisted by AHQP's ergonomics optimization.

Collectively, these results highlight AHQP's effectiveness in fusing action-recognition signals, adaptive workspace soft constraints, and ergonomics into a unified real-time controller, yielding fluid and sociable human-robot interaction while promoting user comfort and trust in automation (Tassi et al., 2022).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Augmented HQP (AHQP).