Asymmetric Assistive Reasoning

Updated 4 July 2026

Asymmetric Assistive Reasoning is a design principle where assistive agents use richer internal models to execute heavy inference while humans provide high-level guidance.
It integrates probabilistic graphical models, symbolic logic, and multimodal techniques to selectively support actions under confidence, safety, or audit constraints.
Applications in teleoperation, robotics, and diagnostic systems demonstrate improved task efficiency, reduced workload, and maintained human agency.

to=arxiv.search 彩票招商_code=json {"^{^{^{^{^{^{^{^4query4}}}}}}} Assistive Reasoning\"^{^{^{^{^{^{^⁴}}}}}} OR ti:\4^{^{^{^{^{^{^{^"Asymmetric}}}}}}} Assistive Reasoning\"^{^{^{^{^{^{^⁴}}}}}} OR abs:\4^{^{^{^{^{^{^{^"asymmetric}}}}}}} assistive reasoning\"","max_results":^{^{^{^{4all:\4query4^{^{^{^{,"sort_by":"submittedDate","sort_order":"descending"}ുവനന്തപുരം}}}}}}}} to=arxiv.search 天天爱彩票{"^{^{^{^{^{^{^{^4query4}}}}}}} reasoning\" AND (asymmetric OR asymmetry)","max_results":^{^{^{^{4all:\4query4^{^{^{^{,"sort_by":"submittedDate","sort_order":"descending"}ുവനന്തപുരം}}}}}}}} Asymmetric assistive reasoning can be understood as a class of assistive inference and decision-making systems in which observation, world modeling, intervention authority, and verification are intentionally unevenly distributed across agents or subsystems. In this pattern, the assistant typically maintains a richer internal model, performs heavier latent inference, or holds privileged information, while the human supplies sparse commands, partial observations, or high-level goals; assistance is then applied selectively, often under confidence, safety, or audit constraints rather than uniformly (&&&^{^{^{^{4query4^{^{^{^&&&,}}}}}}} &&&^{^{^{^{4all:\4^{^{^{^&&&,}}}}}}} &&&^{^{^⁴}} OR ti:\4^{^{^{^&&&,}}} &&&^{^{^⁴}} OR abs:\4^{^{^{^&&&).}}} Across probabilistic graphical models, shared autonomy, multimodal robotics, and post-training of vision-language systems, the recurring objective is not symmetry of reasoning burden, but structured asymmetry that preserves usefulness, tractability, and, in many settings, human agency.

^{^{^{^{4all:\4^{^{^{^.}}}}}}} Conceptual organization

The literature converges on several distinct but related forms of asymmetry. One concerns informational access: a robot or assistant may observe latent structure, full environment state, or a formal task model unavailable to the human collaborator. A second concerns inferential burden: the assistant performs Bayesian inference, commonsense completion, or optimization over latent variables, whereas the human need not model the assistant’s internal state. A third concerns authority allocation: interventions are confidence-gated, user-confirmed, or audit-constrained rather than continuously dominant. A fourth concerns verifiability: some systems explicitly distinguish between internal reasoning and externally checkable artifacts (&&&^{^{^{^{4all:\4^{^{^{^&&&,}}}}}}} Neau et al., 2022, &&&^{^{^⁴}} OR abs:\4^{^{^{^&&&).}}}

Paradigm	Asymmetric element	Representative formalism
Bayesian multinets	Local structure depends on hypothesis subset	PRESERVED_PLACEHOLDER_^{^{^{^{4query4^{^{^{^}}}}}}} (&&&^{^{^{^{4query4^{^{^{^&&&)}}}}}}}
Assistive teleoperation	Latent intended action differs from measured action	PRESERVED_PLACEHOLDER_^{^{^{^{4all:\4^{^{^{^}}}}}}} vs. PRESERVED_PLACEHOLDER_^{^{^⁴}} OR ti:\4^{^{^{^}}} (&&&^{^{^{^{4all:\4^{^{^{^&&&)}}}}}}}
Empowerment-based assistance	Assistant observes actions but does not infer reward	PRESERVED_PLACEHOLDER_^{^{^⁴}} OR abs:\4^{^{^{^}}} (&&&^{^{^⁴}} OR ti:\4^{^{^{^&&&)}}}
Explanation-aware coordination	Proposer knows private state; validator audits typed claims	$r=(c,t)$ (&&&^{^{^⁴}} OR abs:\4^{^{^{^&&&)}}}

This organization shows that the asymmetry is structural rather than tied to a single algorithmic family. In some papers the asymmetry is between hypotheses or contexts inside a probabilistic model; in others it is between a human and an assistive robot; in still others it is between a training-time teacher and a deployment-time student. A plausible implication is that asymmetric assistive reasoning is best viewed as a design principle for allocating modeling complexity and decision rights where they are most useful.

^{^{^⁴}} OR ti:\4^{^{^{^.}}} Probabilistic, symbolic, and graph-based foundations

A foundational formalization appears in work on asymmetric independence in probabilistic reasoning. Standard Bayesian networks encode independencies that must hold uniformly across all configurations of a conditioning set, but many assistive settings require context-specific assertions such as $X \perp Y \mid H=h_0$ without requiring the same independence for other values of $H$ . Bayesian multinets address this by replacing a single DAG with local networks $D_i$ over cells $A_i$ of a hypothesis partition, each encoding $P(U \mid h \in A_i)$ . Inference then computes

PRESERVED_PLACEHOLDER_^{^{^{^{4all:\4query4^{^{^{^}}}}}}}

where PRESERVED_PLACEHOLDER_^{^{^{^{4all:\4all:\4^{^{^{^}}}}}}} is evaluated in the local network whose cell contains PRESERVED_PLACEHOLDER_^{^{^{^4all:\4}}} OR ti:\4^{^{^{^.}}} The paper also defines ordinary similarity networks using a connected cover of hypotheses and proves recoverability of the full joint distribution when no hypothesis has zero prior probability. In the secured-building example, hypothesis-specific independence PRESERVED_PLACEHOLDER_^{^{^{^4all:\4}}} OR abs:\4^{^{^{^,}}} subset independence PRESERVED_PLACEHOLDER_^{^{^{^{4all:\44^{^{^{^,}}}}}}} and a reduction in parameter count from ^{^{^{^{4all:\4all:\4^{^{^{^}}}}}}} to 9 are used to illustrate how asymmetric structure improves both elicitation and inference (&&&^{^{^{^{4query4^{^{^{^&&&).}}}}}}}

Knowledge-driven socially assistive robotics extends this logic from probability structure to symbolic preference and explanation structure. In medication sorting, the robot maintains a Preference Reasoner, a Hint Engine, and an Explanation Synthesizer. Preferences are stored as first-order predicates such as PRESERVED_PLACEHOLDER_^{^{^{^{4all:\4^{^{^{^5,}}}}}}} while an HTN domain model decomposes the task and incorporates those predicates into precondition checks. Assistance is triggered when plan mismatch and a need model indicate that help is required; explanation generation then constructs a causal chain from local plan steps, preferences, and commonsense facts. The contribution is explicitly transparency-oriented, but it remains a proof of concept without a large-scale user study or quantitative metrics (&&&^{^{^{^4all:\4}}} OR ti:\4^{^{^{^&&&).}}}

A third line uses scene graphs and commonsense enrichment to detect implicit need for help without explicit commands. The pipeline begins with Scene-Graph Generation, enriches the graph with ConceptNet and possible ATOMIC links, scores graph nodes by sentiment PRESERVED_PLACEHOLDER_^{^{^{^{4all:\4^{^{^{^6,}}}}}}} and computes scene risk as

PRESERVED_PLACEHOLDER_^{^{^{^{4all:\4^{^{^⁷}}}}}}

If the risk exceeds a dynamic threshold PRESERVED_PLACEHOLDER_^{^{^{^{4all:\4^{^{^{^8,}}}}}}} the system concludes that an implicit need for help has been triggered; assistive-action generation then iteratively expands the graph with positively connoted affordances and selects

PRESERVED_PLACEHOLDER_^{^{^{^{4all:\4^{^{^⁹}}}}}}

Because this design uses symbolic graph completion rather than probabilistic belief tracking, it directly exposes a limitation noted by the authors: uncertainty quantification is absent, and robust ontology alignment between visual scene graphs and large commonsense graphs remains open (Neau et al., 2022).

^{^{^⁴}} OR abs:\4^{^{^{^.}}} Shared control, intent inference, and autonomy allocation

In assistive teleoperation, asymmetry is often instantiated as a distinction between what the user meant to do and what the interface measured. One explicit formulation introduces latent intended interface action PRESERVED_PLACEHOLDER_^{^{^⁴}} OR ti:\4query4^{^{^{^,}}} measured interface action PRESERVED_PLACEHOLDER_^{^{^⁴}} OR ti:\4all:\4^{^{^{^,}}} intended task primitive PRESERVED_PLACEHOLDER_^{^{^⁴}} OR ti:\4 OR ti:\4^{^{^{^,}}} and human internal state PRESERVED_PLACEHOLDER_^{^{^⁴}} OR ti:\4 OR abs:\4^{^{^{^.}}} The posterior over task primitives is computed as

PRESERVED_PLACEHOLDER_^{^{^⁴}} OR ti:\44^{^{^{^}}}

Assistance is applied only when PRESERVED_PLACEHOLDER_^{^{^⁴}} OR ti:\4^{^{^⁵}} and PRESERVED_PLACEHOLDER_^{^{^⁴}} OR ti:\4^{^{^{^6.}}} The system supports Filtering Assistance, which blocks the command, and Corrective Assistance, which replaces it with the inferred one. In a ^{^{^{^{4all:\4query4^{^{^{^-person}}}}}}} study with ^{^{^⁴}} OR abs:\4sort_by4query4^{^{^{^}}} trials, corrective assistance reduced completion time, final pose error, mode switches, NASA-TLX workload, and frustration while improving success rate and user satisfaction; corrective assistance was preferred in intuitiveness, helpfulness, and ease with average Likert PRESERVED_PLACEHOLDER_^{^{^⁴}} OR ti:\4^{^{^⁷}} (&&&^{^{^{^{4all:\4^{^{^{^&&&).}}}}}}}

Open-world assistive teleoperation pushes the asymmetry further by letting the robot infer diverse intents from sparse human snippets and then execute parameterized skills. Casper combines open-world perception, VLM-based candidate generation, VLM-based intent selection over annotated visual histories, self-consistency confidence, and a skill library of eight primitives. Confirmation occurs only when repeated VLM calls agree above threshold PRESERVED_PLACEHOLDER_^{^{^⁴}} OR ti:\4^{^{^{^8,}}} after which the robot executes a skill such as pick_up_object, pour_object, push_open_door, or go_to_landmark. In human studies with PRESERVED_PLACEHOLDER_^{^{^⁴}} OR ti:\4^{^{^⁹}} on a TIAGo mobile manipulator, Casper achieved task success rate PRESERVED_PLACEHOLDER_^{^{^⁴}} OR abs:\4query4^{^{^{^}}} versus Full Teleop PRESERVED_PLACEHOLDER_^{^{^⁴}} OR abs:\4all:\4^{^{^{^,}}} HAT PRESERVED_PLACEHOLDER_^{^{^⁴}} OR abs:\4 OR ti:\4^{^{^{^,}}} and RBII PRESERVED_PLACEHOLDER_^{^{^⁴}} OR abs:\4 OR abs:\4^{^{^{^,}}} with lower workload and higher user satisfaction than the baselines (&&&^{^{^{^{4all:\4^{^{^{^5&&&).}}}}}}}

Driving assistance offers a different asymmetry: the planner acts as a Stackelberg leader and the human driver as a bounded-rational follower. A dynamic game formulation with mixed leader strategy, quantal-response follower, and meta-learning over driver utilities is used to construct shared control. The follower does not choose at every time step; instead, a decision indicator PRESERVED_PLACEHOLDER_^{^{^⁴}} OR abs:\44^{^{^{^}}} controls when the driver intervenes, while the planner always acts. The meta-learned utility prior is then adapted from a small amount of driver data, yielding a receding-horizon assistance scheme that helps the driver reach the target destination, saves driving time relative to driver-only control, and is robust to bounded rationality and errors (&&&^{^{^{^{4all:\4^{^{^{^6&&&).}}}}}}}

A common misconception is that asymmetry implies one-way instruction. Embodied leader-follower navigation under privileged information shows the opposite. In the AI^{^{^⁴}} OR ti:\4^{^{^{^-THOR}}} framework with a knowledgeable Leader and sensor-limited Follower, the Leader has access to full object coordinates, while the Follower sees only objects within PRESERVED_PLACEHOLDER_^{^{^⁴}} OR abs:\4^{^{^⁵}} and a PRESERVED_PLACEHOLDER_^{^{^⁴}} OR abs:\4^{^{^⁶}} field of view. The reported Success Gap,

PRESERVED_PLACEHOLDER_^{^{^⁴}} OR abs:\4^{^{^⁷}}

shows that nearly half of feasible plans fail because the Leader’s instructions are not grounded in the Follower’s perspective. A pull-enabled protocol in which the Follower actively requests clarification is more robust than push-only guidance; successful episodes have PRESERVED_PLACEHOLDER_^{^{^⁴}} OR abs:\4^{^{^⁸}} the frequency of clarification requests (&&&^{^{^{^{4all:\4^{^{^{^7&&&).}}}}}}} This directly reframes assistive asymmetry as a problem of belief alignment, not merely delegated control.

4. Multimodal embodied assistance and diagnostic evaluation

Recent multimodal assistive systems increasingly decompose asymmetry across specialized agents rather than only between human and machine. MARS divides smart-home assistance into a Visual Perception Agent, a Risk Assessment and Reasoning Agent, a Planning and Task Allocation Agent, and an Evaluation and Optimization Agent. Risk assessment computes obstacle ratio, urgency, severity, and a combined score

PRESERVED_PLACEHOLDER_^{^{^⁴}} OR abs:\4^{^{^⁹}}

which is then converted into a priority index $r=(c,t)$ ^{^{^{^{4query4^{^{^{^.}}}}}}} Evaluation scores candidate plans on Assistance UX, Task Efficiency, Transparency, and Ethical & Social Alignment, feeds the weakest dimension back to planning, and iterates until a threshold is met. Under both AI and human expert ranking, MARS achieved the best overall performance, with average ranking $r=(c,t)$ ^{^{^{^{4all:\4^{^{^{^}}}}}}} versus the next best $r=(c,t)$ ^{^{^⁴}} OR ti:\4^{^{^{^;}}} ablations showed the steepest drops when Agent ^{^{^{^{4all:\4^{^{^{^}}}}}}} or Agent ^{^{^⁴}} OR ti:\4^{^{^{^}}} was removed (&&&^{^{^{^{4all:\4^{^{^{^8&&&).}}}}}}}

StretchBot applies a neuro-symbolic version of the same idea to short guided stretching sessions. Perception combines YOLOv8n, MediaPipe Pose, DeepFace, sentiment analysis, and speech recognition; reasoning combines a knowledge-graph retriever with an LLM planner and a verifier; execution is restricted to a constrained action vocabulary such as NEXT_EXERCISE, POINT_<OBJECT>, and STOP_ROUTINE. Emotion is fused by

$r=(c,t)$ ^{^{^⁴}} OR abs:\4^{^{^{^}}}

while pose monitoring uses rule-based geometric checks. In a within-subjects pilot with three participants, the adaptive condition improved perceived adaptability and object relevance, whereas the scripted condition remained competitive in comfort, trust, and naturalness; the authors explicitly position the results as exploratory and call for larger, longitudinal studies (&&&^{^{^{^{4all:\4^{^{^{^9&&&).}}}}}}}

Evaluation papers emphasize that current MLLM competence in assistive settings remains uneven. NetraLink benchmarks nine MLLMs on scene text, currency, navigation, multilingual menu reading, and long-form storybook QA using egocentric data from a head-mounted GoPro. FastVLM-7B reaches ANLS $r=(c,t)$ 4 on scene text, but the best ANLS on book QA is only $r=(c,t)$ 5, and latency ranges from $r=(c,t)$ 6 to $r=(c,t)$ 7; the paper states that assistive use demands $r=(c,t)$ 8 round-trip and highlights failure under motion blur, occlusion, low light, mixed scripts, and cross-page reasoning (&&&^{^{^⁴}} OR ti:\4query4^{^{^{^&&&).}}} CyclingVQA reaches a similar conclusion from a cyclist-centric perspective: on ^{^{^⁴}} OR ti:\4^{^{^{^{,^{^{^{^{4query4query4^{^{^⁹}}}}}}}}}} QA pairs from 695 real-world images, Gemini-^{^{^⁴}} OR ti:\4^{^{^{^.5-Flash}}} scores $r=(c,t)$ 9, Qwen^{^{^⁴}} OR abs:\4^{^{^{^-VL-8B}}} $X \perp Y \mid H=h_0$ ^{^{^{^{4query4^{^{^{^,}}}}}}} and several driving-specialized VLMs underperform strong generalist models, indicating limited transfer from vehicle-centric training to cyclist-assistive reasoning (&&&^{^{^⁴}} OR ti:\4all:\4^{^{^{^&&&).}}} Together these studies show that multimodal assistive asymmetry is currently bottlenecked not only by reasoning but by perception, grounding, and latency.

5. Verification, privileged information, and training-time asymmetry

One strand of the literature relocates asymmetry from deployment-time control to the structure of explanation and learning itself. In explanation-aware coordination under asymmetric information, an informed Proposer submits an action plus a reasoning artifact $X \perp Y \mid H=h_0$ ^{^{^{^{4all:\4^{^{^{^,}}}}}}} where $X \perp Y \mid H=h_0$ ^{^{^⁴}} OR ti:\4^{^{^{^}}} is a set of typed claims and $X \perp Y \mid H=h_0$ ^{^{^⁴}} OR abs:\4^{^{^{^}}} is short text; a Validator then audits claims with probability $X \perp Y \mid H=h_0$ 4 and budget $X \perp Y \mid H=h_0$ 5. The mechanism creates a “cost of silence”: in ambiguous, borderline cases, absent a verifiable artifact, conservative rejection causes approval and welfare collapse. With artifacts, approval remains $X \perp Y \mid H=h_0$ 6 across all $X \perp Y \mid H=h_0$ 7 with bad-approval $X \perp Y \mid H=h_0$ 8; under High Temptation $X \perp Y \mid H=h_0$ 9, bad-approval remains $H$ ^{^{^{^{4query4^{^{^{^}}}}}}} with $H$ ^{^{^{^{4all:\4^{^{^{^,}}}}}}} and even $H$ ^{^{^⁴}} OR ti:\4^{^{^{^}}} preserves bad-approval $H$ ^{^{^⁴}} OR abs:\4^{^{^{^}}} (&&&^{^{^⁴}} OR abs:\4^{^{^{^&&&).}}} For assistive systems, this shifts the emphasis from persuasive free text to partially verifiable reasoning.

Training-time asymmetry appears explicitly in DARC and AMVL. DARC decouples Questioner training from Solver training, then uses asymmetric self-distillation: a document-augmented teacher with access to $H$ 4 generates majority-voted pseudo-labels for a student that only sees the question $H$ 5. The Questioner is conditioned on explicit target difficulty $H$ 6, and the final method yields an average improvement of $H$ 7 points across nine reasoning benchmarks (&&&^{^{^⁴}} OR ti:\4 OR abs:\4^{^{^{^&&&).}}} AMVL introduces an inference-time prior $H$ 8 and a training-time posterior $H$ 9, then uses a forward KL to fit the prior and a reverse KL to regularize the posterior against “answer leakage” and “prior contamination.” On BLINK, AMVL improves the average score by $D_i$ ^{^{^{^{4query4^{^{^{^}}}}}}} and reports gains of up to $D_i$ ^{^{^{^{4all:\4^{^{^{^}}}}}}} on individual tasks (&&&^{^{^⁴}} OR ti:\44^{^{^{^&&&).}}} These are not assistive systems in the narrow HRI sense, but they formalize a recurring asymmetry: privileged training signals may assist learning only if they do not contaminate deployment-time inference.

Optimization studies on multimodal reasoning make the same point from another angle. A controlled analysis of VLM post-training shows a persistent perception-reasoning asymmetry: in supervised fine-tuning, token imbalance makes perception only $D_i$ ^{^{^⁴}} OR ti:\4^{^{^{^}}} to $D_i$ ^{^{^⁴}} OR abs:\4^{^{^{^}}} of tokens, and reweighting or NGDiff boosts end-to-end performance by up to $D_i$ 4; in reinforcement learning, outcome rewards correlate more strongly with reasoning than with perception, and perception-aware rewards improve end-to-end accuracy by up to $D_i$ 5, with surrogate rewards yielding gains of $D_i$ 6 points (&&&^{^{^⁴}} OR ti:\4^{^{^{^5&&&).}}} APO makes the asymmetry explicit at the policy-update level: DADS reduces KL penalty on hard but correct samples, STCR penalizes overlong incorrect trajectories, and the resulting View-R^{^{^{^{4all:\4^{^{^{^{-^{^{^⁴}}}}}}}}}} OR abs:\4^{^{^{^B}}} reports an average $D_i$ 7 gain over the base model while maintaining general-task performance (&&&^{^{^⁴}} OR ti:\4^{^{^{^6&&&).}}} Domain sequencing under GRPO is also asymmetric and order-sensitive: training on other domains improves math reasoning by approximately $D_i$ 8 accuracy, while transfer to logic and puzzle is negligible; moreover, math $D_i$ 9science yields $A_i$ ^{^{^{^{4query4^{^{^{^}}}}}}} on math/science, whereas science $A_i$ ^{^{^{^{4all:\4^{^{^{^math}}}}}}} drops to $A_i$ ^{^{^⁴}} OR ti:\4^{^{^{^}}} (&&&^{^{^⁴}} OR ti:\4^{^{^{^7&&&).}}} For assistive systems built on such models, these results imply that internal optimization asymmetries can materially shape downstream assistive competence.

6. Limitations, misconceptions, and open problems

A persistent misconception is that asymmetry necessarily reduces human autonomy. The surveyed systems do not support that reading. In unintended-interface teleoperation, the autonomy intervenes only if the inferred command differs from the measured command and entropy is below $A_i$ ^{^{^⁴}} OR abs:\4^{^{^{^}}} (&&&^{^{^{^{4all:\4^{^{^{^&&&).}}}}}}} In Casper, the robot proposes an intent but waits for user confirmation before skill execution (&&&^{^{^{^{4all:\4^{^{^{^5&&&).}}}}}}} In StretchBot, the robot makes proactive suggestions yet “always awaits user consent,” with a scripted fallback available (&&&^{^{^{^{4all:\4^{^{^{^9&&&).}}}}}}} Empowerment-based assistance goes further by refusing to infer or optimize a latent reward at all; instead it maximizes the human’s influence over future states, preserving freedom over which future to pursue (&&&^{^{^⁴}} OR ti:\4^{^{^{^&&&).}}}

The harder limitations concern modeling assumptions and robustness. Bayesian multinets and similarity networks assume connected covers and, in the standard theory, strictly positive distributions; hypothesis nodes are often treated as roots to avoid arc-reversal complications (&&&^{^{^{^{4query4^{^{^{^&&&).}}}}}}} Commonsense help detection assumes reliable perception and sufficient ConceptNet or ATOMIC coverage, and the current design lacks uncertainty quantification (Neau et al., 2022). Empowerment-based assistance assumes shared state representation, fixed or slowly adapting human policy, and can conflict with specific human goals in principle (&&&^{^{^⁴}} OR ti:\4^{^{^{^&&&).}}} NetraLink shows that current MLLMs remain brittle under motion blur, occlusion, varied lighting, and multilingual scripts, with unacceptable failures in navigation and long-form reading (&&&^{^{^⁴}} OR ti:\4query4^{^{^{^&&&).}}} MARS notes continued struggles with dynamic scenes and fine-grained multi-objective conflict resolution without dedicated human oversight (&&&^{^{^{^{4all:\4^{^{^{^8&&&).}}}}}}}

Open problems are correspondingly concrete. Several papers call for interactive clarification and uncertainty-aware dialogue rather than silent failure or hallucinated confidence (&&&^{^{^⁴}} OR ti:\4query4^{^{^{^&&&).}}} Partial observability and multi-human settings remain unresolved in empowerment-based assistance (&&&^{^{^⁴}} OR ti:\4^{^{^{^&&&).}}} The embodied ^{^{^{^{^{^{^{^4query4}}}}}}} work suggests that active uncertainty reduction should be a first-class mechanism, not a fallback (&&&^{^{^{^{4all:\4^{^{^{^7&&&).}}}}}}} Symbolic-neural systems call for richer user preference models, better latent context modeling, and real-time deployment with live trials (Neau et al., 2022, &&&^{^{^{^{4all:\4^{^{^{^9&&&).}}}}}}} A plausible implication is that future asymmetric assistive reasoning will depend less on a single monolithic assistant than on calibrated combinations of context-specific models, active ^{^{^{^{^{^{^{^4query4}}}}}}} verifiable artifacts, and multimodal perception modules whose limitations are explicitly exposed rather than ignored.