Papers
Topics
Authors
Recent
Search
2000 character limit reached

SMART: Adaptive and Semantic AI Systems

Updated 12 April 2026
  • SMART is an acronym for adaptive, self-aware frameworks employing AI/ML across LLM inference, robotics, hardware design, and semantic web services.
  • It encompasses methodologies like cost-efficient model selection, meta-reasoning for tool utilization, and robust multi-agent coordination with formal performance guarantees.
  • SMART systems offer scalable solutions with proven benefits such as significant cost reduction, improved tool efficiency, and enhanced energy performance in real-world applications.

SMART is an acronym recurrently used across disparate domains to denote methods, architectures, or platforms characterized by self-adaptation, semantic awareness, strategic reasoning, or integration of advanced AI/ML techniques. Prominent instantiations of SMART frameworks appear in LLM deployment, multi-agent systems, hardware design, human action recognition, and semantic web service orchestration. The following article synthesizes the key technical schemas and contributions of foundational SMART systems across arXiv-indexed literature.

1. Adaptive Model Selection in NLP: SMART for Cost-Efficient LLM Inference

SMART (Scaling Models Adaptively for Reduced Token Fees) defines a framework for reducing LLM inference costs under formal accuracy constraints by profiling, selecting, and combining multiple available LLMs. Given a powerful "oracle" model m\overline m (e.g., GPT-4) and lower-cost LLMs {mi}\{m_i\}, SMART enables users to specify an error budget δ[0,1]\delta\in[0,1] and confidence level γ(0,1)\gamma\in(0,1). It guarantees, with probability at least γ\gamma, that the fraction of disagreements between the selected output and the reference is at most δ\delta:

P(fm(x)fm(x))δ  with confidence  γ.P(f_m(x) \neq f_{\overline m}(x)) \le \delta \;\text{with confidence}\;\gamma.

SMART implements a profiling phase based on binomial estimation (Clopper–Pearson intervals) to determine which mim_i are δ,γ\langle\delta,\gamma\rangle-equivalent to the oracle. To minimize total processing cost, the framework introduces early termination of profiling using expected cost modeling:

Cost(k)=k(m.c+imi.c)+(nk)i(j<i(1pj))pimi.c+(nk)(i(1pi))m.c\mathrm{Cost}(k) = k(\overline m.c + \sum_i m_i.c) + (n-k) \sum_{i} \left( \prod_{j<i}(1-p_j) \right)p_i m_i.c + (n-k)\left( \prod_i (1-p_i)\right) \underline m.c

where {mi}\{m_i\}0 is the probability {mi}\{m_i\}1 will be validated after {mi}\{m_i\}2 more trials and {mi}\{m_i\}3 denotes the cheapest validated model.

Three inference strategies are then available: (i) always use the cheapest validated model; (ii) leverage early termination for reduced profiling; (iii) solve a cost-minimization MILP to assign fractional loads across multiple validated models while enforcing the refined accuracy constraint. Empirical results on IMDB, SMS-Spam, and AG News datasets demonstrate cost reductions up to {mi}\{m_i\}4 compared to exclusive oracle usage, with theoretical accuracy guarantees always met in nearly all runs (Jo et al., 2024).

2. Metacognitive Tool Use in LLM Agents: Strategic Model-Aware Reasoning with Tools

SMART—Strategic Model-Aware Reasoning with Tools—addresses the meta-reasoning gap in LLM agents that leads to unnecessary external tool invocations (termed "Tool Overuse") even when tasks are solvable within the model’s own parametric knowledge. Drawing inspiration from human metacognition, SMART introduces self-awareness signals into the agent’s workflow to minimize superfluous tool calls and optimize computational consumption.

The supporting dataset SMART-ER comprises 3000+ multi-domain questions decomposed into reasoning chains with explicit stepwise rationales justifying the necessity of each tool call. SMARTAgent models, fine-tuned with LoRA on this dataset, prepend reasoning-stage decision tokens (e.g., [[Reasoning]], [[Search]], [[AskUser]]) at every inference step, enabling explicit confidence-based calibration of internal versus tool-based reasoning.

Empirical results indicate that, across domains and models, SMARTAgent achieves 24% reduction in tool usage and over 37% accuracy improvement. Significantly, a 7B-parameter SMARTAgent can match GPT-4o-70B performance in tasks involving nontrivial tool usage patterns, with robust generalization documented on GSM8K and MINTQA benchmarks (Qian et al., 17 Feb 2025).

3. SMART in Multi-Agent Robotics, Learning, and Planning

Several platforms and frameworks leverage the SMART designation for scalable, adaptive, or semantically informed multi-agent robotics:

  • SMART-LLM orchestrates the translation of natural language instructions into skill-level, executable multi-robot plans by sequentially applying LLM-driven task decomposition, coalition formation, and parallelizable scheduling, demonstrating outperformance over random or rule-based allocation in both simulation and real-robot trials (Kannan et al., 2023).
  • SMART for Multi-Robot Reinforcement Learning provides an open platform comprising a ROS/Gazebo-based high-fidelity simulator, real-world robotic testbed, and unified RL-style API, supporting algorithmic research and benchmarking in scenarios with complex state, action, and reward structures. It exposes distinct challenges unique to real-world multi-robot RL, including robustness, sample inefficiency, and the sim-to-real gap (Liang et al., 2022).
  • SMART for Multi-Agent Path Finding (MAPF) introduces a physics-based, execution-aware C++ testbed. It realizes faithful robot kinematics, incorporates an Action Dependency Graph for robust execution monitoring, and achieves high scalability and realism for evaluating state-of-the-art MAPF algorithms in the context of real-world constraints and uncertainties (Yan et al., 3 Mar 2025).

4. SMART Architectures in Hardware and Perception

SMART also refers to heterogeneous hardware architectural schemes and action recognition systems characterized by the integration of domain-specific knowledge:

  • SFQ-Based CNN Accelerators: SMART is a heterogeneous scratchpad memory architecture augmenting shift-register (sequential) memory with CMOS–SFQ SRAM (random access) arrays, coordinated using ILP-based compilation for CNN mappings. It overcomes core limitations of prior shift-only designs, yielding up to {mi}\{m_i\}5 throughput and {mi}\{m_i\}6 energy reduction with negligible area penalty (Zokaee et al., 2021).
  • Scene-Motion-Aware Human Action Recognition: SMART combines skeleton-based motion perception with scene-aware features—specifically, human-scene interaction and 3D motion trajectory—integrated through a dedicated two-stage fusion network. This results in significant gains (up to {mi}\{m_i\}7 percentage points {mi}\{m_i\}8 on scene-generalization) for recognizing risky abnormal behaviors in mental health contexts (Lai et al., 2024).

5. Semantics and Integration in Web Services and Communication

SMART frameworks also address orchestration and integration from a semantics-driven or information-theoretic perspective:

  • Semantic Mashup of REST Services: The SMART platform merges OWL2 ontologies, SWRL-based reasoning, and query parsing to enable non-programmers to dynamically discover, invoke, and semantically compose RESTful web services, supporting on-the-fly chaining via automated inference of matching input/output types and relations (Chamoun, 2013).
  • Semantic Packet Aggregation for 6G-AIGC: SMART designates a two-tier approach—semantics-aware packet grouping (SemPA) and repeated transmission scheduling (SemRT)—to maximize semantic fidelity of text prompts transmitted over erasure channels for text-to-image generation. It optimizes expected CLIP-based text and image similarity scores under bit and latency constraints, using a blend of combinatorial and neural surrogate optimization (Lee et al., 31 Mar 2025).

6. Strategy Fusion in LLM-Driven Reasoning and Planning: SMaRT

SMaRT (Select, Mix, and ReinvenT) is a meta-framework enabling strategy fusion in LLM-driven workflows by aggregating outputs from diverse prompting methods (e.g., Chain-of-Thought, Program-aided Language, Direct) and prompting the LLM to serve as a cross-strategy integrator. The workflow consists of three stages: (1) Select—generate candidate solutions via distinct strategies; (2) Mix—LLM-fused solution synthesis through structured comparison and repair of candidate solutions; (3) ReinvenT—iterative, self-refining plan correction in sequential decision scenarios using failed trajectory summarization and exclusion. Empirical results show SMaRT consistently exceeds single-strategy and LLM-as-judge baselines across mathematical reasoning, ALFWorld, and planning benchmarks (Verma et al., 20 Oct 2025).

7. Broader Context and Extensions

Across infrastructure and smart building research, SMART or B-SMART paradigms delineate layered, autonomic architectures wherein real-time analytics, decision, and actuation stages are systematically decoupled. These are instantiated via digital twins, MAPE-K loops, and deep integration of ML/AI with edge/fog/cloud orchestration. For example, in B-SMART, closed-loop RL and real-time anomaly detection contribute to energy and emissions reductions in smart buildings by formalizing the division of functional layers and enabling continuous optimization (Genkin et al., 2022), while "smart infrastructure" in communications demands joint design of ML-in-the-loop sensing, agility, reliability, and virtualization to support IoT and urban-scale automation (Weitnauer et al., 2017).

Bibliography Table

SMART Use Case (short name) Core Domain/Technique arXiv ID
Adaptive LLM Inference Profiling, equivalence, mixed-model (Jo et al., 2024)
Metacognitive Tool Use (SMARTAgent, SMART-ER) LLM self-awareness, tool reasoning (Qian et al., 17 Feb 2025)
SMART-LLM Multi-agent robot task planning, LLM (Kannan et al., 2023)
SFQ CNN Systolic Accelerator (heterogeneous SPM) Superconductor hardware acceleration (Zokaee et al., 2021)
MRL Platform Multi-robot RL benchmarking (Liang et al., 2022)
MAPF Testbed Multi-agent pathfinding, physics sim (Yan et al., 3 Mar 2025)
Scene-Motion-Aware HAR Video, IoT, abnormal action detection (Lai et al., 2024)
Semantic Mashup Platform OWL2/SWRL, REST web service chaining (Chamoun, 2013)
AIGC Wireless Semantic Packetization Semantic-aware comm, T2I erasure (Lee et al., 31 Mar 2025)
Strategy Fusion for Reasoning (SMaRT) LLM output fusion, meta-reasoning (Verma et al., 20 Oct 2025)
Reference Smart Building (B-SMART) AI, RL, anomaly detection in buildings (Genkin et al., 2022)
Smart Infrastructure (generic) ML, SDN, IoT comm, resource mgmt (Weitnauer et al., 2017)

SMART, across these highly technical domains, generally denotes adaptive, self-aware, or semantically principled systems that address cost, generalization, or robustness constraints in complex AI- and ML-driven environments. Each instantiation is domain-specific, with rigorous quantitative or formal performance guarantees and a focus on scalable, composable, or self-optimizing architectures.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to SMART.