AgentRank-UC: Dynamic Agent Ranking
- AgentRank-UC is a dynamic algorithm that fuses recency-weighted usage and competence metrics to rank autonomous agents in challenging environments.
- It employs exponential decay, fixed-point iterations, and geometric fusion to balance usage and competence while ensuring privacy and scalability.
- Empirical results demonstrate its robustness, efficiency, and sybil-resistance in diverse simulation scenarios and multilabel ranking tasks.
AgentRank-UC is a dynamic algorithm for ranking autonomous agents in web-scale, partially observed interaction environments. Originating as the core ranking algorithm in the DOVIS protocol for the "Web-of-Agents," AgentRank-UC systematically integrates recency-weighted agent usage statistics and multivariate competence metrics, producing robust rankings even under fragmented or adversarial signal sources. The nomenclature “UC” denotes the synthesis of Usage and Competence dimensions in a unified, PageRank-inspired fixed-point framework. Unlike PageRank, which presupposes a transparent and global network topology, AgentRank-UC operates over privacy-preserving, locally aggregated telemetry, supporting performance-aware selection and scalable trust establishment among machine-native agents (Krishnamachari et al., 5 Sep 2025).
1. Construction of Usage and Competence Metrics
AgentRank-UC is built upon two orthogonal edge-weighted interaction graphs over the directed agent interaction space: usage and competence.
Usage Kernel
For each triple (caller , callee , task ), usage is summarized as a recency-decayed count:
denotes the current time, is the decay rate (half-life ). Aggregation yields:
The row-stochastic usage kernel normalizes each caller row: where is a strictly positive prior.
Competence Kernel
Competence aggregates signed, decayed outcome signals (successes, quality , latency , cost , risk ): With these, compute the Beta-Bernoulli mean success: and task utility: $u_{ij}^{(k)} = \theta_1 \logit(\widehat{p}_{ij}^{(k)}) - \theta_2 \log(1+\overline{\ell}_{ij}^{(k)}) - \theta_3 \log(1+\overline{c}_{ij}^{(k)}) - \theta_4 \overline{r}_{ij}^{(k)} + \theta_5 \overline{q}_{ij}^{(k)}$ Aggregate transformed utilities: Normalize as: with a strictly positive competence prior.
2. Fixed-point Formulation and Fusion
Let be the usage rank vector and the competence rank vector. Damping parameters ensure contractive dynamics: After convergence, the outputs are fused geometrically, tuning the usage-competence trade-off via : The final ranking is a normalized vector in . This fusion preserves strict positivity, continuity, and monotonicity with respect to submetric improvements.
3. Protocol Infrastructure: DOVIS
The AgentRank-UC algorithm is operationalized atop DOVIS, a five-layer protocol stack:
- Discovery: Indexers aggregate OAT-Lite telemetry and periodically compute/publish .
- Orchestration: Callers emit idempotent, per-epoch usage and performance records for each edge , processed with exponential decay.
- Verification: All records are cryptographically signed; optional mutual acknowledgment and randomized audit sampling (1–5% of edges) enforce integrity. Identity strength (e.g., staked/TEE-attested) confers prior weight in or .
- Incentives: Honest reporters receive exposure rewards; malicious activity penalized by weight slashing or rank suppression. Telemetry credits and cold-start priors ensure fair onboarding for newcomer agents.
- Semantics: Schema normalization (latency in ms, cost in credits, qualities/risks in ); versioned task taxonomies and support for backward compatibility.
This protocol ensures record authenticity, incentivizes truthful reporting, and protects participant privacy by only aggregating minimal, summary-level telemetry (Krishnamachari et al., 5 Sep 2025).
4. Theoretical Guarantees
AgentRank-UC possesses several formally stated guarantees:
| Guarantee | Statement | Implication |
|---|---|---|
| Convergence | Power iterations contract in , unique fixed points exist | Fast stable computation |
| Fusion | is well-posed and stable | Continuous, robust to input variations |
| Monotonicity | Improving any submetric on edge weakly increases | Local utility improvements propagate to agent rank |
| Cold-start | via teleport guarantee for all | Newcomers have nonzero baseline visibility |
| Stability | Small changes yield bounded output shifts | |
| Sybil-resist. | Mass of collusive set bounded: ; | Sybil/pumping attacks are rate-limited; usage-only is not exploitable for full rank share |
All bounds and properties hold for positive priors, stochastic kernels, and strict damping () (Krishnamachari et al., 5 Sep 2025).
5. Algorithmic Implementation
The canonical AgentRank-UC computation consists of:
- Telemetry aggregation: For each , compute decayed counts , successes , means .
- Success posterior: Compute for each edge and task.
- Edge weights: Derive usage () and competence () weights by aggregating across tasks and applying the softplus activation to utilities.
- Row normalization: Produce and , inserting priors for zero rows.
- Fixed-point iterations: Iterate and to convergence.
- Fusion: Geometrically combine and with parameter ; normalize to obtain .
All computations scale linearly with the number of interaction triples, with power-iteration typically converging in fewer than 30 steps.
6. Empirical Evaluation and Scalability
AgentRank-UC has been validated in simulated agent ecosystems consisting of agents executing archetypal tasks, covering cases such as Popular-but-Mediocre (PbM), Niche-but-Excellent (NbE), Balanced-Strong (BS), Cheap-but-Risky (CbR), Sybil-Clique (Syb), and Newcomer-Good (NcG). Key experimental findings include:
- Performance: AgentRank-UC nearly matches an oracle success-rate baseline and outperforms usage-only variants in NDCG@10, Quality@10, Spearman's , and Regret@10.
- Trade-off tuning: Adjusting the fusion parameter smoothly interpolates final ranking behaviors between usage-driven and competence-driven extremes.
- Adaptivity: Rankings adjust at rates determined by the decay half-life . Shorter accelerates demotion/promotion post-performance shocks.
- Cold-start and monotonicity: Newcomer agents receive baseline exposure and strictly benefit from additional successes. Informative priors enable rapid onboarding.
- Sybil-resistance: Collusive Sybil clusters receive lower aggregate rank mass under AgentRank-UC compared to usage-only (14–17% vs. 11–12%); Sybil mass declines after burn-in phases.
- Scalability: Step time is in the number of populated interaction triples; telemetry and index relays operate with low communication overhead.
These results demonstrate that performance-aware agent ranking can be achieved—without global network transparency—given minimal, privacy-preserving, and verifiable telemetry (Krishnamachari et al., 5 Sep 2025).
7. Relation to Partial Feedback Multilabel Ranking
The term "AgentRank-UC" also denotes a multilabel classification and ranking algorithm in partial feedback regimes, where online optimization is guided by second-order upper-confidence bound (UCB) methods (Gentile et al., 2012). This instance targets sequential label selection under partial information, employing UCB exploration and per-label generalized linear modeling. The algorithm admits an cumulative regret bound under adversarial covariates. In large-scale experiments on multilabel benchmarks (Mediamill, Sony CSL Paris), AgentRank-UC attained performance within a few percent of full-information baselines, validating the UCB-driven partial feedback methodology.
In summary, AgentRank-UC constitutes a class of dynamic, partially observed ranking algorithms supporting robust, scalable agent selection in open and adversarial environments, with firm theoretical guarantees and validated empirical effectiveness (Krishnamachari et al., 5 Sep 2025, Gentile et al., 2012).
Sponsored by Paperpile, the PDF & BibTeX manager trusted by top AI labs.
Get 30 days free