Vendor-complete characterization of CLI grep vs. vector performance under increasing distraction
Determine, across provider-native CLI harnesses (Claude Code, Codex CLI, and Gemini CLI), how the accuracy of grep-based lexical retrieval changes relative to vector-based semantic retrieval as the number of distractor sessions increases, under matched session-limit configurations (e.g., s5, s10, s20, s30, full) on the LongMemEval subset.
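One way to make "vendor-complete" concrete is to treat the results as a grid of harness x retrieval method x session cap and require every cell to be filled before comparing trends. The sketch below is only an illustration under that assumption: the names `missing_cells` and `grep_minus_vector`, the tuple-keyed `results` dictionary, and the use of a plain accuracy difference as the comparison are hypothetical choices, not the paper's tooling or metric. It contains no measured values.

```python
from itertools import product

# Grid axes taken from the problem statement; the cap labels follow the
# examples given there (s5, s10, s20, s30, full).
HARNESSES = ("Claude Code", "Codex CLI", "Gemini CLI")
METHODS = ("grep", "vector")
CAPS = ("s5", "s10", "s20", "s30", "full")


def missing_cells(results):
    """Return every (harness, method, cap) cell with no recorded accuracy.

    `results` is a hypothetical dict mapping (harness, method, cap) to an
    accuracy in [0, 1]; absent keys and None values both count as missing.
    """
    return [cell for cell in product(HARNESSES, METHODS, CAPS)
            if results.get(cell) is None]


def grep_minus_vector(results, harness):
    """Per-cap accuracy gap (grep minus vector) for one harness.

    A cap maps to None when either side is missing, so incomplete rows
    cannot silently contribute to a trend.
    """
    gaps = {}
    for cap in CAPS:
        grep_acc = results.get((harness, "grep", cap))
        vector_acc = results.get((harness, "vector", cap))
        if grep_acc is None or vector_acc is None:
            gaps[cap] = None
        else:
            gaps[cap] = grep_acc - vector_acc
    return gaps
```

With an empty `results` dictionary, `missing_cells` lists all 30 cells (3 harnesses x 2 methods x 5 caps). A vendor-complete comparison in this sense would require that list to be empty before the per-cap gaps are read as a scaling trend.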
References
Finally, incomplete rows (Codex vector intermediates; no Codex grep scaling row yet) mean we cannot yet state a vendor-complete picture of how "CLI grep" ages with distraction relative to "CLI vector" under matched caps.
— Is Grep All You Need? How Agent Harnesses Reshape Agentic Search
(2605.15184 - Sen et al., 14 May 2026) in Section 4.2.4 (Experiment 2: Discussion)