DAB v1
Release-gate reliability out of 100%.
The benchmark for agent memory systems
Competitive benchmark coverage across reliability, conversational memory, and scale. Quaid’s release gate lives here beside published and estimated peer results.
Grouped bar charts compare Quaid against peer memory systems across the benchmarks that decide whether a memory layer is real or just context stuffing.
Release-gate reliability out of 100%.
Published and measured dialogue-memory scores.
Extreme-scale memory performance from 100K to 10M tokens.
Estimated values show an asterisk in tooltips. Pending, not-benchmarked, and n/a states render as ghost bars at 0. GBrain is included from the AI Heroes benchmark, May 2026, which reports an 8.3x win over qmd on 150 real questions. The reported setup uses cloud-based enrichment and is not airgapped; numeric DAB, LoCoMo, LongMemEval, and BEAM scores remain pending until reproducible runs are published.
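The display rules in the note above can be sketched in a few lines. This is only an illustration, not the page's actual chart code: the `Bar` type, the `render_bar` function, the `"published"` state label, and the example system name and score are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional

# States that the note says are drawn as ghost bars at 0.
GHOST_STATES = {"pending", "not-benchmarked", "n/a"}


@dataclass
class Bar:
    height: float  # value plotted on the chart
    ghost: bool    # True when drawn as a placeholder bar
    tooltip: str   # text shown on hover


def render_bar(system: str, state: str, score: Optional[float] = None) -> Bar:
    """Map one result to a bar (hypothetical helper, not the site's code)."""
    if state in GHOST_STATES:
        # No usable number: draw a ghost bar at 0 and show the state.
        return Bar(height=0.0, ghost=True, tooltip=f"{system}: {state}")
    if score is None:
        raise ValueError(f"state {state!r} needs a numeric score")
    # Estimated values carry an asterisk in the tooltip.
    marker = "*" if state == "estimated" else ""
    return Bar(height=score, ghost=False, tooltip=f"{system}: {score:.1f}%{marker}")


# Illustrative values only; these are not benchmark results.
print(render_bar("ExampleSystem", "estimated", 72.5))
print(render_bar("ExampleSystem", "pending"))
```

Keeping the ghost states in one set means a result with no reproducible number can never be plotted as a measured score by accident.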
Release-gate scoring and thresholds.
Conversational memory benchmark details.
LongMemEval setup and per-type breakdown.
Extreme-scale memory benchmark methodology.
Version charts and the last 10 published runs.
Latest Quaid release-gate runs. Full trend charts live on the history page.
Latest MS MARCO snapshot: P@5 (precision at 5) 17.4% / R@5 (recall at 5) 38.9%.
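For readers unfamiliar with the two retrieval metrics, here is a minimal sketch of the textbook definitions for a single query. The page does not say how the snapshot was aggregated across queries, so the function name, the document IDs, and the inputs below are illustrative assumptions rather than the benchmark's own harness.

```python
from typing import Iterable, Sequence, Tuple


def precision_recall_at_k(
    ranked_ids: Sequence[str], relevant_ids: Iterable[str], k: int = 5
) -> Tuple[float, float]:
    """Precision@k and recall@k for one query (standard definitions).

    Assumes ranked_ids contains no duplicate IDs.
    """
    relevant = set(relevant_ids)
    top_k = ranked_ids[:k]
    hits = sum(1 for doc_id in top_k if doc_id in relevant)
    precision = hits / k  # share of the top k that is relevant
    recall = hits / len(relevant) if relevant else 0.0  # share of relevant items found
    return precision, recall


# Hypothetical query: 2 of the top 5 results are among 3 relevant documents.
ranked = ["d3", "d7", "d1", "d9", "d4"]
relevant = {"d1", "d2", "d3"}
p, r = precision_recall_at_k(ranked, relevant, k=5)
print(f"P@5 = {p:.1%}, R@5 = {r:.1%}")
```

Precision divides by k while recall divides by the number of relevant documents, which is why the two figures in a snapshot can differ widely.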