Methodology

DAB v1

215-point regression gate. Runs on every Quaid release.

Scoring Breakdown

Each section is deterministic and release-blocking.

Name               Points  What it tests
Infrastructure     25      Install, version check, DB init, ingestion
FTS Search         25      Full-text search across 10 queries
Semantic Search    25      Vector similarity, paraphrase recall, 5 queries
MCP Server         25      JSON-RPC compliance, 20 tool calls
Performance        25      Latency targets: FTS <100 ms, semantic <500 ms
Data Integrity     25      Write atomicity, no loss under load
Retrieval Quality  30      P@5 and R@5 on FiQA 500-query corpus
Regression         35      Version consistency, schema migration
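
As a quick sanity check, the section weights in the table should add up to the 215-point total. The sketch below only re-adds the numbers listed above; it is an illustration and not part of the DAB harness itself.

```shell
# Section weights copied from the table, in order:
# Infrastructure, FTS Search, Semantic Search, MCP Server,
# Performance, Data Integrity, Retrieval Quality, Regression.
total=0
for p in 25 25 25 25 25 25 30 35; do
  total=$((total + p))
done
echo "$total"   # prints 215
```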

How To Run

MEMORY_CMD=quaid bash benchmarks/dab/run.sh

Thresholds

  • 215 = perfect
  • 193+ = passing
  • Below 193 = regression, release blocked
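
The thresholds can be expressed as a small shell function. This is a minimal sketch that assumes the final score is available as a plain integer; the function name `dab_gate_status` is hypothetical, and how `run.sh` actually reports or enforces the score is not specified here.

```shell
# Hypothetical helper: map an integer DAB score to a gate status.
# The cutoffs (215 and 193) come from the threshold list above.
dab_gate_status() {
  score=$1
  if [ "$score" -eq 215 ]; then
    echo "perfect"
  elif [ "$score" -ge 193 ]; then
    echo "passing"
  else
    echo "blocked"
  fi
}

dab_gate_status 213   # prints "passing"
dab_gate_status 192   # prints "blocked"
```

A release script could branch on this output, for example exiting non-zero when the status is "blocked".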

Current Score

213/215 (99%)