Qorx

Qorx Benchmark Report

Generated: 2026-05-03T12:34:00+00:00

Suite: live

Target: .

Qorx version: qorx 1.0.4-a.0

Git commit: b838c23

Summary

Metric	Value
Indexed local tokens	202986
Session visible tokens	69
Session reduction	2941.83x
Pack used tokens	484
Pack reduction	419.39x
Squeeze used tokens	419
Squeeze reduction	484.45x
Bench average reduction	400.60x
Strict task pass rate	100.0%
Expected refusal pass rate	100.0%
Agent provider calls	0

Strict Tasks

Question	Expected	Actual	Pass	Evidence	Used tokens
context fault proof pages resolver boundary	supported	supported	yes	3	380
galactic banana escrow treaty	not_found	not_found	yes	0	8

Bench Rows

Query	Used tokens	Omitted tokens	Reduction	Quarks
context fault proof pages resolver boundary	484	202502	419.39x	3
qorx carriers .qorx .qorxb qorx handle	511	202475	397.23x	3
strict answer refusal unsupported claims	527	202459	385.17x	4

Claim Notes

This benchmark uses Qorx local accounting only. Token counts are deterministic ceil(chars / 4) estimates unless the runtime reports another estimator. The report does not claim provider invoice savings, production throughput, or downstream model answer quality.

To reproduce:

python scripts/run-benchmark.py --target . --suite live --budget-tokens 600 --squeeze-budget-tokens 450 --query "context fault proof pages resolver boundary" --query "qorx carriers .qorx .qorxb qorx handle" --query "strict answer refusal unsupported claims" --supported-question "context fault proof pages resolver boundary" --unsupported-question "galactic banana escrow treaty" --agent-objective "prove context fault proof pages resolver boundary" --output-json docs/benchmarks/live.json --output-md docs/benchmarks/live.md

This site is open source. Improve this page.