How Sipcode compares to the field

Honest, anonymized side-by-side. Methodology below. Names available on request.

Capability	Sipcode	Tool A	Tool B	Tool C	Tool D
Reproducible benchmark anyone can verify	✓ 62.6% on locked 20-task corpus, range 37.4 to 80.6%	—	~40% (single anecdote, n=1)	99% theoretical ceiling, not a measured floor	—
Mid-session install works without restart, no stale context	✓ Verified Warm-Fill (v1.6.15)	—	—	—	—
Zero false-dedup risk by construction (not just by tests)	✓	mtime-only (known false-positive risk)	mtime-only	undocumented	undocumented
Drift detection (context-rot signal per session)	✓ drift v2 with persistent baselines	—	—	—	—
Per-rewriter integrity scoring (kept %)	✓ since v1.6.8	—	—	—	—
Forecast / impact / today spend telemetry	✓ since v1.6.10	—	—	—	—
Network calls during normal use	0	varies	varies	varies	varies
Pricing	Free, MIT	Free	Free	Free	Free

Methodology

Audited via public GitHub repositories and shipped documentation on 2026-06-15.
Sipcode is one of the tools audited; you can re-run the audit on Sipcode itself by reading the source.
Where a capability is binary (does it exist or not), the cell shows ✓ or —.
Where a capability has a measured value, the cell shows the value and a brief caveat.
Tools are anonymized as A, B, C, D. We are not in the business of attacking other open-source projects. Anyone curious enough to install all five and run them will arrive at this table independently.