Phase 2: verify linux baseline + graduate bench-check soft gate to hard (#302 follow-up)

Follow-up to #302 / PR #416.

## Status after PR #416

- [x] `make bench-check` invokes both gates (% delta + absolute ceiling).
- [x] `bench-detectors-check` runs on push-to-main via `.github/workflows/bench.yml`.
- [x] Absolute ceiling array (`allocs_gate` in `scripts/bench-registry.sh`) hard-fails on breach.
- [x] Mutation-verified: lowering the HBM ceiling from 2 to 1 trips the gate.

## Remaining for Phase 2

1. **Verify that the linux/amd64 baseline matches the M1 numbers.** Initial ceilings were measured on an Apple M1 Max; the README claims a tolerance of +/- 1 alloc on linux/amd64 (in theory, the allocs/op reported by Go's `testing.B` does not depend on hardware). To confirm:
   - Capture the per-detector allocs/op output from the next 10 push-to-main `bench.yml` runs.
   - Compare the medians against `bench/detectors/baselines.json`.
   - If any detector consistently shifts by >= 2 allocs/op, re-baseline on Linux and update the JSON and the ceilings.
2. **Graduate the soft gate to hard.** Per `bench/detectors/README.md`:
   - N >= 10 consecutive PRs with no flake-driven false positives.
   - Per-detector allocs/op coefficient of variation (CV) < 1% across those 10 PRs.
   - Once both hold, flip `$gate_mode` in `baselines.json` from `soft` to `hard`, and change the final `exit 0` in `scripts/bench-check-detectors.sh` to `exit "$status"`.
3. **(Optional) Capture cross-run CV from CI artifacts.** Have `bench.yml` upload the bench output as an artifact on each run; a small script can then compute the rolling CV across the last 10 runs so that the graduation decision rests on measured data.
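
The two numeric checks above (median shift against the baseline, and per-detector CV across the last 10 runs) could be sketched roughly as follows. This is only an illustration: it assumes each run has already been parsed into a `{detector: allocs_per_op}` dict and that the baselines are available in the same shape, which may not match the real layout of `baselines.json`. The function names are hypothetical, and the choice of population standard deviation for CV is an assumption. The "no flake-driven false positive" criterion still needs a human review of the PR history.

```python
from statistics import mean, median, pstdev


def shifted_detectors(runs, baselines, threshold=2):
    """Return {detector: delta} where the median allocs/op across `runs`
    differs from the baseline by at least `threshold` allocs/op.

    `runs` is a list of {detector: allocs_per_op} dicts (assumed shape).
    `baselines` is a {detector: allocs_per_op} dict (assumed shape).
    """
    shifted = {}
    for name, base in baselines.items():
        samples = [run[name] for run in runs if name in run]
        if not samples:
            continue  # detector never reported; nothing to compare
        delta = median(samples) - base
        if abs(delta) >= threshold:
            shifted[name] = delta
    return shifted


def coefficient_of_variation(samples):
    """Population standard deviation divided by the mean."""
    m = mean(samples)
    if m == 0:
        # allocs/op is non-negative, so a zero mean means every sample is 0.
        return 0.0
    return pstdev(samples) / m


def ready_for_hard_gate(runs, min_runs=10, max_cv=0.01):
    """True if the last `min_runs` runs all report every detector and each
    detector's CV over that window is below `max_cv` (1% by default)."""
    if len(runs) < min_runs:
        return False
    window = runs[-min_runs:]
    detectors = set().union(*window)
    for name in detectors:
        if any(name not in run for run in window):
            return False  # incomplete data for this detector
        if coefficient_of_variation([run[name] for run in window]) >= max_cv:
            return False
    return True
```

A script along these lines could read the uploaded artifacts, call `shifted_detectors` to decide whether a Linux re-baseline is needed, and call `ready_for_hard_gate` to report whether the CV criterion is met.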

## Out of scope

Per-detector hot-path optimization PRs (track separately as `bench-ratchet:` issues per detector).

