Sample Code Mower dashboard
A concrete preview of the kind of private team signal CodeMower.com is meant to show after opt-in metadata uploads.
Illustrative sample data
This page uses example numbers to show the product shape before sign-in. It is not a live cross-team cohort benchmark. Real dashboards use your team's own metadata first; cohort comparison becomes useful only as enough teams opt in.
The minimal OSS example for this same loop lives in examples/demo-calibration: one known-clean control, one known-blocked control, and a sample reviewer value report. This page shows the richer dashboard shape once a team has uploaded more history.
Calibration cases
24
Known-clean and known-blocked PRs used to judge reviewer behavior.
Reviewer runs
128
Structured reviewer and lens runs across the calibration corpus.
30-day spend
$42
Reported API or subscription spend from local metadata.
Avg latency
84s
Average reviewer runtime for recent structured audits.
codex-audit
Strong useful-rate with a low false-positive rate on the sample corpus.
gemini-cli / operability
High false-positive rate means this lane should stay advisory until calibrated further.
Selective triggers
Run Codex on every candidate PR; trigger Claude quality lens on backend/auth/data changes.
What this dashboard should help you decide
Who should review every risky PR?
Use codex-audit as the baseline lane.
It caught the known-blocked control and stayed quiet on the known-clean control.
Who should be selective?
Use Claude for matching backend/auth/data classes.
It added useful independent signal, but with higher cost and latency in this sample.
Who should stay informational?
Keep the experimental lens out of branch protection.
It was cheap, but it blocked a clean control and missed a blocked control.
Reviewer value table
| Lane | Lens | Useful rate | False positives | Cost / run | Latency | Recommendation |
|---|---|---|---|---|---|---|
| codex-audit | base | 82% | 9% | $0.42 | 71s | Merge-gating eligible after one more clean cycle |
| claude-audit | context-driven-quality | 74% | 14% | $0.61 | 96s | Selective trigger for higher-risk PRs |
| gemini-cli | operability | 38% | 31% | $0.08 | 52s | Informational until more calibration data exists |
Why contribute metadata?
The local OSS tool already gives you private reports. Opt-in cloud sharing adds team history, dashboard rows, token management, export/delete controls, and a place to compare reviewer value over time without keeping every terminal artifact in someone's laptop. Cross-team comparisons come later, after enough teams opt in for the data to be honest.
Ready to try the local path? Follow the setup guide. Want the OSS source first? Open GitHub.