[feature not live] docs(merge-queue): add Testing Duration chart to metrics page #647
samgutentag wants to merge 3 commits into
Documents the new Testing Duration chart in the Merge Queue Health tab, including the available stat measures and the Outcome / Cycle Ended In filter dropdowns. Notes the separate bucketing behavior from the other health charts.

Source: trunk-io/trunk2#3919

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Merging to
After your PR is submitted to the merge queue, this comment will be automatically updated with its status. If the PR fails, failure details will also be posted here.
Claude finished @samgutentag's task in 2m 18s
Review summary
Reviewed
Inline comments posted (all non-blocking):
Strengths
Tasks
Verification status (2026-05-12): Eng merged but flag off in prod. Hold off.
Wait for the flag to be enabled in prod, then re-run
Folds in the strongest pieces from PR #649 so it can be closed without losing work:
- Adds the CI-vs-queue-wait-time framing line
- Wraps the restart caveat in a {% hint style="info" %} block (matches the page's existing pattern)
- Adds a worked filter example combining Outcome and Cycle ended in
- Cross-links to the Time in queue section
- Promotes the stat measures table under its own #### subheading
- Drops em dashes per house style

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Claude finished @samgutentag's task in 1m 30s
Review summary
Reviewed
A few non-blocking nits, mostly carryovers from the 2026-05-09 review plus one introduced by the new commit:
Carried over (still unaddressed):
New in
Other small things:
Strengths
Tasks
- Clarify "the cycle" as "the testing cycle" on first reference
- Rewrite Interrupted vs Cancelled filter rows with proto-defined distinctions (Interrupted: run cut short, cycle may continue; Cancelled: cycle ends mid-test)
- Harmonize the Time in queue stat-measures table with the new Testing duration table (cleaner column layout, no trailing periods, "value below which" phrasing)

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Claude finished @samgutentag's task in 1m 20s
Re-review after

| # | Item | Status |
|---|---|---|
| 1 | Stat-measures table style — harmonize Time in queue (L98–106) with Testing duration (L148–156) | ✅ Fixed. Both tables now use the un-padded layout, no trailing periods, and the "value below which N% … fall" phrasing. |
| 2 | "the cycle" introduced before being defined (L110) | ✅ Fixed. Now reads "when the testing cycle reaches its final state". |
| 3 | Interrupted vs Cancelled indistinguishable (L129–130) | ✅ Fixed with proto-grounded wording. The "cycle may continue" vs "cycle ends" distinction is exactly what a reader needs to pick the right filter. |
| 4 | No figure for the new chart | ⏸ Deferred — appropriate; capture once displayMergeHealthTestDuration is on in prod. |
| 5 | "Cycles cancelled before tests ran to completion" reads awkwardly (L130) | ✅ Fixed as part of #3 — now "Cycles cancelled mid-test". |
Still-open nits (non-blocking)
- L142: American "canceled" inside a section that uses British "Cancelled". "…without noise from canceled or failed runs." sits four lines below the Cancelled filter row at L139. Sam clarified earlier that the British spelling matches the proto enum / UI label, so the filter rows are correct. The prose at L142, however, isn't referring to a UI label, just describing the data, and would read more consistently as "cancelled or failed runs". Same-paragraph spelling mix is the kind of thing that catches a reader's eye even when the page-wide mix doesn't.
- L110: uppercase TESTING. Stylistically heavier than the lowercase state names used at L92 ("merged, failing, or being canceled"). Fine if it's matching a specific UI string / Prometheus label; worth a quick sanity check otherwise.
- L126 vs L136: "All Outcomes" vs "All". If the UI labels genuinely differ between the two dropdowns, leave as-is; otherwise consider matching for symmetry.
Strengths preserved from prior commits
- Cross-chart bucketing footgun called out at L110 + the hint block at L114–116.
- CI-vs-queue-wait-time framing at L112.
- Worked filter example at L142 (Passed + Merged → clean CI baseline).
- Proto-grounded Interrupted/Cancelled distinction is genuinely useful — readers won't have to guess.
Tasks
- Read current state of merge-queue/administration/metrics.md
- Diff against origin/main and review d1080f4
- Verify carried-over issues from prior reviews are addressed
- Check for any new issues introduced
- Post review summary
Branch: sam-gutentag/mq-testing-duration-chart
Summary
Source
Test plan
Generated by Claude Code