fix: deduplicate token metrics by message.id to prevent multi-counting by KorenKrita · Pull Request #462 · sirmalloc/ccstatusline

KorenKrita · 2026-06-17T07:24:30Z

Problem

Claude Code writes one JSONL transcript entry per content block within a single API response. A typical assistant turn with thinking + text + tool_use produces 2–4 JSONL rows, each carrying:

The same message.id
A full copy of the usage snapshot (identical input_tokens, output_tokens across all rows)
A non-null stop_reason (e.g. "tool_use" or "end_turn")

Both getTokenMetrics and collectSpeedMetricsFromLines sum every assistant row that has a usage field without checking whether multiple rows belong to the same API response. This causes token counts and speed metrics to be inflated by 1.4–2.5× depending on the average number of content blocks per turn.

The existing stop_reason-based dedup in getTokenMetrics does not help because all rows from the same response carry the same non-null stop_reason.

Measured impact (real transcripts)

Metric	Before fix	After fix	Inflation factor
`requestCount`	114	61	1.87×
`outputTokens`	69,720	29,888	2.33×
`outTps`	85.5 t/s	36.6 t/s	2.34×

Verified across 20 models with 50k+ assistant rows — all affected (1.36×–2.46× depending on model's tendency to produce multi-block responses).

Fix

Deduplicate by message.id in both code paths:

collectSpeedMetricsFromLines (speed metrics): maintain a Map<messageId, SpeedRequest>. First row per id seeds the request with usage and interval. Subsequent rows for the same id only extend interval.endMs to the turn's final timestamp. Rows without a message.id fall back to per-row counting.
getTokenMetrics (token totals): single-pass filter that applies both the existing stop_reason check and a Set<messageId> dedup, keeping only the first row per id.
TranscriptLine type: add optional id?: string to message.

Testing

All 1,589 existing tests pass
Verified against real transcripts: deduplicated requestCount matches the actual number of unique turns (confirmed via message.id grouping)
Smoke-tested full statusline rendering with patched build

Claude Code writes one JSONL transcript entry per content block (thinking, text, tool_use) within a single API response. Each entry carries the same message.id and a full copy of the usage snapshot with a non-null stop_reason. Both getTokenMetrics and collectSpeedMetricsFromLines were summing every assistant row without deduplication, causing token counts and speed metrics to be inflated by 1.4-2.5x depending on the number of content blocks per turn. Fix: dedupe by message.id in both paths. For speed metrics, later rows for the same id only extend the interval end timestamp. Rows without a message.id fall back to per-row counting for backwards compatibility.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: deduplicate token metrics by message.id to prevent multi-counting#462

fix: deduplicate token metrics by message.id to prevent multi-counting#462
KorenKrita wants to merge 1 commit into
sirmalloc:mainfrom
KorenKrita:fix/speed-dedup-message-id

KorenKrita commented Jun 17, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

KorenKrita commented Jun 17, 2026

Problem

Measured impact (real transcripts)

Fix

Testing

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant