refactor(telemetry): OTel-format ids and explicit session id#935

Open
EhabY wants to merge 1 commit into telemetry/command-dispatcher from telemetry/schema-cleanup

Collaborator

@EhabY EhabY commented May 5, 2026

Follow-up to #934. Two schema rough edges showed up in the local JSONL output once command.invoked events were flowing:

  1. session_id was vscode.env.sessionId verbatim, which is a UUID concatenated with a ms timestamp (e.g. 0f465473-72c8-49d9-9b17-bed92cb4ed3a1777982179036). It looks like a malformed UUID and is awkward to grep.
  2. event_id and trace_id were UUIDv4 with hyphens, not the lowercase-hex form OTel uses, so a future exporter would need a translation layer.
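
For reference, here is a minimal sketch of the two quirks above. `splitLegacySessionId` and `uuidToOtelHex` are hypothetical helpers, not part of this PR, and the regex assumes the legacy value is always a lowercase UUID followed directly by decimal digits, as in the example above.

```typescript
// Hypothetical helpers for illustration only; neither exists in this PR.

// Assumed shape of the legacy value: a lowercase UUID immediately followed
// by a decimal millisecond timestamp.
const LEGACY_SESSION_ID = /^([0-9a-f]{8}(?:-[0-9a-f]{4}){3}-[0-9a-f]{12})(\d+)$/;

interface LegacySessionId {
  uuid: string;
  timestampMs: number;
}

// Splits a legacy session id into its UUID and timestamp parts, or returns
// undefined when the input does not have the assumed shape.
function splitLegacySessionId(raw: string): LegacySessionId | undefined {
  const match = LEGACY_SESSION_ID.exec(raw);
  if (!match) {
    return undefined;
  }
  return { uuid: match[1], timestampMs: Number(match[2]) };
}

// The kind of translation an exporter would need if ids stayed hyphenated:
// strip the hyphens and lowercase, yielding 32 hex characters.
function uuidToOtelHex(uuid: string): string {
  return uuid.replace(/-/g, "").toLowerCase();
}
```

A hyphen-stripped UUID only happens to have the width of a trace id (32 hex characters); there is no equally direct mapping to a 16-character span id, which is part of why generating ids in the target format is simpler.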

This PR:

  • Adds src/telemetry/ids.ts with newTraceId (16 bytes / 32 hex), newSpanId (8 bytes / 16 hex), and newSessionId (16 bytes / 32 hex). The trace and span id names and widths match OTel; the session id uses the same width as a trace id.
  • buildSession now takes sessionId as a parameter instead of reading vscode.env.sessionId, decoupling our schema from VS Code's quirk.
  • TelemetryService accepts sessionId in its constructor and forwards it to buildSession.
  • ServiceContainer generates one sessionId and threads it to both the JSONL sink (filename slug) and TelemetryService (event payload), so the on-disk filename and the session_id field always match.
  • service.ts: every crypto.randomUUID() becomes newSpanId / newTraceId.
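
A rough sketch of the shape described in the list above, not the actual diff. Only the three function names and their byte widths come from this description; the `randomHex` helper, the use of `randomBytes` from `node:crypto`, the `wireTelemetry` function, and the filename pattern are assumptions made for illustration.

```typescript
import { randomBytes } from "node:crypto";

// Assumed helper: n random bytes rendered as 2n lowercase hex characters.
function randomHex(byteLength: number): string {
  return randomBytes(byteLength).toString("hex");
}

// 16 bytes / 32 hex characters, the OTel trace id width.
export function newTraceId(): string {
  return randomHex(16);
}

// 8 bytes / 16 hex characters, the OTel span id width.
export function newSpanId(): string {
  return randomHex(8);
}

// 16 bytes / 32 hex characters.
export function newSessionId(): string {
  return randomHex(16);
}

// Hypothetical stand-in for the ServiceContainer wiring: the session id is
// generated once and the same value reaches both consumers.
interface TelemetryWiring {
  sinkFileName: string; // filename pattern is an assumption
  sessionIdForEvents: string;
}

function wireTelemetry(sessionId: string = newSessionId()): TelemetryWiring {
  return {
    sinkFileName: `telemetry-${sessionId}.jsonl`,
    sessionIdForEvents: sessionId,
  };
}
```

Because both consumers receive the same string, the filename slug and the session_id field cannot drift apart, which is the invariant the ServiceContainer change is after.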

trace_id stays on every event (including single-event "traces"). You can't know at emit time whether a phase child will follow, and a consistent schema is more valuable to consumers than the 32 hex characters it costs per event.
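
To make the trade-off concrete, here is a hedged sketch of what "trace_id on every event" could look like. The field names event_id, trace_id, and session_id come from this PR; `parent_event_id`, `emitRoot`, `emitChild`, and the `command.phase` event name are hypothetical and only serve the illustration.

```typescript
import { randomBytes } from "node:crypto";

// Assumed helper: n random bytes as lowercase hex.
const hexId = (byteLength: number): string => randomBytes(byteLength).toString("hex");

interface TelemetryEvent {
  event_id: string; // 16 hex characters
  trace_id: string; // 32 hex characters, always present
  parent_event_id?: string; // hypothetical link from a child to its root
  session_id: string;
  name: string;
}

// A root event always opens a trace, even if no child ever follows.
function emitRoot(sessionId: string, name: string): TelemetryEvent {
  return { event_id: hexId(8), trace_id: hexId(16), session_id: sessionId, name };
}

// A later phase child reuses the trace_id its root already carries, so the
// root never has to be rewritten after the fact.
function emitChild(parent: TelemetryEvent, name: string): TelemetryEvent {
  return {
    event_id: hexId(8),
    trace_id: parent.trace_id,
    parent_event_id: parent.event_id,
    session_id: parent.session_id,
    name,
  };
}

const root = emitRoot(hexId(16), "command.invoked");
const child = emitChild(root, "command.phase"); // hypothetical event name
```

Consumers can then group by trace_id unconditionally, without a special case for events that never acquired children.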

Stacked on #934. Retarget once that lands.

@EhabY EhabY self-assigned this May 5, 2026
@EhabY EhabY force-pushed the telemetry/command-dispatcher branch from 4b4c555 to 0e6d1a1 Compare May 5, 2026 13:57
The local JSONL output exposed two schema rough edges. First, session_id
was vscode.env.sessionId verbatim, which is a UUID concatenated with a
ms timestamp ("0f465473-...-bed92cb4ed3a1777982179036"), looking like a
malformed UUID. Second, event_id and trace_id were UUIDv4 with hyphens,
not the lowercase-hex form OTel uses, so a future exporter would need a
translation layer for no real reason.

- Add src/telemetry/ids.ts with newTraceId (16 bytes / 32 hex),
  newSpanId (8 bytes / 16 hex), and newSessionId (16 bytes / 32 hex).
  The trace and span id names and widths match OTel; the session id
  uses the same width as a trace id.
- buildSession takes sessionId as a parameter instead of reading
  vscode.env.sessionId, decoupling our schema from VS Code's quirks.
- TelemetryService accepts sessionId in its constructor and forwards
  it to buildSession.
- ServiceContainer generates one sessionId via newSessionId() and
  threads it to both LocalJsonlSink (filename slug) and TelemetryService
  (event payload), so the on-disk filename and the session_id field
  always match.
- service.ts: replace crypto.randomUUID() with newSpanId / newTraceId
  at every event emission.
- Tests updated for the new sessionId parameter and the new id format
  regex (/^[0-9a-f]{16}$/ for event_id); sessionId is now an explicit
  test fixture.

trace_id stays on every event (including single-event "traces"). You
cannot know at emit time whether a phase child will follow, and a
consistent schema is more valuable to consumers than the 32 hex
characters it costs per event.
@EhabY EhabY force-pushed the telemetry/schema-cleanup branch from 41c010d to 6601664 Compare May 5, 2026 13:58