Description:
Add metrics collection for key operational indicators: token usage, latency, costs, errors, and sampling/validation outcomes. This provides quantitative observability complementing the existing tracing system.
Current State:
- No metrics collection system exists
- Token usage is extracted during backend operations but only used for tracing attributes
- No latency histograms, error counters, or cost tracking
- No Prometheus or OTLP metrics export
Acceptance Criteria:
Description:
Add metrics collection for key operational indicators: token usage, latency, costs, errors, and sampling/validation outcomes. This provides quantitative observability complementing the existing tracing system.
Current State:
Acceptance Criteria: