CCM-33842: Capture LiteLLM token usage in spans by shreyas70 · Pull Request #6 · harness/otel-python-sdk

shreyas70 · 2026-06-25T12:12:02Z

Summary

Capture LiteLLM response usage on the SDK-owned span before it ends so UDP ingest receives gen_ai.usage.input_tokens, gen_ai.usage.output_tokens, and gen_ai.usage.total_tokens.
Normalize LiteLLM GenAI semantic convention fields by emitting gen_ai.provider.name instead of gen_ai.system.
Add LiteLLM response metadata before span end: gen_ai.response.model, gen_ai.response.id, and gen_ai.response.finish_reasons.
Emit dotted cache/reasoning token attributes when available: gen_ai.usage.cache_read.input_tokens, gen_ai.usage.cache_creation.input_tokens, and gen_ai.usage.reasoning.output_tokens.
Do not register LiteLLM's own OTEL callback from the SDK wrapper, since it adds legacy gen_ai.system and duplicates response attributes after the wrapper now enriches the span directly.

Why

llm-model-service Bedrock/LiteLLM responses include token usage, but exported litellm_request spans showed input_tokens=0 and output_tokens=0 because LiteLLM response enrichment happened after the wrapper-owned span had already ended. Copying response metadata before span.end() makes non-streaming LiteLLM spans usable for cost attribution.

Out of scope

Streaming usage accumulation.
Tenant/user allocation attributes.
Resource-level deployment.environment.name changes.

Test plan

.venv/bin/python -m pytest test/instrumentation/litellm/litellm_instrumentation_test.py

Result: 7 passed.

Copy response usage metadata onto the SDK-owned LiteLLM span before ending it so OTLP exports include input and output token attributes. Co-authored-by: Cursor <cursoragent@cursor.com>

CLAassistant · 2026-06-25T12:12:09Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

Emit canonical provider, response metadata, and dotted cache/reasoning token attributes on LiteLLM spans. Co-authored-by: Cursor <cursoragent@cursor.com>

Rely on the SDK wrapper for LiteLLM response enrichment so exported spans do not retain the legacy gen_ai.system attribute. Co-authored-by: Cursor <cursoragent@cursor.com>

t-santoshsahu · 2026-06-25T14:03:26Z

            import litellm  # pylint: disable=import-outside-toplevel

            otel_logger = _get_otel_logger()
-            _register_otel_callback(otel_logger)


Can you test this, I guess this is required for the instrumentation.

Drop the stale local variable left after removing LiteLLM callback registration. Co-authored-by: Cursor <cursoragent@cursor.com>

CCM-33842: fix LiteLLM token usage capture

c866d66

Copy response usage metadata onto the SDK-owned LiteLLM span before ending it so OTLP exports include input and output token attributes. Co-authored-by: Cursor <cursoragent@cursor.com>

CCM-33842: normalize LiteLLM GenAI semconv fields

60b68dd

Emit canonical provider, response metadata, and dotted cache/reasoning token attributes on LiteLLM spans. Co-authored-by: Cursor <cursoragent@cursor.com>

t-santoshsahu previously approved these changes Jun 25, 2026

View reviewed changes

CCM-33842: avoid LiteLLM callback duplicate attrs

44ff5ec

Rely on the SDK wrapper for LiteLLM response enrichment so exported spans do not retain the legacy gen_ai.system attribute. Co-authored-by: Cursor <cursoragent@cursor.com>

shreyas70 dismissed t-santoshsahu’s stale review via 44ff5ec June 25, 2026 13:51

t-santoshsahu reviewed Jun 25, 2026

View reviewed changes

CCM-33842: remove unused LiteLLM logger variable

1f96b2f

Drop the stale local variable left after removing LiteLLM callback registration. Co-authored-by: Cursor <cursoragent@cursor.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

CCM-33842: Capture LiteLLM token usage in spans#6

CCM-33842: Capture LiteLLM token usage in spans#6
shreyas70 wants to merge 4 commits into
harness:mainfrom
shreyas70:CCM-33842-litellm-token-usage

shreyas70 commented Jun 25, 2026 •

edited

Loading

Uh oh!

CLAassistant commented Jun 25, 2026

Uh oh!

t-santoshsahu Jun 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

shreyas70 commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Why

Out of scope

Test plan

Uh oh!

CLAassistant commented Jun 25, 2026

Uh oh!

t-santoshsahu Jun 25, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

shreyas70 commented Jun 25, 2026 •

edited

Loading