Skip to content

Reduce record payload memory for large prompts#8

Draft
weireweire wants to merge 2 commits into
SemiAnalysisAI:cjq/agentx-v0.3from
weireweire:record-strip-payload-bytes-v0.3
Draft

Reduce record payload memory for large prompts#8
weireweire wants to merge 2 commits into
SemiAnalysisAI:cjq/agentx-v0.3from
weireweire:record-strip-payload-bytes-v0.3

Conversation

@weireweire

Copy link
Copy Markdown

Summary

  • Add an opt-in AIPERF_RECORD_STRIP_PAYLOAD_BYTES setting for large-prompt runs.
  • Drop canonical request payload bytes from RecordContext after dispatch when the setting is enabled.
  • Add unit coverage for the payload-stripping path.

Validation

  • PYTHONPATH=tests uv run --extra dev pytest tests/unit/workers/test_inference_client.py -q
  • uv run --extra dev ruff check src/aiperf/common/environment.py src/aiperf/workers/inference_client.py tests/unit/workers/test_inference_client.py

@github-actions

github-actions Bot commented Jun 11, 2026

Copy link
Copy Markdown

Try out this PR

Quick install:

pip install --upgrade --force-reinstall git+https://github.com/ai-dynamo/aiperf.git@3bcd555dfedc18899c8c1465c3ad8d3c82307192

Recommended with virtual environment (using uv):

uv venv --python 3.12 && source .venv/bin/activate
uv pip install --upgrade --force-reinstall git+https://github.com/ai-dynamo/aiperf.git@3bcd555dfedc18899c8c1465c3ad8d3c82307192

Last updated for commit: 3bcd555Browse code

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant