Validate agent-based LLMObs meta_struct export [python@0829a33d95872059d52363b6472a5f7735c6f30d]#7047
Draft
mabdinur wants to merge 2 commits into
Draft
Validate agent-based LLMObs meta_struct export [python@0829a33d95872059d52363b6472a5f7735c6f30d]#7047mabdinur wants to merge 2 commits into
mabdinur wants to merge 2 commits into
Conversation
Point the integration_frameworks scenario at the dev-tagged dd-apm-test-agent image (dev-llmobs-meta-struct) that synthesizes EVP LLMObs requests from APM trace meta_struct["_llmobs"], so the suite can validate dd-trace-py's agent-based LLMObs export (PR #18254). Co-authored-by: Cursor <cursoragent@cursor.com>
Contributor
|
|
|
Extend the dev-tagged dd-apm-test-agent (dev-llmobs-meta-struct) to the remaining LLM-adjacent surfaces so the draft validates no regressions: - PARAMETRIC: exercises tests/parametric/test_llm_observability. - VCRCassettesContainer: backs INTEGRATION_FRAMEWORKS plus the AI_GUARD and AI_GUARD_TELEMETRY scenarios. Non-LLM references (APMTestAgentContainer used by DOCKER_SSI, and the k8s lib-injection test agent) are intentionally left on their pinned versions. Co-authored-by: Cursor <cursoragent@cursor.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Validation-only draft PR. Not for merge. The goal is purely to run the full system-tests CI and confirm it passes against:
munir/agentbased-llmo, commit0829a33) — agent-based LLMObs export, where LLMObs payloads ride APM traces viameta_struct["_llmobs"]for kept traces.meta_struct, published as the dev imageddapm-test-agent:dev-llmobs-meta-struct.Changes
Bump the test agent to the dev
dev-llmobs-meta-structimage across all LLM-adjacent surfaces (other scenarios stay on the released image):INTEGRATION_FRAMEWORKS— anthropic / openai / google_genai LLMObs tests.PARAMETRIC— coverstests/parametric/test_llm_observability/.VCRCassettesContainer— backsINTEGRATION_FRAMEWORKS,AI_GUARD, andAI_GUARD_TELEMETRY.How the tracer build is selected
The
[python@0829a33d95872059d52363b6472a5f7735c6f30d]marker in the PR title puts Python into system-tests dev mode. dd-trace-py no longer publishes wheels keyed by dev-branch name, but per-commit wheels are still available in S3, so the marker pins the exact commit SHA of the PR head (load-binary.sh python->python-load-from-s3-> prebuilt wheel fromdd-trace-py-builds).Expected CI behavior
Fail if target branch is specifiedjob is expected to FAIL by design — it is a merge-guard that fires whenever a[lang@...]marker is present. This is not a test failure.Test plan
INTEGRATION_FRAMEWORKSLLMObs suites pass in CI against the tracer commit + dev test agentPARAMETRICLLMObs tests passAI_GUARD/AI_GUARD_TELEMETRYpass