test(onboard): add strict tool-call probe e2e#4541
Conversation
|
No actionable comments were generated in the recent review. 🎉 ℹ️ Recent review info⚙️ Run configurationConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Enterprise Run ID: 📒 Files selected for processing (1)
✅ Files skipped from review due to trivial changes (1)
📝 WalkthroughWalkthroughAdds a hermetic Bash+Node E2E that validates strict chat-completions tool-call payloads and retry/recovery behavior against a local mock server, and wires the test into the regression-e2e CI with selector gating and failure-only log upload. ChangesStrict Tool-Call Probe E2E Coverage
Estimated code review effort🎯 3 (Moderate) | ⏱️ ~22 minutes Possibly related PRs
Suggested labels
Suggested reviewers
Poem
🚥 Pre-merge checks | ✅ 4 | ❌ 1❌ Failed checks (1 warning)
✅ Passed checks (4 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches📝 Generate docstrings
🧪 Generate unit tests (beta)
Warning There were issues while running some tools. Please review the errors and either fix the tool's configuration or disable the tool if it's a critical failure. 🔧 ESLint
ESLint skipped: no ESLint configuration detected in root package.json. To enable, add Comment |
E2E Advisor RecommendationRequired E2E: Dispatch hint: Full advisor summaryE2E Recommendation AdvisorBase: Required E2E
Optional E2E
New E2E recommendations
Dispatch hint
|
E2E Scenario Advisor RecommendationRequired scenario E2E: None Full scenario advisor summaryE2E Scenario AdvisorBase: Required scenario E2E
Optional scenario E2E
Relevant changed files
|
PR Review AdvisorFindings: 1 needs attention, 2 worth checking, 0 nice ideas Review findings🛠️ Needs attention
🔎 Worth checking
🌱 Nice ideas
Since last review detailsCurrent findings:
This is an automated advisory review. A human maintainer must make the final merge decision. |
There was a problem hiding this comment.
Actionable comments posted: 1
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.
Inline comments:
In @.github/workflows/regression-e2e.yaml:
- Around line 270-276: Update the GitHub Actions E2E job to harden the Checkout
and Node setup steps: in the checkout step (the actions/checkout usage) add
persist-credentials: false to prevent credential persistence, and replace the
floating actions/setup-node@v6 reference with a pinned full commit SHA for
setup-node to avoid unexpected changes; locate the lines using the
actions/checkout and actions/setup-node symbols and apply those two changes.
🪄 Autofix (Beta)
Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Enterprise
Run ID: 3a6bca40-67ad-44ed-a275-361bc2ad75e4
📒 Files selected for processing (3)
.github/workflows/regression-e2e.yamlsrc/lib/inference/onboard-probes.tstest/e2e/test-strict-tool-call-probe.sh
|
✨ |
Summary
Adds a hermetic regression E2E for the strict Chat Completions tool-call validation path introduced around Local Ollama onboarding. The test uses a local OpenAI-compatible mock to assert the bounded payload shape, transient retry behavior, and fail-closed behavior when no structured tool call is returned.
Related Issue
Fixes #4537
Changes
test/e2e/test-strict-tool-call-probe.shto exercise strict tool-call validation against a mock OpenAI-compatible endpoint.strict-tool-call-probe-e2einto.github/workflows/regression-e2e.yamlfor workflow dispatch.max_tokenscap insrc/lib/inference/onboard-probes.ts.Type of Change
Verification
npx prek run --all-filespassesnpm testpassesnpm run docsbuilds without warnings (doc changes only)Signed-off-by: Carlos Villela cvillela@nvidia.com
Summary by CodeRabbit
Tests
Chores
Documentation