test(uipath-troubleshoot): Word Add Picture failure diagnosis tasks#1447
Conversation
Four e2e diagnose-mode coder-eval tasks, one per branch of the add-picture-failures playbook: - missing-scope (C1): Add Picture sits outside the Use Word File scope - com-interop (C2): environmental TYPE_E_LIBNOTREGISTERED on the robot host - bookmark-missing (C3): InsertRelativeTo=Bookmark, bookmark absent at runtime - image-variable (C4): ImagePath bound to a UiPath.Core.Image via .ToString() Passing run: 4/4 tasks, 2/2 replicates each at score 1.0 (runs/2026-06-12_13-58-58, claude-sonnet-4-6 coder, --repeats 2 -j 3). run_limits normalized to the troubleshoot standard 5400/60/3600. Depends on #1439 (adds the playbook the llm_judge prompts reference). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
|
Claude finished @Stefan-Virgil's task in 3m 25s —— View job PR Review: test(uipath-troubleshoot): Word Add Picture failure diagnosis tasks
SummaryAdds four Change-by-Change Review1.
|
| Area | Status | Notes |
|---|---|---|
| Frontmatter | N/A | Tasks, not skills — no SKILL.md frontmatter |
| E2E Tests | OK | 4 well-crafted e2e tasks, one per playbook branch; passing run claimed |
| Task Body | OK | Lean criteria (skill_triggered + llm_judge only), correct run_limits, minimal prompts |
| References & Assets | OK | Fixtures, process snapshots, and RESOLUTION.md are thorough and internally consistent |
| Repo Hygiene | OK | CODEOWNERS covers the path; no secrets/PII; all host/robot names scrubbed; no cross-skill references |
Issues for Manual Review
- Domain accuracy of mock error messages and stack traces: The synthetic error messages (e.g.,
UiPath.Word.Activities.WordAddImage.ResolveInsertLocation,WordApplicationScopeRuntime.EnsureWordApplication) and stack frames are plausible but fabricated for the test scenario. If these don't match real Word Activities error output, agents that have seen real errors may be confused by the mismatch. Only verifiable by someone who has seen these errors in production. docsai askpassthrough: All four manifests proxydocsai askto the real CLI, which requires validuipauth at test runtime. Confirm CI runners have this configured.
Conclusion
Clean, well-structured PR. All four tasks follow the troubleshoot test conventions closely: lean criteria, detailed ground truths, correct mock structures, and faithful process snapshots that reproduce each defect. The tag taxonomy note and missing rpa tag are both Low — neither blocks merge. No changes requested.
|
Claude finished @Stefan-Virgil's task in 2m 0s —— View job Coder-eval task lint (advisory)4 task YAMLs changed; verdicts: 0 Critical, 0 High, 0 Medium, 0 Low, 4 OK. Rubric: .claude/commands/lint-task.md. This check is advisory and never blocks merge. Evidence of passing run✅ Author confirms passing run: "4/4 tasks, 2/2 replicates each at Per-task lint
|
…) diagnosis task Add word-addpicture-word-crash e2e/mode:diagnose task: WINWORD.EXE crashes mid-insert and the job faults with InvalidCastException to Word._Document / 0x8001010E (RPC_E_WRONG_THREAD) on WordInteropActivity.EndExecute. Workflow is correct; the COM error is a downstream symptom of the process crash. Ground truth = environmental Word crash (capture faulting module, repair Office, bitness, orphaned WINWORD.EXE) and/or pre-resize the large image (Add Picture has no resize property) — not an XAML edit. Grades against the new word-com-interop-failures.md playbook (E4). Repoint the existing word-addpicture-com-interop task to the same package-level playbook (E1, reached via add-picture-failures.md C2). Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
What
Four
e2e/mode:diagnosecoder-eval tasks for the Word Add Picture (WordAddImage) failure family, one per branch of theadd-picture-failures.mdplaybook added in #1439:word-addpicture-missing-scopeAdd Pictureis a sibling of (outside) theUse Word Filescope → no open documentword-addpicture-com-interopTYPE_E_LIBNOTREGISTERED(HRESULT0x8002801D) on the robot host — workflow is correctword-addpicture-bookmark-missingInsertRelativeTo="Bookmark"but the runtime document lacks the bookmarkword-addpicture-image-variableImagePathbound to aUiPath.Core.Imagevia.ToString()→ opens a file literally namedUiPath.Core.ImageEach grades
skill_triggered(w 1.0) + a branch-specificllm_judge(w 3.0,pass_threshold 0.7) against aRESOLUTION.mdground truth, with wrong-branch answers capped at 0.5.Passing run
--repeats 2 -j 3,claude-sonnet-4-6coder,runs/2026-06-12_13-58-58:4/4 tasks, 2/2 replicates each at
weighted_score = 1.0.An earlier run clipped two replicates at the under-spec
max_turns: 45; normalizing all four to the troubleshoot standardtask_timeout 5400 / max_turns 60 / turn_timeout 3600produced the clean sweep.Lint
/lint-taskon all four: OK (0 Critical/High/Medium/Low). No self-report, no over-specification, distinct branches (good scaffold reuse), nocommand_executedverbs to check,sandboxispython: {}only,run_limitstop-level.Merge order
🤖 Generated with Claude Code
Update — Word process-crash task (RPC_E_WRONG_THREAD) + playbook split
Added a 5th task and refactored the COM branch into a package-level playbook (see companion commit on #1439):
word-addpicture-word-crash0x8001010E(RPC_E_WRONG_THREAD)InvalidCastExceptiontoWord._DocumentonWordInteropActivity.EndExecuteis a downstream symptom of the process crash. Workflow correct → diagnose the crash (faulting module / Office repair / bitness / orphaned WINWORD.EXE) and/or pre-resize the large image (Add Picturehas no resize property).The environmental COM family was lifted out of
add-picture-failures.md(C2) into the new package-levelword-com-interop-failures.mdplaybook (causes E1 type-library/class not registered, E2 bitness, E3 busy/blocked, E4 process crash), since it applies to all Word activities, not just Add Picture. The existingword-addpicture-com-interoptask is repointed to that playbook (E1, reached via the delegating C2).verify_manifest_commands.py: 72/72 shapes valid./lint-taskand a fresh coder-eval run still pending.