Expose verifier feedback properties for hint generation by dzorlu · Pull Request #14 · fleet-ai/OpenEnv

dzorlu · 2026-03-16T01:18:26Z

Summary

Add _tool_errors, _verifier_stdout, _verifier_error instance vars to FleetTaskEnv
Accumulate tool errors during step_async() (both MCP errors and exceptions)
Capture verifier stdout/error in _compute_reward() after verifier runs
Expose via properties: verifier_stdout, verifier_error, tool_errors_list

This enables SkyRL to build hints from failed rollout feedback without an LLM call. Used by fleet-ai/SkyRL#hint-augmentation PR.

Test plan

Verify existing tests still pass
Verify properties return None/[] when no errors occur
Verify tool errors accumulate across multiple steps
Verify verifier stdout is captured after _compute_reward()

🤖 Generated with Claude Code

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

Add _tool_errors, _verifier_stdout, _verifier_error to FleetTaskEnv so SkyRL can build hints from failed rollout feedback without an LLM call. - Tool errors accumulated in step_async() on MCP errors and exceptions - Verifier stdout/error captured in _compute_reward() after verifier runs - Verifier exceptions also captured in _verifier_error (not just failures) - All feedback properties reset in reset_async() to prevent cross-episode leakage - Properties: verifier_stdout, verifier_error, tool_errors_list Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

cursor Bot reviewed Mar 16, 2026

View reviewed changes

Comment thread src/envs/fleet_env/task_env.py

Comment thread src/envs/fleet_env/task_env.py

dzorlu force-pushed the deniz/hint-feedback branch from 4d9d1c9 to 3566e7c Compare March 16, 2026 01:27

dzorlu changed the base branch from main to deniz/fleet_client March 16, 2026 01:27

dzorlu merged commit 438c7b3 into deniz/fleet_client Mar 17, 2026
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Expose verifier feedback properties for hint generation#14

Expose verifier feedback properties for hint generation#14
dzorlu merged 1 commit into
deniz/fleet_clientfrom
deniz/hint-feedback

dzorlu commented Mar 16, 2026

Uh oh!

cursor Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dzorlu commented Mar 16, 2026

Summary

Test plan

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant