Skip to content

basic skeleton for creating a client from docker image'd server#8

Merged
pankit-eng merged 1 commit into
mainfrom
env_code
Oct 8, 2025
Merged

basic skeleton for creating a client from docker image'd server#8
pankit-eng merged 1 commit into
mainfrom
env_code

Conversation

@pankit-eng

Copy link
Copy Markdown
Contributor

No description provided.

@meta-cla meta-cla Bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 8, 2025
@pankit-eng pankit-eng merged commit 719de6f into main Oct 8, 2025
1 check passed
pankit-eng pushed a commit that referenced this pull request Nov 3, 2025
Fix(client): Correctly pass timeout parameter to parent class
rycerzes pushed a commit to rycerzes/OpenEnv that referenced this pull request Nov 19, 2025
basic skeleton for creating a client from docker image'd server
rycerzes pushed a commit to rycerzes/OpenEnv that referenced this pull request Nov 19, 2025
Fix(client): Correctly pass timeout parameter to parent class
lilyzhng added a commit to lilyzhng/OpenEnv that referenced this pull request Mar 8, 2026
Renders HTML to PNG via Playwright, sends to Claude Sonnet via OpenRouter
with chain-of-thought prompt (describe shape → classify → PASS/FAIL).

Tested: correctly FAILs diamond shape, PASSes hourglass shape.
GPT-4o failed this discrimination — Claude Sonnet's CoT is necessary.

Adds criterion huggingface#8 to hand-draw env: "visually depicts an hourglass"
(only checked at final scoring, not per-step, to save API cost).

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

CLA Signed This label is managed by the Meta Open Source bot.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant