Add BaseHTTPClient to talk to Envs via JSON over RPC. by pankit-eng · Pull Request #4 · huggingface/OpenEnv

pankit-eng · 2025-10-06T22:15:03Z

The change is primarily adding HTTP base client to talk over JSON RPC to a container. The container bootstrap code is not added yet. It will be in the next PR.

Dipg research

Openenv cli/v1 ci

Add BaseHTTPClient to talk to Envs via JSON over RPC.

Dipg research

feat: webarena torchforge grpo integration

Old: every no-progress turn counted as wasted (25/29 in Run 10) New: each no-progress streak gets 2-turn grace period (exploration), only turns beyond grace count as wasted (stuck in a loop) This aligns with principle huggingface#4: agent MUST explore (ls, cat, grep) to discover context. Exploration turns shouldn't be penalized. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Per Section F done-gate huggingface#4: measured the heuristic scaffold's composite reward on hero seeds 9500/9501/9502 (Qwen2.5-3B-Instruct loaded on Colab T4; heuristic eval identical to the dry-run's evaluate_heuristic path). Measured totals: 9500 (planned tier_a): 0.3287 R3=0.01 9501 (planned tier_b): 0.4623 R3=0.99 9502 (planned tier_c): 0.1892 R3=0.01 Measured ordering 9501 > 9500 > 9502 does NOT match the planned easy > medium > hard (9500 > 9501 > 9502). The 9500↔9501 swap fails the monotonicity gate, so labels stay tier_a / tier_b / tier_c. Mechanism: 9500 has only 2 mismatches but the heuristic over-claims on one of them and trips Rule 36(4) (R3=0.01). 9501's 5 mismatches spread across more suppliers; no single supplier's claim exceeds 2B cap so R3=0.99 dominates. A real trained policy may reverse this — re-measure after real Colab training before the pitch demo. Artifact committed: data/hero_baseline.json. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

core env changes

7db1092

meta-cla Bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 6, 2025

pankit-eng merged commit 95af407 into main Oct 7, 2025
1 check passed

pankit-eng pushed a commit that referenced this pull request Nov 3, 2025

Merge pull request #4 from surfiniaburger/dipg-research

569e902

Dipg research

burtenshaw added a commit that referenced this pull request Nov 3, 2025

Merge pull request #4 from burtenshaw/openenv-cli/v1-ci

284ffdc

Openenv cli/v1 ci

Darktex mentioned this pull request Nov 7, 2025

Add Env for playing Pokémon using poke-env #131

Closed

rycerzes pushed a commit to rycerzes/OpenEnv that referenced this pull request Nov 19, 2025

Merge pull request huggingface#4 from facebookexternal/env_code

6f83de1

Add BaseHTTPClient to talk to Envs via JSON over RPC.

rycerzes pushed a commit to rycerzes/OpenEnv that referenced this pull request Nov 19, 2025

Merge pull request huggingface#4 from surfiniaburger/dipg-research

d41c356

Dipg research

Darktex mentioned this pull request Nov 25, 2025

[RFC 003] Add MCP (Model Context Protocol) support - Phase 1 #224

Merged

3 tasks

EchoRaven pushed a commit to BillChan226/openenv-gen that referenced this pull request Jan 5, 2026

Merge pull request huggingface#4 from BillChan226/feature/web-training

bdc535b

feat: webarena torchforge grpo integration

This was referenced Jan 13, 2026

Add agentic-first workflow infrastructure for Claude Code #287

Merged

[RFC] Add RFC 004: Rubrics #237

Merged

Add custom task system for BrowserGym environment #201

Closed

Darktex mentioned this pull request May 8, 2026

Add harness session runtime and BrowserGym adapter for training and evaluation #471

Closed

This was referenced May 22, 2026

[feature] Import external envs (ors, verifiers) #726

Open

feat(mini_swe_env): add SWE-Gym async GRPO environment with Pi interception and HF Space deployment #695

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add BaseHTTPClient to talk to Envs via JSON over RPC.#4

Add BaseHTTPClient to talk to Envs via JSON over RPC.#4
pankit-eng merged 1 commit into
mainfrom
env_code

pankit-eng commented Oct 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

pankit-eng commented Oct 6, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant