Skip to content

v0.4.1

Choose a tag to compare

@garrett4wade garrett4wade released this 14 Nov 05:38
· 85 commits to main since this release
a3c5ac2

What's Changed

  • feat: add raise_timeout parameter to allow quiet waiting for inference results by @garrett4wade in #547
  • Fix batch size in example examples/vlm/clevr_count_70k_grpo.yaml by @wangruohui in #549
  • chore: format dataset and reward folders with ruff by @garrett4wade in #551
  • refactor: rename the should_accept argument in rollout/prepare_batch to should_accept_fn by @garrett4wade in #555
  • chore: delete not-planned experimental features by @garrett4wade in #554
  • feat: add grpo trainer and simplify gsm8k grpo example by @dhh1995 in #552
  • feat: add launch_server and teardown_server in inference engine api by @garrett4wade in #550
  • [Refactor] refactor stats_tracker usage in engines and examples by @nuzant in #556
  • refactor: allow passing string paths and init kwargs as rollout workflows by @garrett4wade in #525
  • feat: introduces session-centric tracing APIs by @rchardx in #539
  • doc: Add notes about asynchronous RL training by @garrett4wade in #558
  • format: ruff format examples directory by @fishcrap in #559
  • feat: support proxy server and client for training openai-compatible agents by @dhh1995 in #500
  • chore: change type annotations and minor fixes for single-controller mode by @garrett4wade in #560
  • docs: add "Performance Profiling" guide to best practices by @rchardx in #538
  • add README for proxy_agent by @yulangz in #561
  • chore: extends engine perf instrumentation by @rchardx in #562
  • [FEAT] add pause/resume generation for vLLM server by @fishcrap in #563
  • doc: update AReaL design doc with the current dev status by @garrett4wade in #568
  • doc: update documentation to align the current dev status by @garrett4wade in #570
  • refactor: extend allocation mode to support allocation naming and composition by @garrett4wade in #565
  • feat: align perf_tracer with task hierarchy by @rchardx in #569
  • chore: add hint for the breaking change of allocation mode by @garrett4wade in #572
  • [FIX] fix atrace_session_phase in workflow by @fishcrap in #573
  • chore: Quick fix for GSPO missing in doc by @ZiyiTsang in #576
  • ci: build docker images with GCP by @garrett4wade in #564
  • refactor: restrict the usage scope of the rollout_batch method by @garrett4wade in #567
  • chore: add issue template for questsions by @garrett4wade in #571
  • ci: automatically tag the dev image upon new releases by @garrett4wade in #574
  • chore: remove the old script used for validating installation by @garrett4wade in #575
  • [FEAT] Add Qwen3-VL model support for fsdp by @fishcrap in #557
  • bump v0.4.1 by @garrett4wade in #577

New Contributors

Full Changelog: v0.4.0...v0.4.1