v0.4.1
What's Changed
- feat: add
raise_timeoutparameter to allow quiet waiting for inference results by @garrett4wade in #547 - Fix batch size in example
examples/vlm/clevr_count_70k_grpo.yamlby @wangruohui in #549 - chore: format dataset and reward folders with ruff by @garrett4wade in #551
- refactor: rename the
should_acceptargument inrollout/prepare_batchtoshould_accept_fnby @garrett4wade in #555 - chore: delete not-planned experimental features by @garrett4wade in #554
- feat: add grpo trainer and simplify gsm8k grpo example by @dhh1995 in #552
- feat: add launch_server and teardown_server in inference engine api by @garrett4wade in #550
- [Refactor] refactor
stats_trackerusage in engines and examples by @nuzant in #556 - refactor: allow passing string paths and init kwargs as rollout workflows by @garrett4wade in #525
- feat: introduces session-centric tracing APIs by @rchardx in #539
- doc: Add notes about asynchronous RL training by @garrett4wade in #558
- format: ruff format examples directory by @fishcrap in #559
- feat: support proxy server and client for training openai-compatible agents by @dhh1995 in #500
- chore: change type annotations and minor fixes for single-controller mode by @garrett4wade in #560
- docs: add "Performance Profiling" guide to best practices by @rchardx in #538
- add README for proxy_agent by @yulangz in #561
- chore: extends engine perf instrumentation by @rchardx in #562
- [FEAT] add pause/resume generation for vLLM server by @fishcrap in #563
- doc: update AReaL design doc with the current dev status by @garrett4wade in #568
- doc: update documentation to align the current dev status by @garrett4wade in #570
- refactor: extend allocation mode to support allocation naming and composition by @garrett4wade in #565
- feat: align perf_tracer with task hierarchy by @rchardx in #569
- chore: add hint for the breaking change of allocation mode by @garrett4wade in #572
- [FIX] fix atrace_session_phase in workflow by @fishcrap in #573
- chore: Quick fix for GSPO missing in doc by @ZiyiTsang in #576
- ci: build docker images with GCP by @garrett4wade in #564
- refactor: restrict the usage scope of the
rollout_batchmethod by @garrett4wade in #567 - chore: add issue template for questsions by @garrett4wade in #571
- ci: automatically tag the dev image upon new releases by @garrett4wade in #574
- chore: remove the old script used for validating installation by @garrett4wade in #575
- [FEAT] Add Qwen3-VL model support for fsdp by @fishcrap in #557
- bump v0.4.1 by @garrett4wade in #577
New Contributors
- @wangruohui made their first contribution in #549
Full Changelog: v0.4.0...v0.4.1