Skip to content

Commit d7a6d4e

Browse files
nlevin-uicsahithicquil11functionstackxishandhanani
authored
Add DSR1 FP8 H200 Dynamo TRT-LLM configurations (#570)
* add h200 srtslurm setup * fix slurm parameters * refactor logs * fix log path * refactor * fix slurm output * fix * fix * Add H200 dynamo-trt disaggregated configs for 1k1k and 8k1k Expand dsr1-fp8-h200-dynamo-trt section with full configuration set: - 1k1k MTP configs (c4-c512) with CONFIG_FILE references - 1k1k STP configs (c4-c512) with CONFIG_FILE references - 8k1k MTP configs (c4-c512) with CONFIG_FILE references - 8k1k STP configs (c4-c512) with CONFIG_FILE references All configs reference recipe YAMLs in srt-slurm-trtllm repo under recipies/trtllm/h200/{1k1k,8k1k}/{mtp,stp}/ * add uv and sqsh file * Update H200 dynamo-trt container image to nvcr.io#nvidia/ai-dynamo/tensorrtllm-runtime:0.8.0 * Fix H200 CONFIG_FILE references to match corrected recipe filenames * Fix H200 dp-attn values to match recipe filenames dep8 = enable_attention_dp: true = dp-attn: true tep8 = enable_attention_dp: false = dp-attn: false * Fix H200 model path to use DeepSeek-R1-0528 Update MODEL_PATH from /models/dsr1-fp8 (old DeepSeek-R1) to /models/DeepSeek-R1-0528 (new version matching nvidia-master.yaml) * add uv install and image fix * fix squash file path * fix srt-slurm repo * update h200 runner details * Add perf-changelog entry for dsr1-fp8-h200-dynamo-trt config * Add H200 sglang disagg configs from srtslurm (#580) - Add dsr1-fp8-h200-dynamo-sglang config to nvidia-master.yaml - Include 1k1k configs: aggregated, low-latency (1P9D), high-throughput TEP/DEP (1P6D) - Include 8k1k configs: aggregated, TEP variants (1P7D, 1P6D, 1P3D, 2P3D), DEP (1P1D) - Add perf-changelog entry for new configuration - Document recipe registration process in AGENT.md * Revert "Add H200 sglang disagg configs from srtslurm (#580)" (#581) This reverts commit f6609d9. * recipies -> recipes * Update dsr1-fp8-h200-dynamo-trt image to 0.8.1.post1 * Add dynamic container mapping for srtslurm.yaml - Update SQUASH_FILE to use /data/containers/ with + separators - Strip nvcr.io/ prefix from path to match actual .sqsh filenames - Add CONTAINER_KEY to convert IMAGE to srt-slurm format (nvcr.io#) - Map container key to .sqsh path dynamically in srtslurm.yaml * Pin srt-slurm to sa-submission-q1-2026 branch Use the release branch for Q1 2026 submission instead of main. * Add srt-slurm GitHub URLs above h200 CONFIG_FILE entries Link each CONFIG_FILE to its source in srt-slurm sa-submission-q1-2026 branch. * Update perf-changelog.yaml to modify DSR1 configurations Removed outdated DSR1 FP8 H200 Dynamo TRT configuration details and re-added them in a new section. --------- Co-authored-by: Sahithi Chigurupati <schigurupati@nvidia.com> Co-authored-by: Sahithi Chigurupati <chigurupati.sahithi@gmail.com> Co-authored-by: Cameron Quilici <cjquilici@gmail.com> Co-authored-by: functionstackx <47992694+functionstackx@users.noreply.github.com> Co-authored-by: ishandhanani <82981111+ishandhanani@users.noreply.github.com>
1 parent efe2f53 commit d7a6d4e

3 files changed

Lines changed: 656 additions & 0 deletions

File tree

0 commit comments

Comments
 (0)