Chun reima/sglang fp8 v0.5.8 mi355 by chunfangamd · Pull Request #572 · SemiAnalysisAI/InferenceX

chunfangamd · 2026-01-27T13:49:34Z

Update MI355X DeepSeek R1 FP8 SGLang image from v0.5.5.post3 to v0.5.8

Key fix: disable the MLA persistent kernel when not using fp8 kv_cache (sgl-project/sglang#17327)
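The behaviour named in the key fix can be pictured as a simple gate on the KV cache dtype. The sketch below is illustrative only: the helper name `use_mla_persistent_kernel` and the dtype strings are assumptions made for this example, not SGLang's actual code or API. The real change is in sgl-project/sglang#17327.

```python
def use_mla_persistent_kernel(kv_cache_dtype: str) -> bool:
    """Hypothetical helper: allow the MLA persistent kernel only with an FP8 KV cache."""
    # Assumed dtype spellings; the real project may name these differently.
    fp8_dtypes = {"fp8", "fp8_e4m3", "fp8_e5m2"}
    return kv_cache_dtype.lower() in fp8_dtypes


if __name__ == "__main__":
    # Show the decision for an FP8 dtype and two non-FP8 dtypes.
    for dtype in ("fp8_e4m3", "bfloat16", "auto"):
        print(dtype, use_mla_persistent_kernel(dtype))
```

With a gate like this, a launch that leaves the KV cache in a non-FP8 dtype would fall back to the non-persistent path, which matches the intent stated in the PR description.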

Contributor list:

1am9trash
HaiShaw
chunfangamd
rkarhila-amd


functionstackx · 2026-01-27T16:15:11Z

@pr-claude review plz

claude · 2026-01-27T16:15:34Z

Claude finished @functionstackx's task in 51s

PR Review Complete

Gather context and read changed files
Check for CLAUDE.md instructions (none found)
Validate master config and perf-changelog requirements
Review code changes for bugs/issues
Post review summary

Summary: This PR updates the MI355X DeepSeek R1 FP8 SGLang Docker image from v0.5.5.post3 to v0.5.8. The master config change is properly accompanied by a perf-changelog entry.
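The rule applied in the summary above (a master config change should come with a perf-changelog entry) could be checked along these lines. This is a sketch under stated assumptions: `master-config.yaml` is a hypothetical placeholder path, since only `perf-changelog.yaml` is named in this PR, and the function is not the repository's actual validation.

```python
from typing import Iterable

CHANGELOG = "perf-changelog.yaml"      # file name taken from this PR
MASTER_CONFIG = "master-config.yaml"   # hypothetical placeholder path


def changelog_accompanies_config(changed_files: Iterable[str],
                                 config_path: str = MASTER_CONFIG) -> bool:
    """Return True unless the config changed without a changelog update."""
    changed = set(changed_files)
    if config_path not in changed:
        return True  # config untouched, so nothing to enforce
    return CHANGELOG in changed
```

Passing the list of files changed in a PR returns `False` only when the config file was modified and `perf-changelog.yaml` was not.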

I left one suggestion: the perf-changelog description should mention the key fix (disabling MLA persistent kernel when not using fp8 kv_cache) as noted in the PR body and requested by @functionstackx in the inline comment.

No blocking issues found.


functionstackx · 2026-01-27T16:50:11Z

e2e tests already ran here, so I cancelled sweep-enabled: https://github.com/InferenceMAX/InferenceMAX/actions/runs/21399947035

* Update SGLang Docker Image for MI355 to v0.5.8

  1. activate FP8 KV cache
  2. use the MLA persistent kernel

* Do not activate FP8 KV cache and the MLA persistent kernel explicitly

* Add config-keys (v0.5.5.post3 --> v0.5.8)

* Update perf-changelog.yaml with key fix description for v0.5.8

  Add description: Disables mla persistent kernel when not using fp8 kv_cache

  Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>

---------

Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>
Co-authored-by: functionstackx <47992694+functionstackx@users.noreply.github.com>

chunfangamd added 2 commits January 27, 2026 13:37

Update SGLang Docker Image for MI355 to v0.5.8

dc52006

1. activate FP8 KV cache
2. use the MLA persistent kernel

Do not activate FP8 KV cache and the MLA persistent kernel explicitly

e784c52

chunfangamd requested a review from a team as a code owner January 27, 2026 13:49

github-project-automation bot added this to InferenceMAX Board Jan 27, 2026

Add config-keys (v0.5.5.post3 --> v0.5.8)

a4a5549

functionstackx added the sweep-enabled label Jan 27, 2026

functionstackx reviewed Jan 27, 2026

perf-changelog.yaml (review thread resolved)

Update perf-changelog.yaml with key fix description for v0.5.8

38297f3

Add description: Disables mla persistent kernel when not using fp8 kv_cache

Co-authored-by: functionstackx <functionstackx@users.noreply.github.com>

claude bot reviewed Jan 27, 2026

perf-changelog.yaml (review thread resolved)

functionstackx removed the sweep-enabled label Jan 27, 2026

Merge branch 'main' into chun_reima/sglang_fp8_v0.5.8_mi355

f50e14b

functionstackx merged commit 2a796d1 into main Jan 27, 2026
11 checks passed

functionstackx deleted the chun_reima/sglang_fp8_v0.5.8_mi355 branch January 27, 2026 17:36

github-project-automation bot moved this to Done in InferenceMAX Board Jan 27, 2026

functionstackx merged 5 commits into main from chun_reima/sglang_fp8_v0.5.8_mi355


3 participants
