-
Notifications
You must be signed in to change notification settings - Fork 39
Open
Labels
FeatureenhancementNew feature or requestNew feature or requesthelp wantedExtra attention is neededExtra attention is needed
Description
Checklist
- If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sgl-jax/discussions/new/choose Otherwise, it will be closed.
- Please use English, otherwise it will be closed.
Motivation
The tpu-inference released a new fast moe kernel. We should integrate it. See Fused MOE Pallas Kernel.
Related resources
Metadata
Metadata
Assignees
Labels
FeatureenhancementNew feature or requestNew feature or requesthelp wantedExtra attention is neededExtra attention is needed