sgl-project repositories

sglang

Public

SGLang is a fast serving framework for large language models and vision language models.

cuda inference pytorchtransformer openai moe llama vlm kimi blackwell

Python

•

Apache License 2.0

•3.7k•21k•637•1k•Updated

Dec 9, 2025

sgl-project.github.io

Public

This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.

HTML

•24•92•8•1•Updated

Dec 9, 2025

sgl-kernel-npu

Public

SGLang kernel library for NPU

C++

•

MIT License

•58•82•12•18•Updated

Dec 9, 2025

sglang-jax

Public

JAX backend for SGL

Python

•

Apache License 2.0

•39•190•65•19•Updated

Dec 9, 2025

sgl-kernel-xpu

Public

SGLang kernel library for Intel XPU

Python

•

MIT License

•13•15•0•13•Updated

Dec 9, 2025

SpecForge

Public

Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

training eagle pytorchllm fsdp sglang eagle3

Python

•

MIT License

•115•534•46•17•Updated

Dec 9, 2025

whl

Public

Kernel Library Wheel for SGLang

cuda cutlass sglangflashinfer

HTML

•

MIT License

•3•16•1•0•Updated

Dec 8, 2025

ome

Public

OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)

k8s llama oracle-cloudmodel-serving model-as-a-service multi-node-kubernetes llm llm-inference deepseek sglang

Go

•

MIT License

•49•327•30•15•Updated

Dec 7, 2025

rbg

Public

A workload for deploying LLM inference services on Kubernetes

k8s llm sglangpd-disagg

Go

•

Apache License 2.0

•34•126•11•6•Updated

Dec 3, 2025

sgl-learning-materials

Public

Materials for learning SGLang

MIT License

•48•678•0•0•Updated

Dec 1, 2025

DeepGEMM

Public

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda

•

MIT License

•771•21•0•1•Updated

Nov 30, 2025

genai-bench

Public

Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

Python

•

MIT License

•38•237•4•10•Updated

Nov 28, 2025

sgl-test-files

Public

The test files for SGLang.

MIT License

•2•1•0•1•Updated

Nov 22, 2025

FlashMLA

Public

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++

•

MIT License

•912•0•0•0•Updated

Nov 20, 2025

sgl-flash-attn

Public

Fast and memory-efficient exact attention

Python

•

BSD 3-Clause "New" or "Revised" License

•2.2k•14•0•0•Updated

Nov 18, 2025

fast-hadamard-transform

Public

Fast Hadamard transform in CUDA, with a PyTorch interface

C

•

BSD 3-Clause "New" or "Revised" License

•49•0•0•0•Updated

Oct 15, 2025

sgl-whl

Public

SGLang wheels for multiple platforms

MIT License

•1•11•1•0•Updated

Oct 13, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sgl-project

All

All

17 repositories

sglang

sgl-project.github.io

sgl-kernel-npu

sglang-jax

sgl-kernel-xpu

SpecForge

whl

ome

rbg

sgl-learning-materials

DeepGEMM

genai-bench

sgl-test-files

FlashMLA

sgl-flash-attn

fast-hadamard-transform

sgl-whl

All

All

Repositories list

17 repositories