dev-dockers

This repository contains Docker configurations for various development tasks.

Structure

The repository is organized as a set of Docker configuration directories:

dockers/<docker-dir>/

Each directory contains relevant Dockerfiles, Docker Compose files, and related scripts for a specific environment or tool.
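For example, the infer-dev environment described below roughly follows this layout (a sketch assembled from the paths referenced in this README; actual contents may differ):

dockers/infer-dev/
  pei-configure.sh                # regenerates the artifacts under src/
  model-configs/
    glm-4.7-q2k.toml              # sample llama-server instance config
  src/
    user_config.persist.yml       # durable config edits live here
    user_config.yml               # copied from the persist file before configuring
    docker-compose.yml            # defines the stage-1 and stage-2 services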

llama.cpp inference (infer-dev)

dockers/infer-dev is a PeiDocker-based CUDA dev container that can auto-launch llama-server on startup.

1) Configure (required after any config change)

# Keep durable edits in user_config.persist.yml, then copy to user_config.yml
cp dockers/infer-dev/src/user_config.persist.yml dockers/infer-dev/src/user_config.yml

# Regenerate the artifacts under dockers/infer-dev/src/
cd dockers/infer-dev
./pei-configure.sh --with-merged
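As a quick sanity check, the compose file used in the next step should now be present under dockers/infer-dev/src/ (assuming it is among the generated artifacts):

ls -l dockers/infer-dev/src/docker-compose.yml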

2) Build the images

docker compose -f dockers/infer-dev/src/docker-compose.yml build stage-1
docker compose -f dockers/infer-dev/src/docker-compose.yml build stage-2
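If you want to see which images the compose file will use before starting anything, recent Compose v2 releases can list them (a convenience check, not required):

docker compose -f dockers/infer-dev/src/docker-compose.yml config --images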

3) Start llama-server via TOML (auto-launch hook)

The entry hook can auto-start llama-server instances, but it is off by default.

  • Set AUTO_INFER_LLAMA_CPP_ON_BOOT=1 (or true) to enable auto-start on boot.
  • Set AUTO_INFER_LLAMA_CPP_CONFIG to point at a TOML file with instance definitions.
  • If auto-start is disabled, you can run /soft/app/llama-cpp/check-and-run-llama-cpp.sh manually inside the container (see the sketch below).
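For a manual start, a minimal sketch (run inside the container; this assumes the TOML config is mounted at the path used in the example below and that the script reads AUTO_INFER_LLAMA_CPP_CONFIG):

export AUTO_INFER_LLAMA_CPP_CONFIG=/model-configs/glm-4.7-q2k.toml   # illustrative path
/soft/app/llama-cpp/check-and-run-llama-cpp.sh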

Example (GLM-4.7 Q2_K):

  • Config: dockers/infer-dev/model-configs/glm-4.7-q2k.toml
  • Host port 11980 → container port 8080 (see dockers/infer-dev/src/docker-compose.yml)

Run with the env vars set (this publishes the service ports and mounts the config directory into the container):

docker compose -f dockers/infer-dev/src/docker-compose.yml run -d --service-ports --name infer-glm \
  -v "$PWD/dockers/infer-dev/model-configs:/model-configs:ro" \
  -e AUTO_INFER_LLAMA_CPP_ON_BOOT=1 \
  -e AUTO_INFER_LLAMA_CPP_CONFIG=/model-configs/glm-4.7-q2k.toml \
  stage-2 sleep infinity
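To confirm the hook picked up the config and started llama-server, follow the container logs (assuming the hook writes to the container's stdout/stderr):

# Container name matches the --name given in the run command above
docker logs -f infer-glm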

Verify:

curl http://127.0.0.1:11980/v1/models
curl http://127.0.0.1:11980/v1/chat/completions -H 'Content-Type: application/json' -d '{
  "model": "glm4",
  "messages": [{"role": "user", "content": "Hello"}],
  "max_tokens": 64
}'
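llama-server can take a while to load a large model, so a short wait loop before the requests above can help (port taken from the mapping described earlier):

# Poll the models endpoint until the server responds (Ctrl-C to abort)
until curl -sf http://127.0.0.1:11980/v1/models >/dev/null; do sleep 2; done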

Notes:

  • The sample config mounts a specific model directory to /llm-models/... (not the entire host model tree); to test other models, adjust dockers/infer-dev/src/user_config.persist.yml and rerun ./dockers/infer-dev/pei-configure.sh.
  • Setting AUTO_INFER_LLAMA_CPP_PKG_PATH together with AUTO_INFER_LLAMA_CPP_GET_PKG_ON_BOOT=1 (or true) installs a prebuilt llama.cpp bundle into /soft/app/llama-cpp on boot (the archive is cached under /soft/app/cache). If auto-install is off, run /soft/app/llama-cpp/get-llama-cpp-pkg.sh inside the container; a minimal sketch follows below.
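A minimal sketch of the manual install path (container name from the run example above; whether the script also needs AUTO_INFER_LLAMA_CPP_PKG_PATH set in its environment is not covered here):

docker exec -it infer-glm /soft/app/llama-cpp/get-llama-cpp-pkg.sh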
