Keep a constant buffer of idle hot-standby Pods for instant capacity.
When some Pods become busy, the operator scales up so that:
desiredReplicas = busyPods + idleTarget
Example: idleTarget=3, and 2 Pods turn busy ⇒ scale to 5 total (2 busy + 3 idle).
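As a sketch, the reconciliation target can be written as a pure function. Clamping to `minReplicas`/`maxReplicas` is assumed from the spec fields; the exact operator behavior may differ:

```python
def desired_replicas(busy: int, idle_target: int,
                     min_replicas: int = 0, max_replicas: int = 50) -> int:
    """desiredReplicas = busyPods + idleTarget, clamped to [minReplicas, maxReplicas]."""
    return max(min_replicas, min(busy + idle_target, max_replicas))

# e.g. idleTarget=3 and 2 busy Pods -> 5 total (2 busy + 3 idle)
```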
- Custom Resource: `HotStandbyDeployment` (namespaced).
- Two ways to detect "busy":
  - `busyProbe.mode: annotation` — your app flips an annotation (`paia.tech/busy: "true"|"false"`).
  - `busyProbe.mode: http` — the operator probes `http://<pod-ip>:<port><path>`.
- Works like a Deployment: you provide `.spec.selector` + `.spec.podTemplate`.
- Status is reported (`busyCount`, `idleCount`, `desiredReplicas`).
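A rough sketch of how the two detection modes could be interpreted (function names are illustrative; the operator's actual logic may differ):

```python
def annotation_says_busy(annotations: dict, key: str = "paia.tech/busy") -> bool:
    # annotation mode: the Pod is busy iff the annotation is literally "true"
    return annotations.get(key, "false") == "true"

def http_probe_says_busy(status_code: int, success_is_busy: bool = True) -> bool:
    # http mode: a 2xx response means "busy" when successIsBusy is true,
    # and "idle" when it is false
    return (200 <= status_code < 300) == success_is_busy
```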
Prereqs
- Python 3.10+
- A working kubeconfig (`kubectl get ns` works)

```bash
pip install kopf kubernetes requests
```
1) Apply the CRD (if you don’t already have it):
```bash
kubectl apply -f crd/hsd-crd.yaml
```

2) Apply an example HSD (annotation mode shown below):

```yaml
# examples/hsd-annotation.yaml
apiVersion: apps.paia.tech/v1alpha1
kind: HotStandbyDeployment
metadata:
  name: game-ws
  namespace: default
spec:
  idleTarget: 3          # keep 3 always idle (standby)
  minReplicas: 0
  maxReplicas: 50
  selector:
    matchLabels:
      app: game-ws
  busyProbe:
    mode: annotation
    annotationKey: paia.tech/busy
  podTemplate:
    metadata:
      labels:
        app: game-ws
    spec:
      serviceAccountName: pod-self-annotator  # if your app annotates itself
      containers:
        - name: app
          image: ghcr.io/your-org/game-ws:latest
          ports:
            - name: http
              containerPort: 8080
```

```bash
kubectl apply -f examples/hsd-annotation.yaml
```

3) Run the operator locally (uses your kubeconfig):
```bash
# watch only the default namespace
kopf run --namespace default ./operate.py

# or watch all namespaces
# kopf run --all-namespaces ./operate.py
```

Check that it's working:
```bash
kubectl get hotstandbydeployments
kubectl describe hsd game-ws
kubectl get deploy game-ws-workload
```

When some Pods are marked busy, you should see the child Deployment scale to `busy + idleTarget`.
Your app toggles its own Pod annotation:
```python
# busy_marker.py
import os

from kubernetes import client, config

ANNOTATION_KEY = os.getenv("BUSY_KEY", "paia.tech/busy")
POD = os.environ["POD_NAME"]
NS = os.environ["POD_NAMESPACE"]

def set_busy(is_busy: bool):
    # in cluster: load_incluster_config(); locally: load_kube_config()
    try:
        config.load_incluster_config()
    except Exception:
        config.load_kube_config()
    v1 = client.CoreV1Api()
    body = {"metadata": {"annotations": {ANNOTATION_KEY: "true" if is_busy else "false"}}}
    v1.patch_namespaced_pod(POD, NS, body)
```

Add Downward API envs in your pod:
```yaml
env:
  - name: POD_NAME
    valueFrom: { fieldRef: { fieldPath: metadata.name } }
  - name: POD_NAMESPACE
    valueFrom: { fieldRef: { fieldPath: metadata.namespace } }
```

RBAC: give the Pod's ServiceAccount `get` and `patch` on `pods` in its namespace.
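For reference, a minimal Role/RoleBinding granting this could look like the following (all names here are illustrative):

```yaml
apiVersion: rbac.authorization.k8s.io/v1
kind: Role
metadata:
  name: pod-self-annotator
  namespace: default
rules:
  - apiGroups: [""]
    resources: ["pods"]
    verbs: ["get", "patch"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: RoleBinding
metadata:
  name: pod-self-annotator
  namespace: default
subjects:
  - kind: ServiceAccount
    name: pod-self-annotator
    namespace: default
roleRef:
  kind: Role
  name: pod-self-annotator
  apiGroup: rbac.authorization.k8s.io
```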
Expose an endpoint in your container:

```
GET /busy -> 200 when busy, 503 when idle   # (configurable via successIsBusy)
```
Configure the CR:

```yaml
busyProbe:
  mode: http
  http:
    port: 8080
    path: /busy
    successIsBusy: true
    timeoutSeconds: 1
    periodSeconds: 10
```

Build & deploy:
```bash
# build/push your image (example)
docker build -t ghcr.io/your-org/hsd-operator:latest .
docker push ghcr.io/your-org/hsd-operator:latest

kubectl apply -f crd/hsd-crd.yaml
kubectl apply -f manifests/rbac.yaml
kubectl apply -f manifests/operator.yaml
kubectl apply -f examples/hsd-annotation.yaml
```

- Do not combine another HPA with the same child Deployment; both would fight over `.spec.replicas`. If you must, use the HPA for upper-bound protection only (carefully).
- To avoid sending new traffic to busy Pods, drop readiness when busy (or use a custom `readinessGate`).
- For gradual scale-downs, consider implementing a cool-down window in your app or operator (this repo's reference operator keeps it simple).
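To drop readiness while busy, one option is a readiness endpoint whose polarity is inverted relative to the busy probe — a sketch (path and port are illustrative):

```yaml
readinessProbe:
  httpGet:
    path: /ready     # your app: 200 when idle, 503 when busy
    port: 8080
  periodSeconds: 5
  failureThreshold: 1
```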
- `ConfigException: Service host/port is not set.`
  You ran locally but tried in-cluster config. The provided operator auto-detects; run with `kopf run --namespace default ./operate.py`.
- `FutureWarning: namespaces or cluster-wide flag will become an error`
  Add `--namespace default` or `--all-namespaces` to `kopf run`.
- `Forbidden` / RBAC errors when creating/patching Deployments or updating status
  Ensure the operator ServiceAccount has the permissions from the RBAC manifest above.
- Windows: `OS signals are ignored`
  A benign warning from Kopf on Windows; safe to ignore when developing locally.
```yaml
spec:
  idleTarget: <int>        # keep this many idle pods at all times
  minReplicas: <int>       # optional
  maxReplicas: <int>       # optional
  selector:
    matchLabels: {...}     # labels to select child pods
  podTemplate: {...}       # like Deployment.spec.template (camelCase OK)
  busyProbe:
    mode: annotation|http  # default: annotation
    annotationKey: paia.tech/busy
    http:
      port: 8080
      path: /busy
      successIsBusy: true
      timeoutSeconds: 1
      periodSeconds: 10
status:
  busyCount: <int>
  idleCount: <int>
  desiredReplicas: <int>
  observedGeneration: <int>
```

Apache License, Version 2.0
Issues and PRs welcome!
If you want a Helm chart or scale-down cool-down logic, open an issue with your use-case.