kevy

A pure-Rust, zero-dependency, Redis-compatible key–value store — usable as a standalone server or as an embedded library, built to run as fast as the hardware allows.

kevy speaks the Redis wire protocol (RESP2), so redis-cli, valkey-cli, and every Redis client library talk to it unchanged. Underneath, the engine is a modern thread-per-core, shared-nothing design written entirely in Rust — the only C it touches is the unavoidable OS syscall boundary.

cargo run -p kevy --bin kevy --release      # loopback, AOF on, port 6004
redis-cli -p 6004 SET hello world

Why kevy

Fast — 2.3–2.7× valkey 9.1's throughput at high concurrency, 2.7× on pub/sub fan-out, and ~18 M ops/s per core when embedded (numbers below).
Tiny footprint — a 768 KB server binary that boots into under 5 MB of RAM. Fits a container sidecar, a small VM, or an edge box.
Modern architecture — thread-per-core, shared-nothing, no locks on the hot path, io_uring on Linux. No global lock, no GIL-style bottleneck.
No supply-chain risk — zero crates.io dependencies. The whole tree is std + kevy's own crates; the only C is the OS syscall boundary, hand-bound in one crate. There is nothing else to audit.
Drop-in compatible — RESP2 wire protocol, 98-command parity with valkey 9.1 (incl. pattern pub/sub and WATCH/UNWATCH optimistic CAS), reply-checked byte-for-byte. Existing clients and tools just work.
Embeddable — kevy-store is a plain Rust library: no network, no runtime, also builds for wasm32. The same engine, in your process.
Resource-adaptive — runs full-speed when memory is unbounded, degrades cleanly when it isn't, and refuses loudly at the edge instead of corrupting silently (details).

Honest about scope: kevy is single-node — no replication, clustering, AUTH/TLS, or public-internet exposure (see when to use kevy).

Performance

All figures below were measured on one bare-metal Intel Core i7-10700K (8 cores / 16 threads, 3.8 GHz base / 5.1 GHz boost), 62 GB RAM, Linux 6.12.90, in-memory. Every benchmark is reproducible with the scripts in bench/; full method and caveats in bench/REPORT.md.

Server throughput (over the network)

Beating valkey 9.1 is the floor, not the goal — kevy targets the hardware ceiling.

redis-benchmark, each server pinned to cores 0–9 with the client on isolated cores and run in isolation. Every engine uses its fastest config (kevy: io_uring at -c50, epoll at -c1; valkey/redis: io-threads):

workload	kevy	valkey 9.1	redis 7.4
-c50 -P16 GET	4.4 M/s	2.5 M/s	2.3 M/s
-c50 -P16 SET	4.7 M/s	1.9 M/s	2.0 M/s
-c1 GET	86 k/s	65 k/s	48 k/s
-c1 SET	72 k/s	63 k/s	54 k/s

Against the C reference for io_uring: kevy's hand-written bindings reach a 148 ns nop round-trip vs liburing 2.9's 152 ns — at the Linux kernel floor, with no liburing linked. Reproduce with bench/loopback_c50.sh and bench/loopback_c1.sh.

Cluster routing (key-aware client)

A single-port client that lands on the wrong shard pays an internal cross-shard forwarding hop. The cluster-aware ClusterClient routes each key straight to its owner shard and removes that hop. Clean lx64 16-core box, server/client on disjoint cores, GET at concurrency 64:

client path	throughput	p99 latency
single-shard proxy (cross-shard hop)	333 k/s	3858 µs
`ClusterClient` (zero hop)	533 k/s	260 µs

1.6× throughput, ~15× lower tail latency — purely from removing the forwarding hop, with no measurable overhead vs a hand-rolled raw router. Full method in docs/cluster.md.

Embedded throughput (in-process, no network)

Drop kevy-store into your app and call it directly — no socket, no RESP parsing, no reactor. Single core, Store API:

operation	latency (median)	throughput
`get` (hit)	54 ns	~18.5 M ops/s
`get` (miss)	14 ns	—
`set` (overwrite)	76 ns	~13 M ops/s
`incr`	86 ns	—

That's roughly 3× the per-core throughput of the network server — the embedded path skips the entire wire layer. Reproduce with cargo run -p kevy-store --example bench_keyspace --release.

Pub/sub fan-out (server mode)

1 publisher → 50 subscribers, 200 000 messages, 16-byte payload. kevy is the fastest broker on the TCP / RESP path:

system	delivered msg/s	vs valkey
Aeron 1.45 (IPC, shared memory)	26.5 M	3.90×
kevy	18.2 M	2.68×
ZeroMQ 4.3.5	9.3 M	1.37×
redis 7.4	8.5 M	1.25×
valkey 9.1	6.8 M	1.00×
Zenoh 1.9	2.7 M	0.40×

Aeron's shared-memory IPC is the structural ceiling (no kernel network stack); among TCP brokers kevy leads — 2× ZeroMQ on the same transport. Pub/sub is a server-mode feature; the embedded library is pure key–value. Method + the 6-way harness: bench/pubsub-compare/.

Binary size & memory


Server binary (`release`, stripped)	768 KB
Server binary (`release-min`, `opt-level="s"`)	640 KB
Idle RSS (default, 16 threads)	4.9 MB
Idle RSS (`--threads 1`)	2.5 MB
Memory per key (at 8.6 M keys)	~190 B (key + value + table overhead)

SmallBytes inlines payloads ≤ 22 B with zero heap allocation. A complete kevy server is a sub-megabyte binary that boots into under 5 MB of RAM.

Quick start

Install

Pre-built kevy server binaries are attached to every GitHub Release. Supported targets:

platform	archive
Linux x86_64	`kevy-<TAG>-x86_64-unknown-linux-gnu.tar.gz`
Linux aarch64	`kevy-<TAG>-aarch64-unknown-linux-gnu.tar.gz`
macOS Apple Silicon	`kevy-<TAG>-aarch64-apple-darwin.tar.gz`

Windows: kevy's OS layer is POSIX sockets + epoll/kqueue + io_uring, so there is no native Windows build. Use the Docker image below — Docker Desktop on Windows runs Linux containers transparently.

One-liner (Linux / macOS — pick your target):

TAG=v1.2.0
TARGET=x86_64-unknown-linux-gnu      # or aarch64-unknown-linux-gnu, aarch64-apple-darwin
curl -L "https://github.com/goliajp/kevy/releases/download/$TAG/kevy-$TAG-$TARGET.tar.gz" | tar -xz
sudo install "kevy-$TAG-$TARGET/kevy" /usr/local/bin/kevy
kevy --port 6004

Each archive ships the kevy binary plus kevy.toml.example, README.md, and both license files. A matching .sha256 is published alongside each asset. Or build from source as below.

Run with Docker

The official image is published on every release to both Docker Hub (goliakk/kevy) and GitHub Container Registry (ghcr.io/goliajp/kevy), multi-arch (linux/amd64 + linux/arm64). Tags on both registries: :<semver> (e.g. :1.0.0-rc6), :rc (rolling latest RC), and :latest (stable releases only — never RC).

# One-shot
docker run --rm -p 6379:6379 goliakk/kevy:rc

# Persistent (snapshot + AOF survive restarts via a named volume)
docker run -d --name kevy -p 6379:6379 -v kevy-data:/data goliakk/kevy:rc
redis-cli -p 6379 SET foo bar

Image defaults: KEVY_BIND=0.0.0.0, KEVY_PORT=6379, KEVY_DIR=/data, KEVY_AOF=1. Override any with -e or by passing flags after the image: docker run ... goliakk/kevy:rc --threads 4 --port 7000.

On Linux, kevy auto-selects io_uring when the host can build the ring (kernel ≥ 5.19, and io_uring_setup not blocked by seccomp) and otherwise falls back to the epoll reactor — startup never fails on either. Docker's default seccomp profile blocks io_uring_setup, so the default image runs on epoll; allow io_uring for the faster reactor:

docker run --rm -p 6379:6379 \
  --security-opt seccomp=unconfined goliakk/kevy:rc

Override the auto-pick with KEVY_IO_URING=0 (force epoll) or KEVY_IO_URING=1 (force io_uring — fail loudly if unavailable, for benchmarks). macOS/BSD always use kqueue.

Prefer the GitHub registry? Swap any goliakk/kevy above for ghcr.io/goliajp/kevy — identical image, same tags.

As a server

# Build + run with defaults (loopback only, AOF on, port 6004)
cargo run -p kevy --bin kevy --release

# Or with a TOML config file
cp crates/kevy/kevy.toml.example ./kevy.toml
cargo run -p kevy --bin kevy --release -- --config ./kevy.toml

redis-cli -p 6004 SET foo bar
redis-cli -p 6004 GET foo

Precedence is CLI flags > env vars > TOML file > built-in defaults:

kevy --bind 0.0.0.0 --port 7000 --threads 4 --dir /var/lib/kevy
# env equivalents: KEVY_BIND  KEVY_PORT  KEVY_THREADS  KEVY_DIR  KEVY_AOF

See crates/kevy/kevy.toml.example for the fully annotated config schema.

Cluster mode (single node, key-aware routing)

--cluster (or KEVY_CLUSTER=1 / [cluster] enabled = true) exposes each shard as a virtual cluster node: shard i gets a deterministic extra port at port + 1 + i, CLUSTER SLOTS / SHARDS / NODES report the real topology (CRC16 {hashtag} slots, one contiguous range per shard), and a wrong-shard key on a cluster port answers -MOVED instead of being forwarded. Stock cluster-aware clients (redis-cli -c, redis-benchmark --cluster, client libraries) then talk straight to the owning shard — no cross-shard forwarding tax. The main port keeps full proxy-style behaviour for everything else.

kevy --threads 8 --cluster          # main port 6004, shard ports 6005-6012
redis-cli -c -p 6005 SET foo bar    # follows MOVED automatically

For Rust callers, kevy-client 1.9.0 ships a typed ClusterClient — discover the topology once, then route every key to its owner shard with no -MOVED and no forwarding hop (the 1.6× throughput / 15× tail-latency win above):

// Cargo.toml: kevy-client = "1.9.0"
use kevy_client::ClusterClient;

let mut cc = ClusterClient::connect("127.0.0.1", 6005)?;  // any shard port as seed
cc.set(b"user:42", b"alice")?;                            // routed by CRC16 slot
let v = cc.get(b"user:42")?;
let removed = cc.del(&[b"a", b"b", b"c"])?;               // multi-key may span shards
# Ok::<(), std::io::Error>(())

It wraps string / hash / list / set / sorted-set / del / exists / dbsize / flushall / ping / publish; full guide, command table, and same-slot rules in docs/cluster.md. Use it when one client drives enough load that the hop shows up; the plain single-port Connection stays correct and simpler for ordinary use.

Superset notes vs Redis Cluster (single machine, single process — there is no failover, resharding, MIGRATE/ASK, or gossip): cross-slot multi-key commands (MGET, SUNION, transactions, blocking fan-outs) execute instead of failing with -CROSSSLOT, and keyspace-wide views (KEYS, SCAN, DBSIZE) stay whole-keyspace on every port. Switching an existing data dir in or out of cluster mode re-homes keys once at startup (sources are backed up as *.premigration.<ts>).

As an embedded library

// Cargo.toml: kevy-store = "0.1"
use kevy_store::Store;

let mut s = Store::default();
s.set(b"key".to_vec(), b"value".to_vec(), None, false, false);
assert_eq!(s.get(b"key").unwrap().unwrap(), b"value");

Resource-adaptive by design

kevy follows one rule about resources: release performance when there's room, stay alive when there isn't, gate hard at the edge, and fail loudly — never silently. This runs end to end through the engine:

Unbounded = full speed. With maxmemory = 0 (the default) there is no accounting overhead at all — the eviction bookkeeping is compiled past on a single not-taken branch. You pay nothing for a limit you don't set.
Bounded = graceful eviction. Set maxmemory + a policy (LRU / LFU / Random / TTL, 8 in total) and writes evict sampled keys back to 5% below the limit — headroom so the next write doesn't immediately re-enter eviction.
Edge = loud refusal, not corruption. Under NoEviction (the default policy) a write that would exceed the budget is refused with Redis's classic OOM error before it runs — an O(1) precheck on the hot path. Only memory-growing verbs are gated; shrinkers (DEL, LPOP, SREM, EXPIRE, …) and FLUSH* always go through, so you can always recover a full instance.
Capability degrades, not crashes. io_uring is probed at startup and falls back to epoll on older kernels / seccomp sandboxes (force either with KEVY_IO_URING). The wasm32 embedded build runs with a host-fed clock and reduced surface rather than refusing to build. A non-loopback --bind prints a warning (kevy has no AUTH/TLS) instead of silently exposing you.

The cluster-aware ClusterClient is the same philosophy on the client: spend the connections to skip the forwarding hop when load justifies it, stay on the simple single port when it doesn't.

When to use kevy

kevy is production-ready for four scenarios:

Local dev — cargo run -p kevy + your favourite Redis client.
docker-compose internal — KEVY_BIND=0.0.0.0 inside the network; the trust boundary is the docker network itself.
Embedded library — drop kevy-store into your app: no network, no reactor.
Cache — fronted by a real database, kevy holds hot data with TTL + maxmemory + LRU / LFU eviction.

Out of scope by design: replication, clustering, AUTH / TLS, and direct public-internet exposure. For HA / multi-host, use a Kubernetes StatefulSet or a sidecar-proxy pattern. The full scope rationale and the 98-command parity table live in MIGRATION-FROM-VALKEY.md.

Crates

kevy ships as small, reusable crates — 8 publishable libraries plus the server-internal pieces:

crate	role
`kevy-bytes`	owned byte string with inline-or-heap small-string optimization
`kevy-hash`	fast non-cryptographic hash for single-trust-domain keyspaces
`kevy-map`	Swiss-table hashmap with SIMD group scan
`kevy-resp`	zero-alloc RESP2 / 3 parser
`kevy-ring`	bounded lock-free SPSC queue
`kevy-madvise`	Linux `MADV_HUGEPAGE` wrapper, no-op elsewhere
`kevy-uring`	pure-Rust io_uring bindings, no liburing
`kevy-resp-client`	blocking RESP2 client
`kevy-config` · `kevy-store` · `kevy-rt` · `kevy-persist`	config, keyspace, runtime, persistence
`kevy-sys`	the sole libc boundary (server-internal)
`kevy`	the server binary

Embedded ↔ server with one URL

kevy-client v1.6.0+ + kevy-embedded v1.1.0+ let the same code switch between an in-process backend and a TCP kevy server with a single URL string — including pub/sub (channels + patterns), WATCH-driven transactions, and typed Transaction::exec_typed reply cursors:

use kevy_client::{Connection, Subscriber, PubsubEvent};

let url = std::env::var("KEVY_URL").unwrap_or_else(|_| "mem://app".into());
let mut sub  = Subscriber::open(&url, &[b"events"])?;  // consumer
let mut conn = Connection::open(&url)?;                // producer
let _ack = sub.recv()?;                                 // drain SUBSCRIBE ack
conn.publish(b"events", b"hello")?;
match sub.recv()? {
    PubsubEvent::Message { channel, payload } => { /* same code in dev + prod */ }
    _ => {}
}
# Ok::<(), std::io::Error>(())

URL	Backend
`mem://`	anonymous in-process, per-open fresh — no shared bus
`mem://<name>`	shared in-process bus keyed by `<name>`
`file:///abs/path`	shared in-process with snapshot + AOF persistence
`kevy://host:port` · `redis://…` · `tcp://…`	TCP RESP server

Full walkthrough + caveats: docs/pubsub.md.

Commands

All five Redis data types — String, Hash, List, Set, Sorted Set — plus Streams (XADD / XREAD / XRANGE / consumer groups), blocking pops (BLPOP / BRPOP / XREAD BLOCK / XREADGROUP BLOCK — single- and multi-key, across shards), pub/sub (SUBSCRIBE / PSUBSCRIBE — pattern glob), transactions (MULTI / EXEC / DISCARD / WATCH / UNWATCH — optimistic CAS), persistence (SAVE / BGSAVE / BGREWRITEAOF), and operations (INFO / CONFIG (real hot-modification) / CLIENT / …). Multi-key commands, pub/sub, WATCH, and blocking pops all work across the per-core shards, and WRONGTYPE behaves as in Redis.

The full command list with valkey-parity notes is in MIGRATION-FROM-VALKEY.md. Durability — AOF, snapshots, TTL semantics, rewrite/compaction, crash recovery, and the embedded introspection API — is documented in docs/persistence.md.

Build & test

cargo build --workspace --release
cargo test  --workspace
bash bench/run.sh        # portability smoke (docker, no pipeline) — NOT a perf benchmark
bash bench/loopback_c50.sh   # headline perf vs valkey/redis (Linux, host-loopback, pinned)

Stable Rust 1.95, Rust 2024 edition. Builds on Linux (x86_64, aarch64) and macOS. kevy-embedded and its dependency closure also build for wasm32-unknown-unknown / wasm32-wasip1 — see docs/wasm.md for the WebAssembly walkthrough.

Roadmap & stability

kevy is in the v1.x line (current workspace v1.2.x, with v1.3.0 in flight). Everything that v1.x promises to keep — persistence format, RESP wire protocol, public Rust API, CLI flags, env vars, TOML schema, eviction semantics — is add-only across the v1.x line: a file written by v1.0 loads on any later v1.x build, and additive features (WATCH, pattern pub/sub, real CONFIG SET, typed transaction cursors) land in minor releases without breaking earlier code. The full stability contract is in MIGRATION-FROM-VALKEY.md.

Name		Name	Last commit message	Last commit date
Latest commit History 419 Commits
.githooks		.githooks
.github/workflows		.github/workflows
bench		bench
crates		crates
docs		docs
examples		examples
.dockerignore		.dockerignore
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
Dockerfile		Dockerfile
GIT-FLOW.md		GIT-FLOW.md
LICENSE-APACHE		LICENSE-APACHE
LICENSE-MIT		LICENSE-MIT
MIGRATION-FROM-VALKEY.md		MIGRATION-FROM-VALKEY.md
README.ja.md		README.ja.md
README.md		README.md
README.zh-CN.md		README.zh-CN.md
rust-toolchain.toml		rust-toolchain.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

kevy

Why kevy

Performance

Server throughput (over the network)

Cluster routing (key-aware client)

Embedded throughput (in-process, no network)

Pub/sub fan-out (server mode)

Binary size & memory

Quick start

Install

Run with Docker

As a server

Cluster mode (single node, key-aware routing)

As an embedded library

Resource-adaptive by design

When to use kevy

Crates

Embedded ↔ server with one URL

Commands

Build & test

Roadmap & stability

License

About

Licenses found

Uh oh!

Releases 27

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

kevy

Why kevy

Performance

Server throughput (over the network)

Cluster routing (key-aware client)

Embedded throughput (in-process, no network)

Pub/sub fan-out (server mode)

Binary size & memory

Quick start

Install

Run with Docker

As a server

Cluster mode (single node, key-aware routing)

As an embedded library

Resource-adaptive by design

When to use kevy

Crates

Embedded ↔ server with one URL

Commands

Build & test

Roadmap & stability

License

About

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases 27

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages