Orchestrate a Red–Green–Refactor loop with three LLM roles (tester, implementor, refactorer). Each step applies a JSON patch, runs tests, and commits to git. Works with Gemini and OpenAI-compatible APIs (e.g., DeepSeek, GitHub Models); mock mode for offline runs.
- Three roles with independent models/providers per step
- JSON patch protocol for safe file edits
- Git commit at the end of each role (clean history, easy rollback)
- Test runner is configurable (Cargo, pytest, npm, etc.)
- Refactor step auto-reverts if tests break
```bash
cargo install red-green-refactor
rgr init-config --out red-green-refactor.yaml
```

If you prefer to build from source, run:

```bash
cargo build --release

# Create a sample config
./target/release/rgr init-config --out red-green-refactor.yaml
```

Edit your YAML (e.g., `red-green-refactor.yaml`) to pick providers and your test command.
- Provider kinds: `gemini`, `open_ai`, `mock`
- OpenAI-compatible providers (DeepSeek, Perplexity, Groq, OpenRouter, GitHub Models, local servers) use `kind: open_ai` plus `base_url` and `api_key_env`
- Optional header customization for OpenAI-compatible endpoints:
  - `api_key_header`: custom header name (default: `Authorization`)
  - `api_key_prefix`: prefix for the header value (default: `"Bearer "`; set to `""` for raw keys)
Important: put your kata's rules in your kata repo at `docs/kata-rules.md` and tell each role to read it in its `system_prompt`. The tool automatically includes Markdown files in the model context.
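For illustration, a minimal `docs/kata-rules.md` might look like this (the contents are entirely up to you; the function name below is just a placeholder):

```markdown
# Bowling Kata Rules
- Build a `score(rolls)` function for a complete game of ten-pin bowling.
- A spare scores 10 plus the next roll; a strike scores 10 plus the next two rolls.
- Add exactly one test at a time and take the smallest step that makes it pass.
```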
Example (Gemini tester/refactorer + DeepSeek implementor):
```yaml
tester:
  provider:
    kind: gemini
    model: gemini-1.5-pro
    api_key_env: GEMINI_API_KEY
  system_prompt: "Read docs/kata-rules.md. Add exactly one failing test per the rules. Output ONLY JSON LlmPatch."

implementor:
  provider:
    kind: open_ai
    model: deepseek-chat
    base_url: https://api.deepseek.com
    api_key_env: DEEPSEEK_API_KEY
  system_prompt: "Read docs/kata-rules.md. Make tests pass with the smallest change. Output ONLY JSON LlmPatch."

refactorer:
  provider:
    kind: gemini
    model: gemini-1.5-pro
    api_key_env: GEMINI_API_KEY
  system_prompt: "Read docs/kata-rules.md. Refactor without changing behavior. Output ONLY JSON LlmPatch."

test_cmd: "cargo test --color never"
max_context_bytes: 200000
```

Export keys (adjust to your config):
```bash
export GEMINI_API_KEY=your_gemini_key
export DEEPSEEK_API_KEY=your_deepseek_key
```

- Most setups work with standard Bearer auth (defaults):
```yaml
provider:
  kind: open_ai
  model: gpt-4o-mini
  base_url: https://models.github.ai/inference
  api_key_env: GITHUB_TOKEN  # or GITHUB_MODELS_TOKEN
  # uses defaults: Authorization + "Bearer "
```

- If your endpoint expects an `api-key` header without Bearer:
```yaml
provider:
  kind: open_ai
  model: gpt-4o-mini
  base_url: https://models.github.ai/inference
  api_key_env: GITHUB_MODELS_API_KEY
  api_key_header: api-key
  api_key_prefix: ""
```

Note: some GitHub-hosted endpoints may require additional headers (e.g., `X-GitHub-Api-Version`). If you need that, open an issue; support can be added easily.
```bash
# New kata project
cargo new bowling_kata
cd bowling_kata

# Make sure to add the kata rules to the kata project,
# for example as a docs/kata-rules.md file

# From the red-green-refactor repo (adjust path if needed)
../red-green-refactor/target/release/red-green-refactor --project . --config ../red-green-refactor/red-green-refactor.yaml

# Or continuous mode
../red-green-refactor/target/release/red-green-refactor --project . --config ../red-green-refactor/red-green-refactor.yaml run
```

What happens each cycle:
- Tester adds one failing test and commits
- Implementor makes tests pass and commits (up to configurable retries)
- Refactorer improves code and commits; if tests fail, that commit is automatically reverted
Inspect:
```bash
git --no-pager log --oneline
```

```bash
# Simple Python project with pytest
mkdir mars_rover && cd mars_rover
git init
python3 -m venv .venv
. .venv/bin/activate
pip install pytest

# Edit your YAML to use pytest
# test_cmd: "pytest -q"

# Run red-green-refactor
../red-green-refactor/target/release/red-green-refactor --project . --config ../red-green-refactor/red-green-refactor.yaml
```
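The corresponding config change is a one-line edit; a sketch, assuming you keep the provider blocks from the example above:

```yaml
# red-green-refactor.yaml (excerpt)
test_cmd: "pytest -q"
```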
- Gemini: `kind: gemini`, set `api_key_env` (e.g., `GEMINI_API_KEY`). Models like `gemini-1.5-pro`.
- OpenAI-compatible (e.g., DeepSeek, GitHub Models, Perplexity): `kind: open_ai`, set `base_url` and `api_key_env`. Optional: `api_key_header` (e.g., `api-key`) and `api_key_prefix` (e.g., `""` for raw keys).
- Mock: `kind: mock` for offline dry runs (appends to `red-green-refactor-mock.log`); see the sketch after this list.
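As a sketch, a fully offline configuration using the mock provider for every role (no API keys needed; whether `kind: mock` accepts additional provider fields is an assumption):

```yaml
tester:
  provider:
    kind: mock
  system_prompt: "Add exactly one failing test. Output ONLY JSON LlmPatch."
implementor:
  provider:
    kind: mock
  system_prompt: "Make tests pass with the smallest change. Output ONLY JSON LlmPatch."
refactorer:
  provider:
    kind: mock
  system_prompt: "Refactor without changing behavior. Output ONLY JSON LlmPatch."
test_cmd: "cargo test --color never"
```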
- DeepSeek: `https://api.deepseek.com` (available models: `deepseek-chat`, `deepseek-reasoner`)
- Perplexity: `https://api.perplexity.ai` (some available models: `sonar`, `sonar-pro`, `sonar-reasoning`; full list here); see the sketch after this list
- GitHub Models: `https://models.github.ai/inference` (available models here)
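For example, a Perplexity provider block assembled from the endpoints above (the `PERPLEXITY_API_KEY` variable name is an assumption; use whatever you export):

```yaml
provider:
  kind: open_ai
  model: sonar
  base_url: https://api.perplexity.ai
  api_key_env: PERPLEXITY_API_KEY  # assumed name; match your exported variable
```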
- Context is collected from `src/**`, `tests/**`, `Cargo.toml`, README, and Markdown files, truncated at `max_context_bytes`.
- Each role must output only a JSON `LlmPatch` (see the example after this list):
  - `files`: list of edits `{ path, mode: "rewrite"|"append", content }`
  - `commit_message` (optional)
- Implementor retries: set `implementor_max_attempts` (default 3). On exhaustion, the tool branches `attempts/implementor-...` and resets to the tester commit.
- The git repo is auto-initialized; the refactor commit is reverted if tests break.
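For illustration, a minimal `LlmPatch` a tester role might emit (only the fields listed above; the file path and test body are placeholders):

```json
{
  "files": [
    {
      "path": "tests/bowling.rs",
      "mode": "append",
      "content": "#[test]\nfn gutter_game_scores_zero() {\n    assert_eq!(score(&[0; 20]), 0);\n}\n"
    }
  ],
  "commit_message": "test: gutter game scores zero"
}
```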
- Missing API key: ensure `api_key_env` matches your exported variable.
- Tests not running: set `test_cmd` to your runner (e.g., `pytest -q`, `npm test`, `mvn -q test`).
- Large repos: raise `max_context_bytes`.
- Broken refactor: the tool hard-resets the last commit; re-run to continue.
```bash
# One cycle (default)
./target/release/red-green-refactor --project <path> --config red-green-refactor.yaml

# Continuous
./target/release/red-green-refactor --project <path> --config red-green-refactor.yaml run

# Generate sample config
./target/release/red-green-refactor init-config --out red-green-refactor.yaml
```