diff --git a/README.md b/README.md
Lines changed: 138 additions & 101 deletions
@@ -1,130 +1,167 @@
 # BaseAgent - SDK 3.0
 
-High-performance autonomous agent for [Term Challenge](https://term.challenge). Supports multiple LLM providers with **Chutes API** (Kimi K2.5-TEE) as the default.
+High-performance autonomous agent for [Term Challenge](https://term.challenge). **Does NOT use term_sdk** - fully autonomous with Chutes API.
 
-## Quick Start
+## Installation
 
 ```bash
-# 1. Install dependencies
+# Via pyproject.toml
+pip install .
+
+# Via requirements.txt
 pip install -r requirements.txt
+```
 
-# 2. Configure Chutes API (default provider)
-export CHUTES_API_TOKEN="your-token-from-chutes.ai"
+## Usage
 
-# 3. Run the agent
-python3 agent.py --instruction "Your task description here..."
+```bash
+python agent.py --instruction "Your task here..."
 ```
 
-### Alternative: OpenRouter
+The agent receives the instruction via `--instruction` and executes the task autonomously.
 
-```bash
-export LLM_PROVIDER="openrouter"
-export OPENROUTER_API_KEY="your-openrouter-key"
-python3 agent.py --instruction "Your task description here..."
+## Mandatory Architecture
+
+> **IMPORTANT**: Agents MUST follow these rules to work correctly.
+
+### 1. Project Structure (MANDATORY)
+
+Agents **MUST** be structured projects, NOT single files:
+
+```
+my-agent/
+├── agent.py              # Entry point with --instruction
+├── src/                  # Modules
+│   ├── core/
+│   │   ├── loop.py       # Main loop
+│   │   └── compaction.py # Context management (MANDATORY)
+│   ├── llm/
+│   │   └── client.py     # LLM client (Chutes API)
+│   └── tools/
+│       └── ...           # Available tools
+├── requirements.txt      # Dependencies
+└── pyproject.toml        # Project config
 ```
 
-## Documentation
+### 2. Session Management (MANDATORY)
+
+Agents **MUST** maintain complete conversation history:
 
-📚 **Full documentation available in [docs/](docs/)**
-
-### Getting Started
-- [Overview](docs/overview.md) - What is BaseAgent
-- [Installation](docs/installation.md) - Setup instructions
-- [Quick Start](docs/quickstart.md) - First task in 5 minutes
-
-### Core Concepts
-- [Architecture](docs/architecture.md) - Technical deep-dive with diagrams
-- [Configuration](docs/configuration.md) - All settings explained
-- [Usage Guide](docs/usage.md) - CLI commands and examples
-
-### Reference
-- [Tools Reference](docs/tools.md) - Available tools
-- [Context Management](docs/context-management.md) - Token optimization
-- [Best Practices](docs/best-practices.md) - Performance tips
-
-### LLM Providers
-- [Chutes Integration](docs/chutes-integration.md) - **Default provider setup**
-
-## Architecture Overview
-
-```mermaid
-graph TB
-    subgraph User
-        CLI["python3 agent.py --instruction"]
-    end
-    
-    subgraph Core
-        Loop["Agent Loop"]
-        Context["Context Manager"]
-    end
-    
-    subgraph LLM
-        Chutes["Chutes API (Kimi K2.5)"]
-        OpenRouter["OpenRouter (fallback)"]
-    end
-    
-    subgraph Tools
-        Shell["shell_command"]
-        Files["read/write_file"]
-        Search["grep_files"]
-    end
-    
-    CLI --> Loop
-    Loop --> Context
-    Loop -->|default| Chutes
-    Loop -->|fallback| OpenRouter
-    Loop --> Tools
+```python
+messages = [
+    {"role": "system", "content": system_prompt},
+    {"role": "user", "content": instruction},
+]
+
+# Add each exchange (tool_call_id is the id of the tool call being answered)
+messages.append({"role": "assistant", "content": response})
+messages.append({"role": "tool", "tool_call_id": tool_call_id, "content": result})
 ```
 
-## Key Features
+### 3. Context Compaction (MANDATORY)
 
-| Feature | Description |
-|---------|-------------|
-| **Fully Autonomous** | No user confirmation needed |
-| **LLM-Driven** | All decisions made by the language model |
-| **Chutes API** | Default: Kimi K2.5-TEE (256K context, thinking mode) |
-| **Prompt Caching** | 90%+ cache hit rate |
-| **Context Management** | Intelligent pruning and compaction |
-| **Self-Verification** | Automatic validation before completion |
+Compaction is **CRITICAL** for:
+- Avoiding "context too long" errors
+- Preserving critical information
+- Enabling complex multi-step tasks
+- Improving response coherence
 
-## Environment Variables
+```python
+# Recommended threshold: 85% of context window
+AUTO_COMPACT_THRESHOLD = 0.85
 
-| Variable | Required | Default | Description |
-|----------|----------|---------|-------------|
-| `CHUTES_API_TOKEN` | Yes* | - | Chutes API token |
-| `LLM_PROVIDER` | No | `chutes` | `chutes` or `openrouter` |
-| `LLM_MODEL` | No | `moonshotai/Kimi-K2.5-TEE` | Model identifier |
-| `LLM_COST_LIMIT` | No | `10.0` | Max cost in USD |
-| `OPENROUTER_API_KEY` | For OpenRouter | - | OpenRouter API key |
+# 2-step strategy:
+# 1. Pruning: Remove old tool outputs
+# 2. AI Compaction: Summarize conversation if pruning insufficient
+```
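
The two-step strategy in the comments above might look roughly like the following. This is a sketch under stated assumptions: the 4-characters-per-token estimate, the reading of `prune_protect` as "tokens of recent tool output to keep", the `[output pruned]` placeholder, and every function name are hypothetical; `summarize` stands in for an LLM call.

```python
AUTO_COMPACT_THRESHOLD = 0.85  # value from the README
PRUNE_PROTECT = 40_000         # README config value; its exact meaning is assumed here


def estimate_tokens(text):
    """Rough heuristic (assumption): about 4 characters per token."""
    return len(text or "") // 4


def total_tokens(messages):
    return sum(estimate_tokens(m.get("content")) for m in messages)


def needs_compaction(messages, context_window):
    return total_tokens(messages) > AUTO_COMPACT_THRESHOLD * context_window


def prune_tool_outputs(messages, protect=PRUNE_PROTECT):
    """Step 1: replace old tool outputs, keeping the newest `protect` tokens of them."""
    pruned, seen = [], 0
    for msg in reversed(messages):  # walk from newest to oldest
        if msg["role"] == "tool":
            seen += estimate_tokens(msg["content"])
            if seen > protect:  # budget exhausted: this and all older outputs go
                msg = {**msg, "content": "[output pruned]"}
        pruned.append(msg)
    pruned.reverse()
    return pruned


def compact(messages, context_window, summarize, protect=PRUNE_PROTECT):
    """Prune first; call `summarize` only if pruning was not enough (step 2)."""
    if not needs_compaction(messages, context_window):
        return messages
    messages = prune_tool_outputs(messages, protect)
    if not needs_compaction(messages, context_window):
        return messages
    system, rest = messages[0], messages[1:]
    # Assumption: the summary replaces everything after the system prompt.
    return [system, {"role": "user", "content": summarize(rest)}]
```

Pruning runs first because it is cheap and lossless for the conversation itself; the summarization step costs an extra model call and discards detail, so it is kept as a fallback.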
+
+## Features
+
+### LLM Client (Chutes API)
 
-*\*Required for default Chutes provider*
+```python
+from src.llm.client import LLMClient
 
-## Project Structure
+llm = LLMClient(
+    model="deepseek/deepseek-chat",
+    temperature=0.0,
+    max_tokens=16384,
+)
 
+response = llm.chat(messages, tools=tool_specs)
 ```
-baseagent/
-├── agent.py                 # Entry point
-├── src/
-│   ├── core/
-│   │   ├── loop.py          # Main agent loop
-│   │   └── compaction.py    # Context management
-│   ├── llm/
-│   │   └── client.py        # LLM client
-│   ├── config/
-│   │   └── defaults.py      # Configuration
-│   ├── tools/               # Tool implementations
-│   └── prompts/             # System prompt
-├── docs/                    # 📚 Full documentation
-├── rules/                   # Development guidelines
-└── astuces/                 # Implementation techniques
+
+### Prompt Caching
+
+Caches the system prompt and recent messages to reduce costs:
+- Cache hit rate: **90%+** on long conversations
+- Significant API cost reduction
+
+### Self-Verification
+
+Before completing, the agent automatically:
+1. Re-reads the original instruction
+2. Verifies each requirement
+3. Only confirms completion if everything is validated
+
+### Context Management
+
+- **Token-based overflow detection** (not message count)
+- **Tool output pruning** (removes old outputs)
+- **AI compaction** (summarizes if needed)
+- **Middle-out truncation** for large outputs
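
Middle-out truncation, the last item in the list above, can be illustrated as follows. This is a minimal sketch that counts characters rather than tokens for simplicity; the `truncate_middle` name, the default limit, and the marker text are assumptions, not the repository's implementation.

```python
def truncate_middle(text, max_chars=10_000, marker="\n[... {n} characters truncated ...]\n"):
    """Keep the head and tail of `text` and drop the middle if it exceeds `max_chars`."""
    if len(text) <= max_chars:
        return text
    head = max_chars // 2
    tail = max_chars - head
    omitted = len(text) - max_chars
    # Slice from an explicit index so that tail == 0 yields an empty string.
    return text[:head] + marker.format(n=omitted) + text[len(text) - tail:]
```

Keeping both ends is useful for command output, where the invocation context tends to sit at the top and the error or final status at the bottom.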
+
+## Available Tools
+
+| Tool | Description |
+|------|-------------|
+| `shell_command` | Execute shell commands |
+| `read_file` | Read files with pagination |
+| `write_file` | Create/overwrite files |
+| `apply_patch` | Apply patches |
+| `grep_files` | Search with ripgrep |
+| `list_dir` | List directories |
+| `view_image` | Analyze images |
+
+## Configuration
+
+See `src/config/defaults.py`:
+
+```python
+CONFIG = {
+    "model": "deepseek/deepseek-chat",
+    "max_tokens": 16384,
+    "max_iterations": 200,
+    "auto_compact_threshold": 0.85,
+    "prune_protect": 40_000,
+    "cache_enabled": True,
+}
 ```
 
-## Development Guidelines
+## Environment Variables
+
+| Variable | Description |
+|----------|-------------|
+| `CHUTES_API_KEY` | Chutes API key |
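
As a hedged illustration of how the variable in the table might be consumed, the sketch below fails fast with a clear message when the key is missing. The `load_chutes_api_key` helper and the error wording are hypothetical; only the variable name comes from the README.

```python
import os


def load_chutes_api_key(env=os.environ):
    """Return the Chutes API key, or raise a clear error if it is missing."""
    key = env.get("CHUTES_API_KEY", "").strip()
    if not key:
        raise RuntimeError("CHUTES_API_KEY is not set; export it before running agent.py")
    return key
```

Accepting the environment mapping as a parameter keeps the helper easy to test without touching the real process environment.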
+
+## Documentation
+
+### Rules - Development Guidelines
+
+See [rules/](rules/) for comprehensive guides:
+
+- [Architecture Patterns](rules/02-architecture-patterns.md) - **Mandatory project structure**
+- [LLM Usage Guide](rules/06-llm-usage-guide.md) - **Using Chutes API**
+- [Best Practices](rules/05-best-practices.md)
+- [Error Handling](rules/08-error-handling.md)
+
+### Tips - Practical Techniques
+
+See [astuces/](astuces/) for techniques:
 
-For agent developers, see:
-- [rules/](rules/) - Architecture patterns, best practices, anti-patterns
-- [astuces/](astuces/) - Practical techniques (caching, verification, etc.)
-- [AGENTS.md](AGENTS.md) - Comprehensive building guide
+- [Prompt Caching](astuces/01-prompt-caching.md)
+- [Context Management](astuces/03-context-management.md)
+- [Local Testing](astuces/09-local-testing.md)
 
 ## License