
Commit 7b71801

docs: update README to reflect completed advanced features and image generation
## Changes

### Core feature updates
- ✅ Added image generation feature description (T2I and I2I)
- ✅ Added advanced testing mode feature description
- ✅ Promoted these features from "Preview (Beta)" to official features

### Advanced features section restructure
**Before**: "Advanced Features Preview (Beta)" - marked as in development
**After**: "Advanced Features" - officially released

#### Image generation mode
- Text-to-Image (T2I): generate images from text prompts
- Image-to-Image (I2I): transform and refine a local image
- Multi-model support: Gemini, Seedream, etc.
- Model parameters: model-specific parameter configuration (dimensions, style, etc.)
- Preview and download: preview generated results in real time, with download and save support

#### Advanced testing mode
- Context variable management: custom variables, batch replacement
- Multi-turn conversation testing: simulate real dialogue scenarios
- Tool call support: Function Calling integration
- Flexible debugging capabilities

### Roadmap updates
- ✅ Advanced mode: variable management, context testing, tool calls
- ✅ Image generation: Text-to-Image (T2I) and Image-to-Image (I2I) support
- ❌ Removed "support image input and multimodal processing" (completed)

## Documentation sync
- README.md (Chinese version)
- README_EN.md (English version)
- Added image mode documentation link

Related document: docs/image-mode.md
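The image-generation options listed in this commit message (T2I vs. I2I, 1-4 images per request, custom dimensions, model-specific parameters) can be pictured as request payloads. The sketch below is purely illustrative: the field names (`prompt`, `count`, `size`, `input_image`) are assumptions for this example, not the project's actual schema.

```python
# Purely illustrative payload shapes for the two image modes described in
# this commit; all field names are hypothetical, not the project's real schema.

def build_t2i_request(prompt: str, count: int = 1, size: str = "1024x1024") -> dict:
    """Text-to-Image: a text prompt plus generation parameters."""
    if not 1 <= count <= 4:  # the README states 1-4 images per request
        raise ValueError("count must be between 1 and 4")
    return {"mode": "t2i", "prompt": prompt, "count": count, "size": size}

def build_i2i_request(prompt: str, input_image: bytes, **params) -> dict:
    """Image-to-Image: same parameters as T2I, seeded with a local source image."""
    req = build_t2i_request(prompt, **params)
    req.update({"mode": "i2i", "input_image": input_image})
    return req

t2i = build_t2i_request("a watercolor fox", count=2)
i2i = build_i2i_request("make it look like winter", b"<png bytes>")
```

The point of the shape is that I2I is a strict superset of T2I: the same generation parameters plus a source image, which matches how the two modes are described above.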
1 parent ad15374 commit 7b71801

File tree

2 files changed (+41 −31 lines)


README.md

Lines changed: 21 additions & 16 deletions
@@ -46,20 +46,28 @@ Prompt Optimizer is a powerful AI prompt optimization tool that helps you write
 - 📝 **Dual-mode optimization**: supports both system prompt optimization and user prompt optimization for different usage scenarios
 - 🔄 **Comparison testing**: real-time comparison of the original and optimized prompts to show the optimization effect at a glance
 - 🤖 **Multi-model integration**: supports mainstream AI models such as OpenAI, Gemini, DeepSeek, Zhipu AI, and SiliconFlow
+- 🖼️ **Image generation**: Text-to-Image (T2I) and Image-to-Image (I2I), integrating image models such as Gemini and Seedream
+- 📊 **Advanced testing mode**: context variable management, multi-turn conversation testing, tool call (Function Calling) support
 - 🔒 **Secure architecture**: pure client-side processing; data is exchanged directly with the AI provider and never passes through an intermediate server
 - 📱 **Multi-platform support**: available as a web app, desktop app, Chrome extension, and Docker deployment
 - 🔐 **Access control**: password protection for secure deployment
 - 🧩 **MCP protocol support**: supports the Model Context Protocol (MCP) for integration with MCP-compatible apps such as Claude Desktop

-## 🚀 Advanced Features Preview (Beta)
+## 🚀 Advanced Features

-> **Preview environment**: [https://prompt-dev.always200.com](https://prompt-dev.always200.com) | Try the new features and share your feedback
+### Image Generation Mode
+- 🖼️ **Text-to-Image (T2I)**: generate images from text prompts
+- 🎨 **Image-to-Image (I2I)**: transform and refine images based on a local picture
+- 📐 **Flexible configuration**: generate 1-4 images with custom dimensions and parameters
+- 🔌 **Multi-model support**: integrates mainstream image generation models such as Gemini and Seedream

-- 📊 **Context variable management**: custom variables, multi-turn conversation testing, variable replacement preview
+### Advanced Testing Mode
+- 📊 **Context variable management**: custom variables, batch replacement, variable preview
+- 💬 **Multi-turn conversation testing**: simulate real conversation scenarios and test how prompts perform across turns
 - 🛠️ **Tool call support**: Function Calling integration for OpenAI and Gemini tool calling
-- 🎯 **Advanced testing mode**: more flexible prompt testing and debugging capabilities
+- 🎯 **Flexible debugging**: more powerful prompt testing and debugging capabilities

-*Note: advanced features are still under development and will be officially integrated into the main version in a future release*
+See the [Image Mode documentation](docs/image-mode.md) for detailed usage instructions

 ## Quick Start

@@ -310,7 +318,7 @@ pnpm dev:fresh # Full reset and restart of the development environment
 - [x] Desktop application release
 - [x] MCP service release
 - [x] Advanced mode: variable management, context testing, tool calls
-- [ ] Support image input and multimodal processing
+- [x] Image generation: Text-to-Image (T2I) and Image-to-Image (I2I) support
 - [ ] Support workspace/project management
 - [ ] Support prompt favorites and template management

@@ -361,25 +369,22 @@ pnpm dev:fresh # Full reset and restart of the development environment
    - Provides the most complete and stable feature experience
    - Download from [GitHub Releases](https://github.com/linshenkx/prompt-optimizer/releases)

-2. **Use Docker deployment** (server-side solution)
-   - Docker deployment runs on the server side, with no browser CORS restrictions
-   - Works in internal network environments; data never leaves the intranet
-   - Request flow: Docker container → model service provider
-
-3. **Use a self-deployed API proxy service** (professional solution)
+2. **Use a self-deployed API proxy service** (professional solution)
    - Deploy an open-source API aggregation/proxy tool such as OneAPI or NewAPI
    - Configure it as a custom API endpoint in settings
    - Request flow: browser → proxy service → model service provider
    - Full control over security policies and access permissions

+**Note**: all web versions (including the online version, Vercel deployment, and Docker deployment) are pure frontend applications and are subject to browser CORS restrictions. Only the desktop version, or an API proxy service, can solve the cross-origin problem.
+
 #### Q4: I have correctly configured the CORS policy for my local model (such as Ollama); why can't the online version connect?
 **A**: This is caused by the browser's **Mixed Content** security policy. For security reasons, browsers block a secure HTTPS page (such as the online version) from sending requests to an insecure HTTP address (such as your local Ollama service).

 **Solutions**:
-To bypass this limitation, the app and the API need to be on the same protocol (e.g., both HTTP). Several approaches are recommended
-1. **Use the desktop version**: desktop apps have no browser restrictions and are the most stable and reliable way to connect to local models
-2. **Docker deployment**: Docker deployment is also HTTP
-3. **Use the Chrome extension**: the extension can also bypass some security restrictions in certain situations
+To bypass this limitation, the app and the API need to be on the same protocol (e.g., both HTTP). The following approaches are recommended:
+1. **Use the desktop version**: desktop apps have no browser restrictions and are the most stable and reliable way to connect to local models
+2. **Use Docker deployment (HTTP)**: access via `http://localhost:8081`, so both the app and local Ollama use HTTP
+3. **Use the Chrome extension**: the extension can also bypass some security restrictions in certain situations

 </details>
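The "Function Calling integration for OpenAI and Gemini" mentioned in the diff above refers to tool definitions passed alongside the prompt. As a minimal sketch, this is the OpenAI-style tool definition shape; the `get_weather` tool itself is a hypothetical example, not something from this project.

```python
# Minimal OpenAI-style tool definition of the kind the "tool call support"
# bullet refers to; `get_weather` is a hypothetical example tool.
get_weather_tool = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {  # JSON Schema describing the function's arguments
            "type": "object",
            "properties": {
                "city": {"type": "string", "description": "City name"},
            },
            "required": ["city"],
        },
    },
}

# Such definitions are passed as `tools=[get_weather_tool]` in an OpenAI
# chat completion request; Gemini uses an equivalent structure based on
# function declarations, which is why the feature covers both providers.
```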

README_EN.md

Lines changed: 20 additions & 15 deletions
@@ -46,20 +46,28 @@ Prompt Optimizer is a powerful AI prompt optimization tool that helps you write
 - 📝 **Dual Mode Optimization**: Support for both system prompt optimization and user prompt optimization to meet different usage scenarios
 - 🔄 **Comparison Testing**: Real-time comparison between original and optimized prompts for intuitive demonstration of optimization effects
 - 🤖 **Multi-model Integration**: Support for mainstream AI models including OpenAI, Gemini, DeepSeek, Zhipu AI, SiliconFlow, etc.
+- 🖼️ **Image Generation**: Support for Text-to-Image (T2I) and Image-to-Image (I2I) with models like Gemini, Seedream
+- 📊 **Advanced Testing Mode**: Context variable management, multi-turn conversation testing, Function Calling support
 - 🔒 **Secure Architecture**: Pure client-side processing with direct data interaction with AI service providers, bypassing intermediate servers
 - 📱 **Multi-platform Support**: Available as web application, desktop application, Chrome extension, and Docker deployment
 - 🔐 **Access Control**: Password protection feature for secure deployment
 - 🧩 **MCP Protocol Support**: Supports Model Context Protocol (MCP), enabling integration with MCP-compatible AI applications like Claude Desktop

-## 🚀 Advanced Features Preview (Beta)
+## 🚀 Advanced Features

-> **Preview Environment**: [https://prompt-dev.always200.com](https://prompt-dev.always200.com) | Experience new features and provide feedback
+### Image Generation Mode
+- 🖼️ **Text-to-Image (T2I)**: Generate images from text prompts
+- 🎨 **Image-to-Image (I2I)**: Transform and optimize images based on local files
+- 📐 **Flexible Configuration**: Generate 1-4 images with customizable dimensions and parameters
+- 🔌 **Multi-model Support**: Integrated with mainstream image generation models like Gemini, Seedream

-- 📊 **Context Variable Management**: Custom variables, multi-turn conversation testing, variable replacement preview
+### Advanced Testing Mode
+- 📊 **Context Variable Management**: Custom variables, batch replacement, variable preview
+- 💬 **Multi-turn Conversation Testing**: Simulate real conversation scenarios to test prompt performance in multi-turn interactions
 - 🛠️ **Function Calling Support**: Function Calling integration with support for OpenAI and Gemini tool calling
-- 🎯 **Advanced Testing Mode**: More flexible prompt testing and debugging capabilities
+- 🎯 **Flexible Debugging**: Enhanced prompt testing and debugging capabilities

-*Note: Advanced features are currently in development and will be officially integrated into the main version in future releases*
+For detailed usage instructions, please refer to the [Image Mode Documentation](docs/image-mode.md)

 ## Quick Start

@@ -313,7 +321,7 @@ pnpm dev:fresh # Complete reset and restart development environment
 - [x] Desktop application release
 - [x] MCP service release
 - [x] Advanced mode: Variable management, context testing, function calling
-- [ ] Support for image input and multimodal processing
+- [x] Image generation: Text-to-Image (T2I) and Image-to-Image (I2I) support
 - [ ] Support for workspace/project management
 - [ ] Support for prompt favorites and template management

@@ -363,25 +371,22 @@ For detailed project status, see [Project Status Document](docs/project-status.m
    - Provides the most complete and stable feature experience
    - Download from [GitHub Releases](https://github.com/linshenkx/prompt-optimizer/releases)

-2. **Use Docker Deployment** (Server-side solution)
-   - Docker deployment runs on the server side with no browser CORS restrictions
-   - Supports internal network environments, data stays within your network
-   - Request flow: Docker container → Model service provider
-
-3. **Use Self-deployed API Proxy Service** (Professional solution)
+2. **Use Self-deployed API Proxy Service** (Professional solution)
    - Deploy open-source API aggregation/proxy tools like OneAPI, NewAPI
    - Configure as custom API endpoint in settings
    - Request flow: Browser → Proxy service → Model service provider
    - Full control over security policies and access permissions

+**Note**: All web versions (including online version, Vercel deployment, Docker deployment) are pure frontend applications and subject to browser CORS restrictions. Only the desktop version or using an API proxy service can solve CORS issues.
+
 #### Q4: I have correctly configured CORS policies for my local model (like Ollama), why can't I still connect using the online version?
 **A**: This is caused by the browser's **Mixed Content** security policy. For security reasons, browsers block secure HTTPS pages (like the online version) from sending requests to insecure HTTP addresses (like your local Ollama service).

 **Solutions**:
 To bypass this limitation, you need to have the application and API under the same protocol (e.g., both HTTP). We recommend the following approaches:
-1. **Use the desktop version**: Desktop applications have no browser restrictions and are the most stable and reliable way to connect to local models.
-2. **Docker deployment**: Docker deployment also uses HTTP
-3. **Use Chrome extension**: Extensions can bypass some security restrictions in certain situations.
+1. **Use the desktop version**: Desktop applications have no browser restrictions and are the most stable and reliable way to connect to local models
+2. **Use Docker deployment (HTTP)**: Access via `http://localhost:8081`, both the app and local Ollama use HTTP
+3. **Use Chrome extension**: Extensions can bypass some security restrictions in certain situations

 </details>
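The Docker-over-HTTP workaround in Q4 can be sketched as a deployment fragment. The image name and internal port below are assumptions inferred from the repository name and the `http://localhost:8081` address mentioned in the diff, not verified against the project's own deployment docs.

```shell
# Run the web app locally over plain HTTP so its protocol matches a local
# Ollama endpoint (same protocol on both sides avoids the Mixed Content block).
# Assumption: the published image is linshenkx/prompt-optimizer serving on port 80.
docker run -d --name prompt-optimizer -p 8081:80 linshenkx/prompt-optimizer

# App:    http://localhost:8081   (HTTP, not HTTPS)
# Ollama: http://localhost:11434  (Ollama's default port; same protocol,
#                                  so the browser permits the request)
```

This only addresses the Mixed Content issue; CORS on the Ollama side still needs to be configured as described in the question.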
