提升 Codex 响应速度

这一篇用 6 分钟换什么：把"Codex 慢了怎么办"从"换模型"重新理解成4 步排查——先压上下文，再关无关 MCP，再换小模型，最后才开 Fast mode。读完后你不会一上来就用 credits 换延迟。

Codex 的 speed（速度）不是只有"换更快模型"一种方式。官方文档里有两个概念需要分清：

Fast mode 快速模式

Codex 支持用更多 credits 换取更快模型速度。

Fast mode 会把支持模型的速度提升到 1.5x，同时比 Standard mode（标准模式）消耗更多 credits。

当前 Fast mode 支持：

Model	Speed	Credit consumption
GPT-5.5	1.5x	2.5x Standard rate
GPT-5.4	1.5x	2x Standard rate

在 CLI 中可以用下面三个命令切换或查看状态：

/fast on
/fast off
/fast status

如果你希望默认启用 Fast mode，可以在 config.toml 中持久化配置：

service_tier = "fast"

[features]
fast_mode = true

Fast mode 可用于：

前提是你使用 ChatGPT 登录。

如果你使用 API key，Codex 会走 standard API pricing（标准 API 价格），不能使用 Fast mode credits。

官方示例视频：

GPT-5.3-Codex-Spark 是一个独立的 Codex 模型。它速度更快、能力更轻，目标是 near-instant, real-time coding iteration（近乎即时的实时编程迭代）。

它和 Fast mode 的区别是：

项	Fast mode	GPT-5.3-Codex-Spark
本质	给支持的模型加速	一个独立模型
代价	按更高倍率消耗 credits	使用自己的 usage limits
适合	你仍想用 GPT-5.5 或 GPT-5.4，但希望更快	日常快速 coding iteration
可用范围	ChatGPT 登录下的 IDE extension、CLI、app	research preview 阶段仅 ChatGPT Pro

官方说明：在 research preview 阶段，Codex-Spark 只面向 ChatGPT Pro subscribers（ChatGPT Pro 订阅用户）开放。

很多“慢”不是模型慢，而是上下文太重、任务太宽、工具太多。官方 Pricing 页也明确建议用这些方式延长 usage limits：

优先级建议：

Fast mode 是“用 credits 换延迟”，不是质量优化。任务本身不清楚时，开 Fast mode 只会更快地消耗额度。

场景	推荐
高风险重构但希望快一点	`gpt-5.5` 或 `gpt-5.4` + Fast mode。
日常小改、解释、轻量扫描	`gpt-5.4-mini`。
快速 text-only coding iteration	有资格时试 `gpt-5.3-codex-spark`。
CI 或 shared automation	API key + standard API pricing，不能用 Fast mode credits。
多 agent 扫描	explorer 用小模型，主 agent 用强模型汇总。