Auto-compaction triggers at 95% of model_context_window by default; in CLI v0.100.0+, a user-defined limit is silently clamped to 90%
Server-side compaction via the OpenAI Responses API; emits a compaction output item in the stream
After compaction, Codex re-reads up to 5 recently edited files (50k token budget, 5k per file)
Configurable via model_auto_compact_token_limit in config.toml
ChatGPT account sign-in (no manual API key needed)
Standalone /responses/compact endpoint available for stateless compaction
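The post-compaction re-read in the list above can be pictured as a budgeted selection. This is a minimal sketch, not Codex's actual implementation: the function name `select_files_to_reread`, the `(path, token_count)` input shape, and the choice to truncate oversized files to the per-file cap are all assumptions. Only the three limits come from the list.

```python
from typing import List, Tuple

MAX_FILES = 5           # files re-read after compaction (from the list above)
TOTAL_BUDGET = 50_000   # total token budget (from the list above)
PER_FILE_CAP = 5_000    # per-file token cap (from the list above)


def select_files_to_reread(recent: List[Tuple[str, int]]) -> List[Tuple[str, int]]:
    """Return (path, tokens_to_read) pairs for the re-read step.

    `recent` is ordered most recently edited first; each entry is
    (path, token_count). Truncating to the per-file cap is an assumption.
    """
    selected: List[Tuple[str, int]] = []
    remaining = TOTAL_BUDGET
    for path, tokens in recent[:MAX_FILES]:
        if remaining <= 0:
            break
        allowance = min(tokens, PER_FILE_CAP, remaining)
        if allowance > 0:
            selected.append((path, allowance))
            remaining -= allowance
    return selected
```

With these limits the per-file cap binds first: five files at 5k each use at most 25k of the 50k budget.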
Configuration Example
model = "gpt-5.5"
model_context_window = 400000
model_auto_compact_token_limit = 350000
tool_output_token_limit = 50000
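To see how these settings interact with the trigger rules, here is a hedged sketch of the effective compaction threshold. It assumes the 90% clamp is measured against model_context_window and that an unset limit falls back to the 95% default. The helper name `effective_compact_limit` is hypothetical and is not part of the Codex CLI.

```python
from typing import Optional

DEFAULT_PERCENT = 95  # default trigger point, as a percent of the context window
CLAMP_PERCENT = 90    # assumed ceiling for user-defined limits (CLI v0.100.0+)


def effective_compact_limit(context_window: int, user_limit: Optional[int] = None) -> int:
    """Token count at which auto-compaction would trigger under these assumptions."""
    if user_limit is None:
        return context_window * DEFAULT_PERCENT // 100
    # A user-defined limit above the ceiling is reduced to the ceiling.
    return min(user_limit, context_window * CLAMP_PERCENT // 100)
```

Under these assumptions the example configuration is unaffected by the clamp: 350000 is below 90% of 400000 (360000), so compaction would trigger at 350000 tokens.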
Deprecation Timeline
| Model | Shutdown Date | Replacement |
| --- | --- | --- |
| codex-mini-latest | Feb 12, 2026 (retired) | gpt-5.3-codex |
| o3-mini | Feb 23, 2026 (retired) | o3 (also deprecated) |
| gpt-5-codex | Jul 23, 2026 | gpt-5.3-codex |
| gpt-5.1-codex | Jul 23, 2026 | gpt-5.3-codex |
| gpt-5.2-codex | Jul 23, 2026 | gpt-5.3-codex |
| o3-deep-research | Jul 23, 2026 | gpt-5.4-pro |
| o4-mini | Oct 23, 2026 | gpt-5-mini |
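The timeline above lends itself to a small lookup for checking a pinned model before a shutdown date passes. This is an illustrative sketch: the `DEPRECATIONS` dictionary simply restates the table, and `check_model` is a hypothetical helper, not part of any OpenAI tooling.

```python
from datetime import date
from typing import Dict, Tuple

# (shutdown date, replacement) restated from the deprecation timeline above.
# Note: o3, listed as the replacement for o3-mini, is itself marked deprecated.
DEPRECATIONS: Dict[str, Tuple[date, str]] = {
    "codex-mini-latest": (date(2026, 2, 12), "gpt-5.3-codex"),
    "o3-mini": (date(2026, 2, 23), "o3"),
    "gpt-5-codex": (date(2026, 7, 23), "gpt-5.3-codex"),
    "gpt-5.1-codex": (date(2026, 7, 23), "gpt-5.3-codex"),
    "gpt-5.2-codex": (date(2026, 7, 23), "gpt-5.3-codex"),
    "o3-deep-research": (date(2026, 7, 23), "gpt-5.4-pro"),
    "o4-mini": (date(2026, 10, 23), "gpt-5-mini"),
}


def check_model(model: str, today: date) -> str:
    """Describe the deprecation status of `model` as of `today`."""
    entry = DEPRECATIONS.get(model)
    if entry is None:
        return f"{model}: not in the deprecation table"
    shutdown, replacement = entry
    if today >= shutdown:
        return f"{model}: retired on {shutdown.isoformat()}; switch to {replacement}"
    days_left = (shutdown - today).days
    return f"{model}: {days_left} days until shutdown; plan a move to {replacement}"
```

For example, `check_model("gpt-5.1-codex", date.today())` reports either the days remaining or the listed replacement.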
Cursor
Anthropic Models
| Model | Context Window | Max Output | Input $/M | Output $/M | Notes |
| --- | --- | --- | --- | --- | --- |
| Claude 4.7 Opus | 200k (1M Max) | 128k | $5.00 | $25.00 | Current flagship Anthropic |
| Claude 4.6 Opus | 200k (1M Max) | 128k | $5.00 | $25.00 | Current; deep reasoning |
| Claude 4.6 Sonnet | 200k (1M Max) | 64k | $3.00 | $15.00 | Current; 1M Max Mode |
| Claude 4.5 Opus | 200k (200k Max) | 64k | $5.00 | $25.00 | Legacy; Max Mode capped at 200k |
| Claude 4.5 Sonnet | 200k (1M Max) | 64k | $3.00 | $15.00 | Legacy (active) |
| Claude 4.5 Haiku | 200k (200k Max) | 64k | $1.00 | $5.00 | Hidden by default |
| Claude 4 Sonnet 1M | 200k (1M Max) | 64k | $3.00 | $15.00 | Hidden; 2x cost over 200k tokens |
| Claude 4 Sonnet | 200k (200k Max) | 64k | $3.00 | $15.00 | Hidden by default |
OpenAI Models
| Model | Context Window | Max Output | Input $/M | Output $/M | Notes |
| --- | --- | --- | --- | --- | --- |
| GPT-5.5 | 200k (1M Max) | 128k | TBD | TBD | Newest OpenAI in Cursor |
| GPT-5.4 | 200k (1M Max) | 128k | $2.50 | TBD | Briefly defaulted to Max Mode for legacy billing |
| GPT-5.4 Mini | 200k (1M Max) | 128k | TBD | TBD | Mini variant |
| GPT-5.4 Nano | 200k (1M Max) | 128k | TBD | TBD | Smallest variant |
| GPT-5.3 Codex | 272k (272k Max) | 128k | TBD | TBD | Coding-optimized |
| GPT-5.2 | 272k (272k Max) | 128k | $2.00 | $8.00 | Active |
| GPT-5.2 Codex | 272k (272k Max) | 128k | $1.75 | $14.00 | Active |
| GPT-5.1 Codex | 272k (272k Max) | 128k | $1.25 | $10.00 | Active |
| GPT-5 | 272k (272k Max) | 128k | $1.25 | $10.00 | Active |
| GPT-5 Mini | 200k (200k Max) | 128k | $0.40 | $1.60 | Active |
Google Models
| Model | Context Window | Max Output | Input $/M | Output $/M | Notes |
| --- | --- | --- | --- | --- | --- |
| Gemini 3.1 Pro | 200k (1M Max) | 64k | $2.00 | $12.00 | Current Google leader |
| Gemini 3 Pro | 200k (1M Max) | 64k | $2.00 | $12.00 | Current |
| Gemini 3 Pro Image Preview | 200k (1M Max) | 32k | $2.00 | $12.00 | Multimodal preview |
| Gemini 3 Flash | 200k (1M Max) | 64k | $0.50 | $3.00 | Fast inference |
| Gemini 2.5 Flash | 200k (1M Max) | 65k | $0.30 | $2.50 | Legacy (active) |
xAI Models
| Model | Context Window | Max Output | Input $/M | Output $/M | Notes |
| --- | --- | --- | --- | --- | --- |
| Grok 4.20 | 200k (2M Max) | — | $3.00 | $15.00 | Primary xAI model; replaces Grok Code |
| Grok Code Fast 1 | 256k | — | $0.60 | $4.00 | Active; may be hidden by default |
| Grok Code (original) | 256k | — | $2.00 | $10.00 | Removed from default list; re-add via custom model ID |
Cursor Native Models (Composer)
| Model | Context Window | Max Output | Input $/M | Output $/M | Notes |
| --- | --- | --- | --- | --- | --- |
| Composer 2 | 200k | — | $0.50 (Std) / $1.50 (Fast) | — | Default in Auto mode (Mar 19, 2026); built on Kimi K2.5 base |
| Composer 1.5 | 200k | — | N/A | N/A | Hidden by default; adds Thinking support |
| Composer 1 | 200k | — | N/A | N/A | First-gen; hidden by default |
Other Providers
| Model | Context Window | Max Output | Input $/M | Output $/M | Notes |
| --- | --- | --- | --- | --- | --- |
| Kimi K2.5 (Moonshot) | 200k (1M Max) | — | TBD | TBD | Active |
| DeepSeek V4 Pro | 1M (1M Max) | — | TBD | TBD | Custom model / API key required |
| DeepSeek V4 Flash | 1M (1M Max) | — | TBD | TBD | Custom model / API key required |
Note: Cursor uses subscription pricing (Pro / Business). Composer models have unlimited access on paid plans in Auto mode. Third-party models draw from the monthly credit pool. API prices shown are underlying provider rates, not what Cursor users pay directly.
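Because the tables quote prices per million tokens, a rough per-request figure at the underlying provider rate is a simple weighted sum. The sketch below is illustrative only: `request_cost_usd` is a hypothetical helper, and it ignores caching discounts, Max Mode surcharges, and Cursor's own credit accounting.

```python
def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_per_m: float,
    output_per_m: float,
) -> float:
    """Estimate provider-rate cost in USD from $/M token prices."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


# Example: Claude 4.6 Sonnet at $3.00 input / $15.00 output per million tokens,
# with a 150k-token prompt and a 4k-token reply.
cost = request_cost_usd(150_000, 4_000, 3.00, 15.00)
print(f"${cost:.2f}")  # prints "$0.51"
```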
Gemini CLI v0.40.1 (Apr 30, 2026) is current stable
Intelligent auto-routing: simple queries → Flash, complex tasks → Pro
Manual override: /model command or -m flag (e.g., gemini -m gemini-2.5-flash)
Gemma 4 experimental local-inference support added in v0.41.0-preview builds
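The routing behavior in the list above (simple queries to Flash, complex tasks to Pro, with a manual override taking precedence) can be illustrated with a toy router. This is not how Gemini CLI classifies prompts: the keyword list, the word-count threshold, and the `route` function are invented for illustration, and `gemini-2.5-pro` is assumed as the Pro-tier model ID.

```python
from typing import Optional

FLASH = "gemini-2.5-flash"  # model ID from the -m example above
PRO = "gemini-2.5-pro"      # assumed Pro-tier model ID

# Hypothetical signals that a task is complex; the real classifier is not documented here.
COMPLEX_HINTS = ("refactor", "debug", "architecture", "migrate", "design")
LONG_PROMPT_WORDS = 200


def route(prompt: str, override: Optional[str] = None) -> str:
    """Pick a model for `prompt`; an explicit override always wins."""
    if override is not None:
        return override
    text = prompt.lower()
    is_long = len(text.split()) > LONG_PROMPT_WORDS
    has_hint = any(hint in text for hint in COMPLEX_HINTS)
    return PRO if (is_long or has_hint) else FLASH
```

The override parameter plays the role of the /model command or -m flag: when it is set, the heuristic is skipped entirely.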
Deprecation Timeline
| Model | Shutdown Date |
| --- | --- |
| gemini-3-pro-preview (original) | March 26, 2026 (discontinued) |
| gemini-2.5-flash-lite-preview-09-2025 | March 31, 2026 (discontinued) |
| gemini-robotics-er-1.5-preview | April 30, 2026 (shut down) |
| Gemini 2.0 Flash | June 1, 2026 (EOL) |
| Gemini 2.0 Flash-Lite | June 1, 2026 (EOL) |
| Gemini 2.5 Pro | Not before October 16, 2026 |
| Gemini 2.5 Flash | Not before October 16, 2026 |
Retired Models
| Model | Context Window | Retirement Date |
| --- | --- | --- |
| text-embedding-004 | — | January 14, 2026 |
| Gemini 1.5 Pro | 2M | Sept 24, 2025 |
| Gemini 1.5 Flash | 1M | Sept 24, 2025 |
Note: The 2M token context window is no longer available in any active Gemini model. Maximum context is now 1M tokens for all current-gen Gemini Pro and Flash models.