OpenAI recently introduced OAuth authentication for ChatGPT subscriptions, allowing tools like Cline to use your existing subscription instead of managing API keys. Here are the key restrictions: cline
The main restriction is rate limiting tied to your ChatGPT subscription tier: reddit
- Plus users: Around 300–1,500 local messages every 5 hours, or 50–400 cloud tasks depending on complexity
- Pro users: Higher limits but still capped with 5-hour and weekly limits
- Usage counts against your overall ChatGPT subscription quota, not separate API credits github
Many users report hitting limits quickly, especially with intensive agentic coding workflows. community.openai
The models available depend on your subscription tier :
- gpt-5.2-codex: Optimized for agentic coding
- gpt-5.2: General-purpose with strong reasoning
- gpt-5-mini: Fast and cost-effective
- o3 / o4-mini: Advanced and everyday reasoning tasks
| Aspect | OAuth (ChatGPT Subscription) | API Keys |
|---|---|---|
| Pricing | Flat-rate (subscription) cline | Pay-per-token reddit |
| Limits | Time-based (5h/weekly caps) github | Token/request limits platform.openai |
| Setup | One-click sign-in | Manual key management |
| Credential exposure | Tokens handled automatically cline | Risk of accidental commits |
- Shared quota: Usage in Cline draws from the same pool as your ChatGPT web/app usage github
- No separate budget: You cannot add API credits to bypass OAuth limits—they're tied strictly to your subscription tier github
- Security: Your OpenAI credentials never leave OpenAI's servers; Cline only receives access tokens cline
For heavy coding use, some users find the 5-hour and weekly caps frustrating and prefer third-party providers or direct API access for more flexibility. community.openai
That's likely a significant factor. Claude Code CLI uses intelligent model routing to minimize costs, while third-party tools like Cline typically send everything to the most capable (and expensive) model.
How Claude Code CLI Optimizes Costs
The official Claude Code CLI implements a multi-tier system:claudelog+1
Haiku runs 4-5x faster than Sonnet with sub-200ms latency, making it ideal for high-frequency lightweight tasks. The CLI analyzes query complexity first, then routes to the most cost-effective model.notchrisgroves+1[youtube]
Why This Matters for Anthropic's Subscription
The economics are stark:news.ycombinator+1
A heavy Claude Code user might consume $1,000+ worth of API tokens per month
Anthropic charges $200/month for the Max subscription
This only works if most requests go to cheap models like Haiku
When third-party tools bypass this routing and send everything to Sonnet or Opus, Anthropic loses money on every request. As one Hacker News commenter noted: "By treating Anthropic models as a commodity within a neutral harness, you empower users at the cost of Anthropic".[reddit]
The Cost Difference Is Massive
Users who've implemented their own routing report 60-80% cost reductions compared to using expensive models for everything. Without routing:reddit+1
90% of requests that could use cheap models instead hit premium models
Simple tasks like variable renaming or code comments use the same compute as architectural decisions
The "all-you-can-eat" subscription becomes financially unsustainable for Anthropic
So yes—Cline always hitting Sonnet/Opus while Claude Code intelligently routes most tasks to Haiku likely made third-party usage far more expensive for Anthropic to subsidize, contributing to the crackdown.[news.ycombinator]
How Claude Code CLI Optimizes Costs
The official Claude Code CLI implements a multi-tier system: claudelog
Haiku runs 4-5x faster than Sonnet with sub-200ms latency, making it ideal for high-frequency lightweight tasks. The CLI analyzes query complexity first, then routes to the most cost-effective model. notchrisgroves
Why This Matters for Anthropic's Subscription
The economics are stark: news.ycombinator
When third-party tools bypass this routing and send everything to Sonnet or Opus, Anthropic loses money on every request. As one Hacker News commenter noted: "By treating Anthropic models as a commodity within a neutral harness, you empower users at the cost of Anthropic". [reddit](https://www.reddit.com/r/ClaudeCode/comments/1qa4h1q/crazy_to_see_openai_step_up_since_anthropic_has/)
The Cost Difference Is Massive
Users who've implemented their own routing report 60-80% cost reductions compared to using expensive models for everything. Without routing: reddit
So yes—Cline always hitting Sonnet/Opus while Claude Code intelligently routes most tasks to Haiku likely made third-party usage far more expensive for Anthropic to subsidize, contributing to the crackdown. [news.ycombinator](https://news.ycombinator.com/item?id=46549823)