Inference Platforms
The Problem with Direct Subscriptions
Section titled “The Problem with Direct Subscriptions”If you sign up directly with OpenAI, Anthropic, or Google, you’re locked into their ecosystem:
“Don’t sign up for Google AI stuff or ChatGPT AI stuff or Anthropic AI stuff. Because AI services are going to be buying on the open market for a very low price.”
Direct subscriptions mean:
- You can only use their models
- You pay their prices (often higher)
- Switching costs are high
- You miss out on competition
Open Router: The Aggregator Approach
Section titled “Open Router: The Aggregator Approach”Open Router provides access to 635+ models through a single API:
# Same API, any modelcurl https://openrouter.ai/api/v1/chat/completions \ -H "Authorization: Bearer $OPENROUTER_API_KEY" \ -d '{ "model": "minimax/minimax-01", "messages": [{"role": "user", "content": "Hello"}] }'Why This Matters
Section titled “Why This Matters”- Single billing - One account, all models
- Easy switching - Change model name, that’s it
- Transparent pricing - 5% markup, that’s all
- Competition - Route to cheapest provider automatically
Observability (Optional)
Section titled “Observability (Optional)”Open Router can integrate with external observability tools (request logging/monitoring) if you need it.
“You can connect external observability platforms into Open Router.”
This is useful when you operate a shared team setup and want auditing, cost monitoring, or debugging across many models.
Price Comparison Tool
Section titled “Price Comparison Tool”Use models.dev to compare pricing across models. It’s maintained by the Open Router team and shows real-time prices.
Free Model Promotions
Section titled “Free Model Promotions”New models often launch with free tiers to gather training data:
“New models, as they come out, they need more training data… they give it to inference platforms for one or two weeks for free.”
This means you can often use cutting-edge models for free if you’re willing to experiment.
Some interfaces also run their own free promotions via their preferred inference backends. For example, Open Code has an inference option called Open Code Zen.
Anonymous Models
Section titled “Anonymous Models”Some platforms offer “anonymous” models:
“Big Pickle is a name for an unknown model. If you’re feeling adventurous, you can try it, and then maybe next week it’s going to turn out it was Facebook’s newest open source model.”
These are often pre-release models being tested. Fun for experimentation, not for production.
Recommendation
Section titled “Recommendation”- Get an Open Router account - It’s the most flexible option
- Set a budget - Easy to track spending
- Default to cheap models - MiniMax, Kimi for daily work
- Route to expensive models only for complex tasks
Terminology
Section titled “Terminology”You will hear a few overlapping words:
- Inference platform/provider: runs models and serves an API (OpenAI, Anthropic, Open Router providers, etc.)
- Router/aggregator: a layer that gives you one API and routes to many providers (Open Router)
- Harness: industry slang for the UI/tool that wraps chat + tool calls (see the interfaces page)