Models
55+ models from 12+ providers, including a free tier.
Available Providers
Franklin gives you access to 55+ models from 12+ providers through a single interface. No API keys needed — everything is routed through BlockRun's unified gateway. You pay provider cost + 5%, settled instantly in USDC via x402.
- Anthropic — Claude Opus 4.6, Claude Opus 4.5, Claude Sonnet 4.6, Claude Haiku 4.5
- OpenAI — GPT-5.4, GPT-5.4 Pro, GPT-5.3, o1, o3, GPT-5-mini, GPT-5-nano
- Google — Gemini 3.1 Pro, Gemini 3 Flash Preview, Gemini 2.5 Pro, Gemini 2.5 Flash
- xAI — Grok-3, Grok-3 Fast
- DeepSeek — DeepSeek Chat V3.2, DeepSeek Reasoner
- Z.AI — GLM-5, GLM-5 Turbo
- Moonshot — Kimi K2.5 (262K context, MoE)
- MiniMax — MiniMax M2.7 (204K context, reasoning)
- NVIDIA — GPT-OSS 120B/20B, Nemotron, Llama, Qwen (free)
List All Models
See the full list of available models and their current status:
franklin modelsThe output shows each model's name, provider, tier, and cost per million tokens.
Free Tier
The following models are always free — no wallet, no USDC, no sign-up:
- NVIDIA Nemotron
- Llama 3.3 70B (via NVIDIA)
- Qwen 2.5 (via NVIDIA)
- DeepSeek R1 (via NVIDIA)
# Use free models only
/model freeFree models are capable
Switching Models
Switch to a specific model mid-conversation with the /model command:
# Switch to a specific model
/model claude-sonnet-4.6
/model gpt-5
/model gemini-2.5-pro
# Switch to a routing profile
/model auto
/model freeModel changes take effect immediately on the next message.
Prompt Caching
Anthropic models (Claude) support prompt caching, which reduces cost by approximately 75% for repeated context. This is enabled automatically — long conversations and system prompts are cached and reused across messages.
Caching is automatic