Models

55+ models from 12+ providers, including a free tier.

Available Providers

Franklin gives you access to 55+ models from 12+ providers through a single interface. No API keys needed — everything is routed through BlockRun's unified gateway. You pay provider cost + 5%, settled instantly in USDC via x402.

  • Anthropic — Claude Opus 4.6, Claude Opus 4.5, Claude Sonnet 4.6, Claude Haiku 4.5
  • OpenAI — GPT-5.4, GPT-5.4 Pro, GPT-5.3, o1, o3, GPT-5-mini, GPT-5-nano
  • Google — Gemini 3.1 Pro, Gemini 3 Flash Preview, Gemini 2.5 Pro, Gemini 2.5 Flash
  • xAI — Grok-3, Grok-3 Fast
  • DeepSeek — DeepSeek Chat V3.2, DeepSeek Reasoner
  • Z.AI — GLM-5, GLM-5 Turbo
  • Moonshot — Kimi K2.5 (262K context, MoE)
  • MiniMax — MiniMax M2.7 (204K context, reasoning)
  • NVIDIA — GPT-OSS 120B/20B, Nemotron, Llama, Qwen (free)

List All Models

See the full list of available models and their current status:

bash
franklin models

The output shows each model's name, provider, tier, and cost per million tokens.

Free Tier

The following models are always free — no wallet, no USDC, no sign-up:

  • NVIDIA Nemotron
  • Llama 3.3 70B (via NVIDIA)
  • Qwen 2.5 (via NVIDIA)
  • DeepSeek R1 (via NVIDIA)
bash
# Use free models only
/model free

Free models are capable

The free NVIDIA models are surprisingly strong for most coding tasks. Try them before spending credits — they may be all you need.

Switching Models

Switch to a specific model mid-conversation with the /model command:

bash
# Switch to a specific model
/model claude-sonnet-4.6
/model gpt-5
/model gemini-2.5-pro

# Switch to a routing profile
/model auto
/model free

Model changes take effect immediately on the next message.

Prompt Caching

Anthropic models (Claude) support prompt caching, which reduces cost by approximately 75% for repeated context. This is enabled automatically — long conversations and system prompts are cached and reused across messages.

Caching is automatic

You don't need to configure anything. Franklin detects cacheable content and sends the appropriate cache headers to Anthropic's API automatically.