Features
- Multiple providers - Access 50+ models from various providers
- Automatic failover - Route to available providers
- Cost optimization - Choose models by price/performance
- Single API key - One key for all providers
Installation
Configuration
Environment Variables
Character Configuration
Supported Operations
| Operation | Support | Notes |
|---|---|---|
| TEXT_GENERATION | ✅ | All available models |
| EMBEDDING | ✅ | Multiple embedding models available |
| OBJECT_GENERATION | ✅ | Model dependent |
Embedding Models
OpenRouter now provides embedding endpoints with multiple models:text-embedding-3-large(OpenAI via OpenRouter)qwen3-embedding(Qwen)gemini-embedding(Google)mistral-embed(Mistral)
Model Configuration
The plugin uses model tiers:- SMALL_MODEL: Fast, cost-effective responses
- LARGE_MODEL: Complex reasoning, best quality
- IMAGE_MODEL: Multimodal capabilities
Premium Models
- Any Anthropic Claude model (Opus, Sonnet, Haiku)
- Any OpenAI GPT model (GPT-4o, GPT-4, GPT-3.5)
- Google Gemini models (Pro, Flash, etc.)
- Cohere Command models
Open Models
- Meta Llama models (3.1, 3.2, 3.3)
- Mistral/Mixtral models
- Many models with
:freesuffix for testing
Pricing Strategy
OpenRouter charges a small markup (usually ~10%) on top of provider prices:- Pay-per-token - No monthly fees
- Price transparency - See costs per model
- Credits system - Pre-pay for usage

