OpenRouter Models | Access 300+ AI Models Through One API | OpenRouter

Explore and browse 300+ models and providers on our website, or with our API. You can also subscribe to our RSS feed to stay updated on new models.

Query Parameters

The Models API supports query parameters to filter the list of models returned.

`output_modalities`

Filter models by their output capabilities. Accepts a comma-separated list of modalities or "all" to include every model regardless of output type.

Value	Description
`text`	Models that produce text output (default)
`image`	Models that generate images
`audio`	Models that produce audio output
`embeddings`	Embedding models
`all`	Include all models, skip modality filtering

Examples:

$ # Default — text models only
$ curl "https://openrouter.ai/api/v1/models"
$ 
$ # Image generation models only
$ curl "https://openrouter.ai/api/v1/models?output_modalities=image"
$ 
$ # Text and image models
$ curl "https://openrouter.ai/api/v1/models?output_modalities=text,image"
$ 
$ # All models regardless of modality
$ curl "https://openrouter.ai/api/v1/models?output_modalities=all"

The same parameter is available on the /v1/models/count endpoint so that counts stay consistent with list results.

`supported_parameters`

Filter models by the API parameters they support. For example, to find models that support tool calling:

$ curl "https://openrouter.ai/api/v1/models?supported_parameters=tools"

`sort`

Sort models server-side before they’re returned. Accepts one of the following values:

Value	Description
`pricing-low-to-high`	Cheapest models first (weighted average of prompt, completion, request, and web_search pricing)
`pricing-high-to-low`	Most expensive models first
`context-high-to-low`	Largest context window first
`throughput-high-to-low`	Highest tokens/second first (p50 throughput from routing heuristics)
`latency-low-to-high`	Lowest time-to-first-token first (p50 latency)
`most-popular`	Most tokens processed in the last week
`top-weekly`	Same as `most-popular`
`newest`	Most recently added to OpenRouter

Models without data for the requested sort dimension (e.g. no pricing, no throughput heuristics) sort last. Omitting sort preserves the default ordering (backward compatible).

$ # Cheapest models first
$ curl "https://openrouter.ai/api/v1/models?sort=pricing-low-to-high"
$ 
$ # Newest models
$ curl "https://openrouter.ai/api/v1/models?sort=newest"
$ 
$ # Combine with filters
$ curl "https://openrouter.ai/api/v1/models?sort=throughput-high-to-low&supported_parameters=tools"

Single Model Lookup

Look up a single model’s full details without fetching the entire list:

GET /api/v1/model/{author}/{slug}

The endpoint resolves aliases automatically. For example, anthropic/claude-3-5-sonnet redirects to the canonical anthropic/claude-3.5-sonnet and returns its data.

Variant suffixes are also supported — append :free, :thinking, etc. to the slug:

$ # Look up a specific model
$ curl "https://openrouter.ai/api/v1/model/openai/gpt-4o"
$ 
$ # Aliases resolve automatically
$ curl "https://openrouter.ai/api/v1/model/anthropic/claude-3-5-sonnet"
$ 
$ # Variant suffixes
$ curl "https://openrouter.ai/api/v1/model/openai/gpt-4:free"

Returns 404 if the model doesn’t exist and isn’t an alias for another model. The response shape wraps the same Model object used in the list endpoint:

1 {
2   "data": {
3     "id": "openai/gpt-4o",
4     "name": "GPT-4o",
5     "pricing": { "prompt": "0.0000025", "completion": "0.00001", ... },
6     ...
7   }
8 }

Models API Standard

Our Models API makes the most important information about all LLMs freely available as soon as we confirm it.

API Response Schema

The Models API returns a standardized JSON response format that provides comprehensive metadata for each available model. This schema is cached at the edge and designed for reliable integration with production applications.

Root Response Object

1 {
2   "data": [
3     /* Array of Model objects */
4   ]
5 }

Model Object Schema

Each model in the data array contains the following standardized fields:

Field	Type	Description
`id`	`string`	Unique model identifier used in API requests (e.g., `"google/gemini-2.5-pro-preview"`)
`canonical_slug`	`string`	Permanent slug for the model that never changes
`name`	`string`	Human-readable display name for the model
`created`	`number`	Unix timestamp of when the model was added to OpenRouter
`description`	`string`	Detailed description of the model’s capabilities and characteristics
`context_length`	`number`	Maximum context window size in tokens
`architecture`	`Architecture`	Object describing the model’s technical capabilities
`pricing`	`Pricing`	Lowest price structure for using this model
`top_provider`	`TopProvider`	Configuration details for the primary provider
`per_request_limits`	Rate limiting information (null if no limits)
`supported_parameters`	`string[]`	Array of supported API parameters for this model
`default_parameters`	`object \| null`	Default parameter values for this model (null if none)
`expiration_date`	`string \| null`	Deprecation date for the model endpoint (null if not deprecated)
`benchmarks`	`Benchmarks \| undefined`	Third-party benchmark rankings (omitted when no data is available)

Architecture Object

1 {
2   "input_modalities": string[], // Supported input types: ["file", "image", "text"]
3   "output_modalities": string[], // Supported output types: ["text"]
4   "tokenizer": string,          // Tokenization method used
5   "instruct_type": string | null // Instruction format type (null if not applicable)
6 }

Pricing Object

All pricing values are in USD per token/request/unit. A value of "0" indicates the feature is free.

1 {
2   "prompt": string,           // Cost per input token
3   "completion": string,       // Cost per output token
4   "request": string,          // Fixed cost per API request
5   "image": string,           // Cost per image input
6   "web_search": string,      // Cost per web search operation
7   "internal_reasoning": string, // Cost for internal reasoning tokens
8   "input_cache_read": string,   // Cost per cached input token read
9   "input_cache_write": string   // Cost per cached input token write
10 }

Top Provider Object

1 {
2   "context_length": number,        // Provider-specific context limit
3   "max_completion_tokens": number, // Maximum tokens in response
4   "is_moderated": boolean         // Whether content moderation is applied
5 }

Benchmarks Object

Present only on models that have been evaluated in third-party benchmarks. Currently includes Design Arena rankings.

1 {
2   "design_arena": [
3     {
4       "arena": string,    // Arena type (e.g. "models", "builders", "agents")
5       "category": string, // Category within the arena (e.g. "website", "gamedev")
6       "elo": number,      // ELO rating from head-to-head arena battles
7       "win_rate": number,  // Win rate percentage
8       "rank": number      // Rank within this arena+category (1 = highest ELO)
9     }
10   ]
11 }

Rankings are computed among models listed on OpenRouter, not the full external leaderboard. Models without benchmark data omit the benchmarks field entirely.

$ # Find models with benchmark data
$ curl -s "https://openrouter.ai/api/v1/models" | jq '.data[] | select(.benchmarks) | {id, benchmarks}'

Supported Parameters

The supported_parameters array indicates which OpenAI-compatible parameters work with each model:

tools - Function calling capabilities
tool_choice - Tool selection control
max_tokens - Response length limiting
temperature - Randomness control
top_p - Nucleus sampling
reasoning - Internal reasoning mode
include_reasoning - Include reasoning in response
structured_outputs - JSON schema enforcement
response_format - Output format specification
stop - Custom stop sequences
frequency_penalty - Repetition reduction
presence_penalty - Topic diversity
seed - Deterministic outputs

Different models tokenize text in different ways

Some models break up text into chunks of multiple characters (GPT, Claude, Llama, etc), while others tokenize by character (PaLM). This means that token counts (and therefore costs) will vary between models, even when inputs and outputs are the same. Costs are displayed and billed according to the tokenizer for the model in use. You can use the usage field in the response to get the token counts for the input and output.

If there are models or providers you are interested in that OpenRouter doesn’t have, please tell us about them in our Discord channel.

For Providers

If you’re interested in working with OpenRouter, you can learn more on our providers page.