This page lists the models enabled by default on the Prisme.ai platform, grouped by provider. Use it as a reference when configuring agents, RAG pipelines, or routing rules.Documentation Index
Fetch the complete documentation index at: https://docs.prisme.ai/llms.txt
Use this file to discover all available pages before exploring further.
The catalog below reflects the default platform configuration. Available models in your environment may differ depending on:
- Provider credentials configured by your platform administrator.
- Organization-level restrictions set in Model Governance.
- Per-agent model restrictions.
Cost tiers
Each completion model is tagged with a cost tier used by routing strategies (e.g. Cost Optimized) and by the soft-downgrade policy:| Tier | Meaning |
|---|---|
| low | Cheapest tier β small / fast models, suited for high-volume tasks. |
| medium | Balanced cost / quality β default for most general-purpose agents. |
| high | Most capable / most expensive β reserved for complex reasoning, long contexts, or final-quality outputs. |
| β | Not applicable (embedding and image generation models). |
Catalog
Switch between viewing the catalog grouped by provider or by region.- By provider
- By region
Anthropic
Anthropic
| Model | ID | Region | Type | Cost |
|---|---|---|---|---|
| Claude 4.5 Sonnet | claude-sonnet-4-5-20250929 | USA (Anthropic) | Completion | low |
| Claude 4.6 Opus | az-claude-opus-4-6 | Sweden (Azure) | Completion | low |
Azure
Azure
| Model | ID | Region | Type | Cost |
|---|---|---|---|---|
| GPT 4 | az-gpt-4 | EU (Azure) | Completion | high |
| GPT 4.1 mini | az-gpt-4-1-mini | Paris | Completion | low |
| GPT 4o | az-gpt-4o | EU (Azure) | Completion | high |
| GPT 5 Chat | az-gpt-5-chat | EU (Azure Data Zone) | Completion | high |
| GPT 5 Mini | az-gpt-5-mini | France | Completion | medium |
| GPT 5.1 Chat | az-gpt-5.1-chat | EU (Azure Data Zone) | Completion | high |
| GPT 5.2 Chat | az-gpt-5.2-chat | Sweden | Completion | high |
| az-embedding-ada | az-embedding-ada | EU (Azure) | Embedding | β |
| az-text-embedding-3-large | az-text-embedding-3-large | EU (Azure) | Embedding | β |
Bedrock (AWS)
Bedrock (AWS)
| Model | ID | Region | Type | Cost |
|---|---|---|---|---|
| Amazon Nova Pro | eu.amazon.nova-pro-v1:0 | EU (eu-west-3) | Completion | low |
| Claude 3.5 Sonnet | eu.anthropic.claude-3-5-sonnet-20240620-v1:0 | EU (eu-west-3) | Completion | low |
| Claude 3.7 Sonnet | claude-3-7-sonnet-20250219 | EU (Bedrock) | Completion | low |
| Claude 4 Sonnet | eu.anthropic.claude-sonnet-4-20250514-v1:0 | EU (eu-west-3) | Completion | low |
| Claude 4.5 Haiku | eu.anthropic.claude-haiku-4-5-20251001-v1:0 | EU (eu-west-3) | Completion | low |
| Claude 4.5 Sonnet | eu.anthropic.claude-sonnet-4-5-20250929-v1:0 | EU (eu-west-3) | Completion | low |
| Claude Sonnet 4.6 | anthropic.claude-sonnet-4-6 | EU (eu-west-3) | Completion | low |
| Llama 3.2 3B Instruct | eu.meta.llama3-2-3b-instruct-v1:0 | EU (eu-west-3) | Completion | low |
| amazon.titan-embed-image-v1 | amazon.titan-embed-image-v1 | EU (Bedrock) | Embedding | β |
| amazon.titan-embed-text-v1 | amazon.titan-embed-text-v1 | EU (Bedrock) | Embedding | β |
| cohere.embed-multilingual-v3 | cohere.embed-multilingual-v3 | EU (Bedrock) | Embedding | β |
| amazon.titan-image-generator-v1 | amazon.titan-image-generator-v1 | EU (Bedrock) | Image | β |
OpenAI
OpenAI
| Model | ID | Region | Type | Cost |
|---|---|---|---|---|
| GPT 4 | gpt-4 | USA (OpenAI) | Completion | high |
| GPT 4o | gpt-4o | USA (OpenAI) | Completion | high |
| GPT 5 Chat | gpt-5-chat-latest | USA (OpenAI) | Completion | high |
| GPT 5.1 Chat | gpt-5.1 | USA (OpenAI) | Completion | high |
| GPT o1 mini | o1-mini | USA (OpenAI) | Completion | low |
| GPT o3 mini | o3-mini | USA (OpenAI) | Completion | low |
| text-embedding-3-large | text-embedding-3-large | USA (OpenAI) | Embedding | β |
| text-embedding-3-small | text-embedding-3-small | USA (OpenAI) | Embedding | β |
| text-embedding-ada-002 | text-embedding-ada-002 | USA (OpenAI) | Embedding | β |
| dall-e-3 | dall-e-3 | USA (OpenAI) | Image | β |
Google
| Model | ID | Region | Type | Cost |
|---|---|---|---|---|
| Gemini 2.5 Flash | gemini-2.5-flash | USA (Google) | Completion | high |
| Gemini 2.5 Flash Lite | gemini-2.5-flash-lite | USA (Google) | Completion | medium |
| Gemini 2.5 Pro | gemini-2.5-pro | USA (Google) | Completion | high |
| Gemini 3.1 Flash Lite | gemini-3.1-flash-lite-preview | USA (Google) | Completion | high |
| Gemini 3.1 Pro (preview) | gemini-3.1-pro-preview | USA (Google) | Completion | high |
Google (Vertex AI)
Google (Vertex AI)
| Model | ID | Region | Type | Cost |
|---|---|---|---|---|
| Gemini 2.5 Flash (preview) | vertex-gemini-2.5-flash | Google Cloud (Vertex) | Completion | medium |
| Gemini 3 Pro | gemini-3-pro-preview | Google Cloud (Vertex) | Completion | medium |
Mistral
Mistral
| Model | ID | Region | Type | Cost |
|---|---|---|---|---|
| Codestral | codestral-latest | Paris (Mistral AI) | Completion | medium |
| Mistral Large | mistral-large-latest | Paris (Mistral AI) | Completion | high |
| Mistral Saba | mistral-saba-latest | Paris (Mistral AI) | Completion | low |
| Mistral Small | mistral-small-latest | Paris (Mistral AI) | Completion | high |
Deepseek
Deepseek
| Model | ID | Region | Type | Cost |
|---|---|---|---|---|
| Deepseek Chat | deepseek-chat | China | Completion | medium |
| Deepseek Reasoner | deepseek-reasoner | China | Completion | high |
OVHCloud
OVHCloud
| Model | ID | Region | Type | Cost |
|---|---|---|---|---|
| Deepseek R1 (Distill Llama 70B) | DeepSeek-R1-Distill-Llama-70B | France (OVHcloud) | Completion | high |
| Llama 3.3 (70B Instruct) | Meta-Llama-3_3-70B-Instruct | France (OVHcloud) | Completion | low |
| Mistral 7B Instruct | Mistral-7B-Instruct-v0.3 | France (OVHcloud) | Completion | low |
Qwen
Qwen
| Model | ID | Region | Type | Cost |
|---|---|---|---|---|
| Qwen 3 32B | qwen-3-32b | China | Completion | high |
Related
- Model Governance β restrict allowed models, configure defaults, set quotas, define routing rules, and configure failover.
- Capabilities β manage tools, guardrails, skills, and memory available to agents.
- LLM Gateway API β programmatic access to all models through a single endpoint.