Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.prisme.ai/llms.txt

Use this file to discover all available pages before exploring further.

This page lists the models enabled by default on the Prisme.ai platform, grouped by provider. Use it as a reference when configuring agents, RAG pipelines, or routing rules.
The catalog below reflects the default platform configuration. Available models in your environment may differ depending on:
  • Provider credentials configured by your platform administrator.
  • Organization-level restrictions set in Model Governance.
  • Per-agent model restrictions.
To check what is actually allowed for your organization, go to Agents Controls > Models in AI Governance.

Cost tiers

Each completion model is tagged with a cost tier used by routing strategies (e.g. Cost Optimized) and by the soft-downgrade policy:
TierMeaning
lowCheapest tier β€” small / fast models, suited for high-volume tasks.
mediumBalanced cost / quality β€” default for most general-purpose agents.
highMost capable / most expensive β€” reserved for complex reasoning, long contexts, or final-quality outputs.
β€”Not applicable (embedding and image generation models).

Catalog

Switch between viewing the catalog grouped by provider or by region.
ModelIDRegionTypeCost
Claude 4.5 Sonnetclaude-sonnet-4-5-20250929USA (Anthropic)Completionlow
Claude 4.6 Opusaz-claude-opus-4-6Sweden (Azure)Completionlow
ModelIDRegionTypeCost
GPT 4az-gpt-4EU (Azure)Completionhigh
GPT 4.1 miniaz-gpt-4-1-miniParisCompletionlow
GPT 4oaz-gpt-4oEU (Azure)Completionhigh
GPT 5 Chataz-gpt-5-chatEU (Azure Data Zone)Completionhigh
GPT 5 Miniaz-gpt-5-miniFranceCompletionmedium
GPT 5.1 Chataz-gpt-5.1-chatEU (Azure Data Zone)Completionhigh
GPT 5.2 Chataz-gpt-5.2-chatSwedenCompletionhigh
az-embedding-adaaz-embedding-adaEU (Azure)Embeddingβ€”
az-text-embedding-3-largeaz-text-embedding-3-largeEU (Azure)Embeddingβ€”
ModelIDRegionTypeCost
Amazon Nova Proeu.amazon.nova-pro-v1:0EU (eu-west-3)Completionlow
Claude 3.5 Sonneteu.anthropic.claude-3-5-sonnet-20240620-v1:0EU (eu-west-3)Completionlow
Claude 3.7 Sonnetclaude-3-7-sonnet-20250219EU (Bedrock)Completionlow
Claude 4 Sonneteu.anthropic.claude-sonnet-4-20250514-v1:0EU (eu-west-3)Completionlow
Claude 4.5 Haikueu.anthropic.claude-haiku-4-5-20251001-v1:0EU (eu-west-3)Completionlow
Claude 4.5 Sonneteu.anthropic.claude-sonnet-4-5-20250929-v1:0EU (eu-west-3)Completionlow
Claude Sonnet 4.6anthropic.claude-sonnet-4-6EU (eu-west-3)Completionlow
Llama 3.2 3B Instructeu.meta.llama3-2-3b-instruct-v1:0EU (eu-west-3)Completionlow
amazon.titan-embed-image-v1amazon.titan-embed-image-v1EU (Bedrock)Embeddingβ€”
amazon.titan-embed-text-v1amazon.titan-embed-text-v1EU (Bedrock)Embeddingβ€”
cohere.embed-multilingual-v3cohere.embed-multilingual-v3EU (Bedrock)Embeddingβ€”
amazon.titan-image-generator-v1amazon.titan-image-generator-v1EU (Bedrock)Imageβ€”
ModelIDRegionTypeCost
GPT 4gpt-4USA (OpenAI)Completionhigh
GPT 4ogpt-4oUSA (OpenAI)Completionhigh
GPT 5 Chatgpt-5-chat-latestUSA (OpenAI)Completionhigh
GPT 5.1 Chatgpt-5.1USA (OpenAI)Completionhigh
GPT o1 minio1-miniUSA (OpenAI)Completionlow
GPT o3 minio3-miniUSA (OpenAI)Completionlow
text-embedding-3-largetext-embedding-3-largeUSA (OpenAI)Embeddingβ€”
text-embedding-3-smalltext-embedding-3-smallUSA (OpenAI)Embeddingβ€”
text-embedding-ada-002text-embedding-ada-002USA (OpenAI)Embeddingβ€”
dall-e-3dall-e-3USA (OpenAI)Imageβ€”
ModelIDRegionTypeCost
Gemini 2.5 Flashgemini-2.5-flashUSA (Google)Completionhigh
Gemini 2.5 Flash Litegemini-2.5-flash-liteUSA (Google)Completionmedium
Gemini 2.5 Progemini-2.5-proUSA (Google)Completionhigh
Gemini 3.1 Flash Litegemini-3.1-flash-lite-previewUSA (Google)Completionhigh
Gemini 3.1 Pro (preview)gemini-3.1-pro-previewUSA (Google)Completionhigh
ModelIDRegionTypeCost
Gemini 2.5 Flash (preview)vertex-gemini-2.5-flashGoogle Cloud (Vertex)Completionmedium
Gemini 3 Progemini-3-pro-previewGoogle Cloud (Vertex)Completionmedium
ModelIDRegionTypeCost
Codestralcodestral-latestParis (Mistral AI)Completionmedium
Mistral Largemistral-large-latestParis (Mistral AI)Completionhigh
Mistral Sabamistral-saba-latestParis (Mistral AI)Completionlow
Mistral Smallmistral-small-latestParis (Mistral AI)Completionhigh
ModelIDRegionTypeCost
Deepseek Chatdeepseek-chatChinaCompletionmedium
Deepseek Reasonerdeepseek-reasonerChinaCompletionhigh
ModelIDRegionTypeCost
Deepseek R1 (Distill Llama 70B)DeepSeek-R1-Distill-Llama-70BFrance (OVHcloud)Completionhigh
Llama 3.3 (70B Instruct)Meta-Llama-3_3-70B-InstructFrance (OVHcloud)Completionlow
Mistral 7B InstructMistral-7B-Instruct-v0.3France (OVHcloud)Completionlow
ModelIDRegionTypeCost
Qwen 3 32Bqwen-3-32bChinaCompletionhigh
  • Model Governance β€” restrict allowed models, configure defaults, set quotas, define routing rules, and configure failover.
  • Capabilities β€” manage tools, guardrails, skills, and memory available to agents.
  • LLM Gateway API β€” programmatic access to all models through a single endpoint.