Create an AI model configuration.
JWT token from Stytch B2B authentication (magic link, SSO, or M2M)
Schema for creating a new AI model configuration. Enterprise-grade validation with comprehensive field documentation for automatic OpenAPI generation.
ID of the system model to use as the base. User models must reference a system model to inherit pricing and capabilities.
1 - 36"550e8400-e29b-41d4-a716-446655440000"
Your model's human-readable name shown in dashboards and responses (e.g., 'AI models Omni', 'My Finance Model'). Use this to differentiate organization-specific or customized variants.
1 - 200"large-model Omni"
"reasoning-model 3.5 Sonnet"
"Custom Support Model"
Maximum tokens considered at once, including system prompt, message history, user input, tools/functions, and the model's output. Larger windows allow longer context but may increase cost/latency. If not specified, inherits from parent model.
1 <= x <= 20000004096
Upper bound for tokens the model may generate in a single response (output tokens only, distinct from the context window). If not specified, inherits from parent model.
1 <= x <= 2000001000
Whether the provider/model supports SSE (Server-Sent Events) streaming responses.
Enable/disable this model for your organization without deleting it.
ID of a system prompt composition to use for this model. System prompts are composed from particles (role, tone, guardrails, etc.).
"550e8400-e29b-41d4-a716-446655440002"
Full system prompt override (escape hatch). If set, bypasses particle-based composition and uses this text directly.
50000Successful Response
Schema for AI model response data.
Unique identifier for the model
"550e8400-e29b-41d4-a716-446655440000"
Human-readable display name
"large-model Omni"
"reasoning-model 3.5 Sonnet"
Model's context window size in tokens
4096
128000
Whether the model supports streaming
true
Whether the model is enabled
true
Whether this is a company-provided base model
true
false
When the model was created
"2025-09-08T06:33:19Z"
When the model was last updated
"2025-09-08T06:33:19Z"
Model description
"Advanced language model for complex tasks"
AI provider name (NULL for user models, inherited from parent)
"provider-a"
Provider-specific model identifier (NULL for user models, inherited from parent)
"large-modelo"
Maximum tokens per request
4096
Whether the model supports tool/function calling
true
false
Whether the model supports vision/image inputs
true
false
Whether the model supports reasoning/thinking tokens (AI models extended thinking, o1, AI models thinking)
true
false
Credit multiplier for billing (system models only, user models inherit from parent)
1
Access tier for this model. 'standard' models are available on all plans. 'premium' models require a Pro or Enterprise subscription.
"standard"
"premium"
Current lifecycle state of the model. 'active': normal operation. 'disabled': temporarily disabled by admin. 'deprecated': end-of-life date is scheduled (see endOfLifeAt). 'end_of_life': model has been retired and should no longer be used.
active, disabled, deprecated, end_of_life "active"
"deprecated"
"end_of_life"
Scheduled end-of-life date for this model. When set and in the future, the model is deprecated. When in the past, the model has reached end of life. Null means the model has no scheduled retirement.
"2026-06-01T00:00:00Z"
User-facing deprecation notice. Typically includes a reason and a suggested replacement model. Only present when lifecycleState is 'deprecated' or 'end_of_life'.
"This model is scheduled for retirement on 2026-06-01. Please migrate to large-modelo."
ID of the suggested replacement model when this model is deprecated or end-of-life. Use this to guide users toward a migration path. Null when no replacement is designated.
"660e8400-e29b-41d4-a716-446655440001"
Whether this model is recommended for new configurations. Curated by Cuadra — typically the best balance of capability, speed, and cost for each provider.
true
false
ID of the parent system model (for user models)
"550e8400-e29b-41d4-a716-446655440001"
ID of the system prompt composition assigned to this model
"550e8400-e29b-41d4-a716-446655440002"
Full system prompt override (if set, bypasses particle composition)
"You are a helpful assistant."