LLM Providers
Configure your AI model provider
Each GNETiX organization configures its own LLM provider and credentials. No data is shared between tenants, and API keys are encrypted at rest using Fernet encryption.
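To illustrate what encryption at rest with Fernet looks like, here is a minimal sketch using the `cryptography` package. The key handling shown is an assumption for illustration only; the actual GNETiX key-management details are not documented here.

```python
from cryptography.fernet import Fernet

# Illustrative only: in practice the Fernet key would be loaded from a
# secret store, not generated per run.
fernet = Fernet(Fernet.generate_key())

plaintext = b"sk-ant-example-key"
token = fernet.encrypt(plaintext)   # opaque ciphertext, safe to store at rest

assert fernet.decrypt(token) == plaintext
```

Fernet provides authenticated symmetric encryption, so a tampered ciphertext fails to decrypt rather than yielding garbage.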
Supported Providers
Anthropic
Direct access to Claude models via the Anthropic API.
| Field | Required | Description |
|---|---|---|
| API Key | Yes | Your Anthropic API key (starts with sk-ant-) |
Model mapping:
| Tier | Model |
|---|---|
| fast | Claude Haiku 4.5 |
| balanced | Claude Sonnet 4.6 |
| powerful | Claude Opus 4.6 |
Anthropic is the only direct-API provider that supports extended thinking, giving Claude a dedicated reasoning budget for complex tasks.
OpenAI
Direct access to GPT models via the OpenAI API.
| Field | Required | Description |
|---|---|---|
| API Key | Yes | Your OpenAI API key (starts with sk-) |
Model mapping:
| Tier | Model |
|---|---|
| fast | GPT-4o-mini |
| balanced | GPT-4o |
| powerful | GPT-4o |
Amazon Bedrock
Access Claude models through your AWS account. Data stays within your VPC and is subject to your AWS policies.
| Field | Required | Description |
|---|---|---|
| AWS Access Key ID | Yes | IAM access key with Bedrock invoke permissions |
| AWS Secret Access Key | Yes | Corresponding secret key |
| AWS Region | Yes | Region where Bedrock is enabled (e.g., us-east-1) |
Model mapping:
| Tier | Model |
|---|---|
| fast | Claude Haiku 3.5 |
| balanced | Claude Sonnet 4 |
| powerful | Claude Sonnet 4 |
Because Bedrock serves Claude models under the hood, it also supports extended thinking.
Azure OpenAI
Access GPT models through your Azure subscription. Suitable for organizations with enterprise compliance requirements.
| Field | Required | Description |
|---|---|---|
| API Key | Yes | Azure OpenAI resource key |
| Endpoint URL | Yes | Your resource endpoint (e.g., https://my-resource.openai.azure.com) |
Model mapping:
| Tier | Model |
|---|---|
| fast | GPT-4o-mini |
| balanced | GPT-4o |
| powerful | GPT-4o |
Configuration
To configure an LLM provider for your organization:
- Navigate to Org Settings in the sidebar.
- Open the LLM tab.
- Select your provider from the dropdown.
- Enter the required credentials for your chosen provider.
- Save the configuration.
The API key (or AWS credentials) is encrypted immediately on save. The UI only ever displays a masked hint (e.g., ***...a1b2) -- the plaintext key is never retrievable after initial entry.
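The masking behavior can be sketched as a small helper (the function name and exact format are assumptions inferred from the example hint above):

```python
def mask_key(key: str, visible: int = 4) -> str:
    """Return a display hint like '***...a1b2', exposing only the last few characters."""
    return f"***...{key[-visible:]}"

print(mask_key("sk-ant-abc123a1b2"))  # → ***...a1b2
```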
Only one provider can be active per organization at a time. Switching providers takes effect immediately, starting with the next message.
Model Tiers
Rather than selecting specific models, GNETiX uses a tier abstraction. The Director requests a tier (fast, balanced, or powerful) and the system maps it to the appropriate model for your configured provider.
- fast -- optimized for low latency and simple tasks (classification, short answers, routing)
- balanced -- general-purpose tier used by default for most conversations
- powerful -- maximum capability for complex reasoning, multi-step analysis, and nuanced responses
Tier-to-model mappings are defined in models.yaml and can be updated without redeploying the backend.
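As an illustration of the tier abstraction, here is a hypothetical mapping and lookup. The structure mirrors the tables above, but the dictionary layout and model identifiers are assumptions, not the actual models.yaml schema:

```python
# Hypothetical parsed contents of models.yaml; the real schema and model
# identifiers may differ.
MODELS = {
    "anthropic": {"fast": "claude-haiku-4-5", "balanced": "claude-sonnet-4-6", "powerful": "claude-opus-4-6"},
    "openai":    {"fast": "gpt-4o-mini",      "balanced": "gpt-4o",            "powerful": "gpt-4o"},
}

def resolve_model(provider: str, tier: str) -> str:
    """Map the tier requested by the Director to the concrete model for the org's provider."""
    return MODELS[provider][tier]

print(resolve_model("openai", "balanced"))  # → gpt-4o
```

Because the Director only ever names a tier, editing this mapping reroutes traffic without touching any calling code.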
Extended Thinking
Extended thinking gives Claude a dedicated internal reasoning budget before producing its visible response. This improves performance on complex multi-step problems, tool selection, and nuanced analysis.
| Provider | Extended Thinking Support |
|---|---|
| Anthropic | Yes |
| Amazon Bedrock | Yes |
| OpenAI | No |
| Azure OpenAI | No |
When enabled, the Director allocates a 5,000-token thinking budget on each LLM call. This is configured per-organization in the Org Settings > LLM tab via the extended thinking toggle.
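With the Anthropic Messages API, a thinking budget is passed via the `thinking` request parameter. The sketch below builds such a request; the model id, prompt, and `max_tokens` value are illustrative assumptions, not the Director's actual values:

```python
# Sketch of the request parameters sent when extended thinking is enabled.
THINKING_BUDGET = 5_000

request = {
    "model": "claude-sonnet-4-6",   # illustrative model id
    "max_tokens": 8_000,            # must exceed the thinking budget
    "thinking": {"type": "enabled", "budget_tokens": THINKING_BUDGET},
    "messages": [{"role": "user", "content": "Plan a three-step migration."}],
}

assert request["thinking"]["budget_tokens"] == 5_000
```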
Spend Tracking and Budget Caps
Every LLM call is logged with token counts and estimated cost. GNETiX tracks spend per organization, enabling:
- Per-org spend dashboards -- view token usage and cost breakdown over time
- Budget caps -- set a maximum USD spend per organization; when the cap is reached, LLM calls are blocked until an admin raises or removes the limit
Budget caps are configured in the Org Settings > LLM tab. Leaving the budget field empty removes the cap entirely.