SillyTavern Integration

SillyTavern is an open-source, locally-installed frontend for interacting with large language models. It supports a wide range of AI APIs --- both local and cloud-based --- and provides a rich chat interface with character cards, group chats, and advanced prompt management.

SillyTavern's Chat Completion mode supports OpenAI-compatible APIs, which means you can connect it directly to LLM API to use any model available through your subscription.

Prerequisites

An LLM API account with an API key
SillyTavern installed or accessible

Setup

Get Your LLM API Key

Log in to your LLM API dashboard
Click Create Key to Start
Copy your new API key immediately — it will only be shown once
Store the key securely (e.g., in a password manager or .env file)

LLM API is an OpenAI-compatible gateway that gives you access to dozens of AI models through a single API key and endpoint.

Configure LLM API in SillyTavern

Open SillyTavern in your browser (typically at http://localhost:8000).
Click the plug icon to open the API Connections panel.
From the API dropdown at the top, select "Chat Completion".
In the Chat Completion Source dropdown, select "Custom (OpenAI-compatible)".
Fill in the following fields:
- Custom Endpoint (Base URL): https://api.llmapi.ai/v1
- API Key: paste the key you copied from app.llmapi.ai/api-keys
Enter the model name you want to use in the "Model" field (e.g., openai/gpt-4o, anthropic/claude-3-5-sonnet).
Click "Connect" to test the connection.
Once connected, start or continue a chat --- all requests will now be routed through LLM API.

Test the Integration

Verify that SillyTavern can successfully communicate with LLM API by sending a test request. All requests will now be routed through LLM API.

SillyTavern's Chat Completion mode sends messages in the OpenAI format. LLM API is fully compatible, so all features including streaming, token counting, and model switching work seamlessly.

Benefits of Using LLM API with SillyTavern

Multi-Provider Access: Use models from OpenAI, Anthropic, Google, and more through a single API
Cost Control: Track and limit your AI spending with detailed usage analytics
Unified Billing: One account for all providers instead of managing multiple API keys
Caching: Reduce costs with response caching for repeated requests

View all available models on the models page.