AI Providers
SmartRAG AI provider configuration - OpenAI, Anthropic, Google Gemini, Azure OpenAI and custom providers
AI Provider Configuration
SmartRAG supports various AI providers:
OpenAI
OpenAI provides advanced language models and embeddings for production-ready applications:
{
"AI": {
"OpenAI": {
"ApiKey": "sk-proj-YOUR_KEY",
"Endpoint": "https://api.openai.com/v1",
"Model": "gpt-5.1",
"EmbeddingModel": "text-embedding-3-small",
"MaxTokens": 4096,
"Temperature": 0.7
}
}
}
builder.Services.AddSmartRag(configuration, options =>
{
options.AIProvider = AIProvider.OpenAI;
});
Models:
gpt-5.1- Latest advanced reasoning model (recommended)gpt-5- Advanced reasoning capabilitiesgpt-4o- Previous generation advanced modelgpt-4o-mini- Cost-effective and fasttext-embedding-3-small,text-embedding-3-large- Embeddings (recommended)text-embedding-ada-002- Legacy embeddings
Anthropic (Claude)
Important: VoyageAI Required
Anthropic Claude models require a separate VoyageAI API key for embeddings since Anthropic doesn't provide embedding models.
- Get VoyageAI Key: console.voyageai.com
- Documentation: Anthropic Embeddings Guide
{
"AI": {
"Anthropic": {
"ApiKey": "sk-ant-ANTHROPIC_KEY",
"Model": "claude-sonnet-4-5",
"MaxTokens": 4096,
"Temperature": 0.3,
"EmbeddingApiKey": "pa-VOYAGE_KEY",
"EmbeddingModel": "voyage-3.5"
}
}
}
builder.Services.AddSmartRag(configuration, options =>
{
options.AIProvider = AIProvider.Anthropic;
});
Claude Models:
claude-sonnet-4-5- Latest and most intelligent (recommended)claude-3.5-sonnet- Previous generationclaude-3-opus-20240229- Highest capabilityclaude-3-haiku-20240307- Fastest
VoyageAI Embedding Models:
voyage-3.5- High quality (recommended)voyage-code-2- Optimized for codevoyage-2- General purpose
Google Gemini
Google Gemini offers cost-effective AI models with multimodal capabilities:
{
"AI": {
"Gemini": {
"ApiKey": "GEMINI_KEY",
"Model": "gemini-2.5-pro",
"EmbeddingModel": "embedding-001",
"MaxTokens": 4096,
"Temperature": 0.7
}
}
}
builder.Services.AddSmartRag(configuration, options =>
{
options.AIProvider = AIProvider.Gemini;
});
Models:
gemini-2.5-pro- Advanced reasoning capabilities (recommended)gemini-2.5-flash- Fast and cost-effectivegemini-2.0-flash- Previous generation workhorsegemini-1.5-pro- Legacy advanced modelembedding-001- Text embeddings
Azure OpenAI
Azure OpenAI provides enterprise-grade AI services with enhanced security and compliance:
{
"AI": {
"AzureOpenAI": {
"ApiKey": "AZURE_KEY",
"Endpoint": "https://your-resource.openai.azure.com/",
"Model": "gpt-5.1",
"EmbeddingModel": "text-embedding-3-small",
"DeploymentName": "gpt-5.1-deployment",
"MaxTokens": 4096,
"Temperature": 0.7
}
}
}
builder.Services.AddSmartRag(configuration, options =>
{
options.AIProvider = AIProvider.AzureOpenAI;
});
Custom Provider (Ollama / LM Studio)
Ollama / LM Studio with 100% On-Premise AI
Run AI models completely on-premise for complete data privacy - perfect for enterprise deployments, GDPR/HIPAA compliance.
Ollama (On-premise Models)
{
"AI": {
"Custom": {
"ApiKey": "not-required",
"Endpoint": "http://localhost:11434/v1/chat/completions",
"Model": "llama2",
"EmbeddingModel": "nomic-embed-text"
}
}
}
LM Studio (On-premise Models)
{
"AI": {
"Custom": {
"ApiKey": "not-required",
"Endpoint": "http://localhost:1234/v1/chat/completions",
"Model": "local-model",
"EmbeddingModel": "local-embedding"
}
}
}
builder.Services.AddSmartRag(configuration, options =>
{
options.AIProvider = AIProvider.Custom;
});
Supported Custom APIs:
- 🦙 Ollama - On-premise models
- 🏠 LM Studio - On-premise AI environment
- 🔗 OpenRouter - Access to 100+ models
- ⚡ Groq - Lightning-fast inference
- 🌐 Together AI - Open source models
- Any OpenAI-compatible API
Provider Comparison
Compare AI providers to choose the best option for your use case:
| Provider | Strengths | Weaknesses | Best Use Case |
|---|---|---|---|
| OpenAI | Advanced models, reliable | Expensive, data privacy concerns | Production, critical applications |
| Anthropic | Security-focused, quality output | VoyageAI required, limited access | Security-critical applications |
| Google Gemini | Cost-effective, multimodal | Limited production support | Prototyping, development |
| Azure OpenAI | Enterprise security, SLA | Complex setup | Enterprise applications |
| Ollama/LM Studio | 100% on-premise, free | Performance limitations | Data privacy critical |