178 open-source AI models you can run on your own infrastructure. Self-hosting gives you complete data privacy, zero per-token costs, and full control over the model and its behavior.
Your data never leaves your infrastructure, which is critical for healthcare, finance, legal, and government use cases where data-residency and privacy regulations apply.
After the initial hardware investment, there are no per-request charges. At high volumes, self-hosting can be 10-100x cheaper than API-based services.
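How large the savings are depends on volume, hardware, and utilization. A back-of-envelope comparison makes the shape of the claim concrete; every price and throughput figure below is an illustrative assumption, not a quote:

```python
# Back-of-envelope cost comparison: hosted API vs. a self-hosted GPU.
# All numbers below are illustrative assumptions, not real quotes.

API_COST_PER_1M_TOKENS = 10.00   # assumed blended $/1M tokens for a hosted API
GPU_COST_PER_HOUR = 2.00         # assumed cloud rate for a single A100
TOKENS_PER_SECOND = 1000         # assumed batched throughput on that GPU

def api_cost(tokens: int) -> float:
    """Dollars to process `tokens` through a per-token-billed API."""
    return tokens / 1_000_000 * API_COST_PER_1M_TOKENS

def self_hosted_cost(tokens: int) -> float:
    """Dollars of GPU time to process `tokens` at the assumed throughput."""
    hours = tokens / TOKENS_PER_SECOND / 3600
    return hours * GPU_COST_PER_HOUR

monthly_tokens = 5_000_000_000  # 5B tokens/month, a high-volume workload
ratio = api_cost(monthly_tokens) / self_hosted_cost(monthly_tokens)
print(f"API: ${api_cost(monthly_tokens):,.0f}  "
      f"self-hosted: ${self_hosted_cost(monthly_tokens):,.0f}  "
      f"ratio: {ratio:.0f}x")
```

Under these assumptions the ratio lands inside the 10-100x range; at low volumes the fixed GPU cost dominates and the comparison can flip the other way.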
Run models with vLLM, Ollama, text-generation-inference, or llama.cpp. Smaller models run on consumer GPUs such as the RTX 4090; larger ones need datacenter-class cloud GPUs (A100, H100).
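Several of these servers (vLLM, and Ollama via its compatibility layer) expose an OpenAI-compatible HTTP API, so moving from a hosted API to self-hosting is often just a base-URL change. A minimal stdlib-only sketch of building such a request; the port, path, and model name are assumptions for a local vLLM server:

```python
import json
import urllib.request

# Assumed local vLLM endpoint; Ollama serves a compatible API on :11434/v1.
BASE_URL = "http://localhost:8000/v1"

def build_chat_request(prompt: str,
                       model: str = "meta-llama/Llama-3.1-8B-Instruct"):
    """Build an OpenAI-compatible chat completion request for a local server."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.2,
    }
    req = urllib.request.Request(
        f"{BASE_URL}/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    return req, payload

req, payload = build_chat_request("Summarize our data-retention policy.")
print(req.full_url)            # http://localhost:8000/v1/chat/completions
# urllib.request.urlopen(req)  # uncomment once a local server is running
```

Because the request shape matches the hosted APIs, existing client code usually keeps working against the self-hosted endpoint unchanged.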
Self-hosted models can be fine-tuned on your own data, creating domain-specific versions that outperform general-purpose models for your use case.
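Fine-tuning toolchains (for example LoRA via Hugging Face PEFT) commonly ingest instruction data as JSONL, one chat-formatted example per line. A stdlib-only sketch of converting in-house Q&A pairs into that shape; the sample records are hypothetical, and the exact field names depend on the trainer you pick:

```python
import json

# Hypothetical in-house domain data; replace with your own records.
qa_pairs = [
    ("What is our claims SLA?", "Claims are acknowledged within 24 hours."),
    ("Who approves wire transfers?", "Two officers must co-sign any wire."),
]

def to_chat_record(question: str, answer: str) -> dict:
    """One training example in the widely used chat-messages format."""
    return {
        "messages": [
            {"role": "user", "content": question},
            {"role": "assistant", "content": answer},
        ]
    }

# One JSON object per line, ready to write out as train.jsonl.
lines = [json.dumps(to_chat_record(q, a)) for q, a in qa_pairs]
jsonl = "\n".join(lines)
print(f"{len(lines)} training examples prepared")
```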