152 open-source AI models you can run on your own infrastructure. Self-hosting gives you complete data privacy, zero per-token costs, and full control over the model and its behavior.
Your data never leaves your infrastructure. Critical for healthcare, finance, legal, and government use cases where data residency and privacy regulations apply.
After the initial hardware investment, there are no per-request charges. At high volumes, self-hosting can be 10-100x cheaper than API-based services.
Run models with vLLM, Ollama, text-generation-inference, or llama.cpp. Smaller models run on consumer GPUs (e.g., an RTX 4090), while larger ones need cloud GPUs (A100, H100).
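Most of these servers expose an OpenAI-compatible HTTP API, so switching from a hosted provider often means only changing the base URL. A minimal sketch of building such a request, assuming a local Ollama (port 11434) or vLLM (port 8000) server and a placeholder model name "llama3":

```python
import json

def chat_payload(prompt: str, model: str = "llama3") -> dict:
    """Build an OpenAI-compatible chat request for a local server.
    vLLM and Ollama both serve /v1/chat/completions; the model name
    "llama3" is a placeholder -- use whatever model you have pulled."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "temperature": 0.2,
    }

# POST this as JSON to http://localhost:11434/v1/chat/completions (Ollama)
# or http://localhost:8000/v1/chat/completions (vLLM's default port).
payload = chat_payload("Summarize this contract clause.")
print(json.dumps(payload, indent=2))
```

Because the request shape matches the hosted APIs, existing client code usually needs no changes beyond pointing it at localhost.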
Self-hosted models can be fine-tuned on your own data, creating domain-specific versions that outperform general-purpose models for your use case.
Self-hosting gives you complete data privacy (no data leaves your servers), eliminates per-token API costs, removes rate limits, enables offline operation, and allows fine-tuning for your specific use case.
Requirements depend on model size. Small models (7B parameters) run on consumer GPUs with 8GB VRAM when quantized to 4-bit. Medium models (13-30B) need 24GB+ VRAM. Large models (70B+) require multiple high-end GPUs or specialized inference hardware.
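These thresholds follow from simple arithmetic: weight memory is roughly parameter count times bytes per parameter, plus headroom for the KV cache and activations. A rough estimator, with the 20% overhead factor being an illustrative assumption:

```python
def estimate_vram_gb(params_billion: float, bytes_per_param: float = 2.0,
                     overhead: float = 1.2) -> float:
    """Rough VRAM estimate: weights at the given precision, plus ~20%
    headroom for KV cache and activations (heuristic, not exact).
    bytes_per_param: 2.0 for FP16, 1.0 for 8-bit, 0.5 for 4-bit."""
    return params_billion * bytes_per_param * overhead

# A 7B model in FP16 needs roughly 17 GB, which is why an 8 GB
# consumer GPU only fits it after 4-bit quantization (~4 GB).
print(round(estimate_vram_gb(7), 1))        # FP16
print(round(estimate_vram_gb(7, 0.5), 1))   # 4-bit
```

The same arithmetic explains the larger tiers: a 30B model at 4-bit lands near 18 GB (hence the 24GB+ cards), and 70B+ exceeds any single consumer GPU even when quantized.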
Popular tools include Ollama (easiest setup), llama.cpp (most efficient for CPU inference), vLLM (fastest for GPU serving), and text-generation-webui (feature-rich UI). Each excels at different use cases.
For high-volume usage (thousands of requests/day), self-hosting is significantly cheaper. For low-volume or sporadic use, API access is more cost-effective since you avoid hardware costs and maintenance overhead.
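The crossover point can be sketched with a break-even calculation. All figures below (hardware price, power cost, per-request API price) are illustrative assumptions, not quotes:

```python
import math

def breakeven_requests(hardware_cost: float, monthly_power: float,
                       api_cost_per_request: float, months: int = 12) -> int:
    """Monthly request volume at which self-hosting total cost equals
    API spend over the given horizon. Ignores admin time and upgrades."""
    total_self_hosted = hardware_cost + monthly_power * months
    return math.ceil(total_self_hosted / (api_cost_per_request * months))

# e.g. a $2,000 GPU plus $30/month power vs. $0.01 per API request:
# over one year, self-hosting breaks even near 20,000 requests/month.
print(breakeven_requests(2000, 30, 0.01))
```

Below that volume the API wins; well above it, the fixed hardware cost amortizes quickly, which is where the order-of-magnitude savings at high volume come from.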