Tarvis · AI 编码工具 & 大模型比价平台
大模型定价

大模型定价

17+ 大模型 API 价格实时对比,找到最具性价比的模型

实时数据更新于 尚未刷新

DeepSeek V4-Flash

DeepSeek

最便宜
Input / 1M
$0.14
Output / 1M
$0.28
上下文128K
极致性价比MoE 架构代码生成
直连有免费层MMLU ~88%

DeepSeek V4-Pro

DeepSeek

Input / 1M
$1.74
Output / 1M
$3.48
上下文128K
1.6T 旗舰MoE 架构推理增强
直连MMLU ~92%

DeepSeek R1

DeepSeek

Input / 1M
$0.55
Output / 1M
$2.19
上下文128K
推理增强思维链数学专精
直连MATH ~95%

GPT-4.1-nano

OpenAI

超低价
Input / 1M
$0.10
Output / 1M
$0.40
上下文1M
超低价1M 上下文指令遵循
需翻墙

GPT-4.1

OpenAI

Input / 1M
$2.00
Output / 1M
$8.00
上下文1M
1M 上下文代码专精指令遵循
需翻墙

GPT-5.5

OpenAI

旗舰
Input / 1M
$5.00
Output / 1M
$30.00
上下文256K
最新旗舰超强推理多模态
需翻墙

Claude Haiku 4.5

Anthropic

Input / 1M
$1.00
Output / 1M
$5.00
上下文200K
轻量快速200K 上下文缓存 90% 折扣
需翻墙

Claude Sonnet 4.6

Anthropic

Input / 1M
$3.00
Output / 1M
$15.00
上下文200K
代码专精均衡之选200K 上下文
需翻墙SWE-Bench ~72%

Claude Opus 4.8

Anthropic

推理最强
Input / 1M
$5.00
Output / 1M
$25.00
上下文200K
最强推理200K 上下文缓存 90% 折扣
需翻墙SWE-Bench ~82%

Gemini 2.5 Flash-Lite

Google

超低价
Input / 1M
$0.10
Output / 1M
$0.40
上下文1M
超低价1M 上下文免费层
需翻墙AI Studio 免费

Gemini 2.5 Flash

Google

Best Value
Input / 1M
$0.30
Output / 1M
$2.50
上下文1M
1M 上下文多模态免费层
需翻墙AI Studio 免费

Gemini 2.5 Pro

Google

Input / 1M
$1.25
Output / 1M
$5.00
上下文1M
1M 上下文多模态旗舰视频理解
需翻墙AI Studio 有限免费MMLU ~90%

Llama 4 Scout

Meta

10M上下文
Input / 1M
$0.08
Output / 1M
$0.30
上下文10M
10M 超长上下文开源可自部署109B MoE
直连+翻墙

Llama 4 Maverick

Meta

开源旗舰
Input / 1M
$0.20
Output / 1M
$0.80
上下文1M
400B MoE开源最强1M 上下文
直连+翻墙

GLM-4-Flash

智谱 AI

永久免费
Input / 1M
免费
Output / 1M
免费
上下文128K
永久免费30 RPM中文优秀
直连永久免费,无额度上限

GLM-5

智谱 AI

Input / 1M
¥5.00
Output / 1M
¥15.00
上下文128K
智谱旗舰推理增强中文领先
直连C-Eval ~93%

Qwen-Turbo

阿里巴巴

国产超低价
Input / 1M
¥0.30
Output / 1M
¥0.60
上下文128K
超低价快速响应代码生成
直连

Qwen 3 235B

阿里巴巴

Input / 1M
¥4.00
Output / 1M
¥12.00
上下文128K
最新旗舰235B 参数代码生成
直连C-Eval ~95%

Kimi (Moonshot)

月之暗面

Input / 1M
¥2.00
Output / 1M
¥6.00
上下文128K
长上下文联网搜索文件解析
直连

MiniMax

稀宇科技

Input / 1M
¥1.00
Output / 1M
¥2.00
上下文128K
超低价多模态语音合成
直连

Mistral Large 3

Mistral AI

EU-based
Input / 1M
$2.00
Output / 1M
$6.00
上下文128K
Top-tier reasoningMultilingualTool calls
需翻墙MMLU ~81%

Codestral 25.01

Mistral AI

Code-tuned
Input / 1M
$0.30
Output / 1M
$0.90
上下文256K
Code-specialized256K context80+ languages
需翻墙HumanEval ~92%

Mistral Medium 3

Mistral AI

Input / 1M
$0.40
Output / 1M
$2.00
上下文128K
Cost-efficientEnterprise-gradeTool calls
需翻墙

Command A

Cohere

RAG leader
Input / 1M
$2.50
Output / 1M
$10.00
上下文256K
111B paramsRAG-optimized256K context
需翻墙MMLU ~85%

Command R+

Cohere

Input / 1M
$2.50
Output / 1M
$10.00
上下文128K
RAG workflowsTool use104B params
需翻墙

Command R

Cohere

Input / 1M
$0.15
Output / 1M
$0.60
上下文128K
Cost-efficient RAGTool use35B params
需翻墙

Grok 4

xAI

Frontier
Input / 1M
$3.00
Output / 1M
$15.00
上下文256K
Frontier reasoning256K contextReal-time data
需翻墙GPQA ~75%

Grok 4 Fast

xAI

Input / 1M
$0.20
Output / 1M
$1.50
上下文128K
Fast inference1ms latency tierTool use
需翻墙

Sonar Pro

Perplexity

Online + RAG
Input / 1M
$3.00
Output / 1M
$15.00
上下文200K
Online searchCitations200K context
需翻墙

Sonar Reasoning Pro

Perplexity

Input / 1M
$2.00
Output / 1M
$8.00
上下文127K
Chain-of-thoughtOnline searchCitations
需翻墙

Phi-4

Microsoft

SLM
Input / 1M
$0.07
Output / 1M
$0.14
上下文16K
14B SLMUltra-cheapStrong reasoning
直连MMLU ~80%

Phi-4 Mini

Microsoft

Cheapest
Input / 1M
$0.01
Output / 1M
$0.03
上下文128K
3.8B params128K contextMobile-ready
直连

Llama Nemotron 405B

NVIDIA

Reasoning
Input / 1M
$0.50
Output / 1M
$1.50
上下文128K
405B paramsReasoning-tunedTool use
直连+翻墙

Llama Nemotron 70B

NVIDIA

Input / 1M
$0.12
Output / 1M
$0.36
上下文128K
70B paramsReasoning-tunedTool use
直连+翻墙

Qwen 3 Plus

Alibaba

Input / 1M
$0.40
Output / 1M
$1.20
上下文128K
MultilingualTool useCost-efficient
直连+翻墙

Qwen VL Max

Alibaba

Multimodal
Input / 1M
$2.70
Output / 1M
$2.70
上下文32K
Multimodal visionOCRImage understanding
直连

Hunyuan Turbo

Tencent

Input / 1M
¥1.50
Output / 1M
¥5.00
上下文256K
256K contextMultilingualWeChat ecosystem
直连

Kimi K2

Moonshot AI

Agentic
Input / 1M
$0.60
Output / 1M
$2.50
上下文200K
1T MoEAgentic tool use200K context
直连SWE-Bench ~70%

Pangu Ultra

Huawei

Input / 1M
¥8.00
Output / 1M
¥24.00
上下文32K
Huawei Cloud onlyEnterprise-gradeIndustry-tuned
直连

ERNIE 5.0

Baidu

Frontier
Input / 1M
¥4.00
Output / 1M
¥12.00
上下文128K
Frontier Chinese LLMMultimodalTool use
直连

ERNIE Speed

Baidu

Input / 1M
¥0.40
Output / 1M
¥0.80
上下文128K
Cost-efficientFast inference128K context
直连

Doubao Pro

ByteDance

Input / 1M
¥0.80
Output / 1M
¥2.00
上下文256K
256K contextMultimodalVolcano engine
直连

Doubao Lite

ByteDance

Ultra-cheap
Input / 1M
¥0.30
Output / 1M
¥0.60
上下文128K
Ultra-cheap128K contextFast inference
直连

Step-2

StepFun

Input / 1M
¥3.80
Output / 1M
¥13.00
上下文32K
1T MoEMultimodalVideo understanding
直连

Yi Large

01.AI

Input / 1M
$2.50
Output / 1M
$2.50
上下文32K
Bilingual EN/CNTool use32K context
直连+翻墙

DeepSeek V3.5 (Together)

Together AI

Input / 1M
$0.30
Output / 1M
$0.60
上下文128K
Hosted DeepSeekLow latency US/EUOpenAI-compatible API
需翻墙

Qwen 3 (Together)

Together AI

Input / 1M
$0.20
Output / 1M
$0.40
上下文128K
Hosted QwenLow latency US/EUOpenAI-compatible API
需翻墙

DeepSeek V3.5 (Fireworks)

Fireworks AI

Input / 1M
$0.30
Output / 1M
$0.90
上下文128K
Hosted DeepSeekFast US inferenceOpenAI-compatible API
需翻墙

DeepSeek V3.5 (Novita)

Novita AI

Cheapest
Input / 1M
$0.20
Output / 1M
$0.50
上下文128K
Cheapest DeepSeek hostGlobal routingOpenAI-compatible API
直连

OpenRouter Auto

OpenRouter

Aggregator
Input / 1M
~$0.30
Output / 1M
~$0.90
上下文128K
Auto-routing300+ modelsSingle API key
需翻墙

模型横向对比

一览所有模型的核心参数与定价,快速找到最适合你的模型

模型提供商Input PriceOutput Price上下文免费层访问Benchmark
DeepSeek V4-Flash最便宜
DeepSeek$0.14$0.28128K有免费层直连MMLU ~88%
DeepSeek V4-Pro
DeepSeek$1.74$3.48128K直连MMLU ~92%
DeepSeek R1
DeepSeek$0.55$2.19128K直连MATH ~95%
GPT-4.1-nano超低价
OpenAI$0.10$0.401M需翻墙
GPT-4.1
OpenAI$2.00$8.001M需翻墙
GPT-5.5旗舰
OpenAI$5.00$30.00256K需翻墙
Claude Haiku 4.5
Anthropic$1.00$5.00200K需翻墙
Claude Sonnet 4.6
Anthropic$3.00$15.00200K需翻墙SWE-Bench ~72%
Claude Opus 4.8推理最强
Anthropic$5.00$25.00200K需翻墙SWE-Bench ~82%
Gemini 2.5 Flash-Lite超低价
Google$0.10$0.401MAI Studio 免费需翻墙
Gemini 2.5 FlashBest Value
Google$0.30$2.501MAI Studio 免费需翻墙
Gemini 2.5 Pro
Google$1.25$5.001MAI Studio 有限免费需翻墙MMLU ~90%
Llama 4 Scout10M上下文
Meta$0.08$0.3010M直连+翻墙
Llama 4 Maverick开源旗舰
Meta$0.20$0.801M直连+翻墙
GLM-4-Flash永久免费
智谱 AI免费免费128K永久免费,无额度上限直连
GLM-5
智谱 AI¥5.00¥15.00128K直连C-Eval ~93%
Qwen-Turbo国产超低价
阿里巴巴¥0.30¥0.60128K直连
Qwen 3 235B
阿里巴巴¥4.00¥12.00128K直连C-Eval ~95%
Kimi (Moonshot)
月之暗面¥2.00¥6.00128K直连
MiniMax
稀宇科技¥1.00¥2.00128K直连
Mistral Large 3EU-based
Mistral AI$2.00$6.00128K需翻墙MMLU ~81%
Codestral 25.01Code-tuned
Mistral AI$0.30$0.90256K需翻墙HumanEval ~92%
Mistral Medium 3
Mistral AI$0.40$2.00128K需翻墙
Command ARAG leader
Cohere$2.50$10.00256K需翻墙MMLU ~85%
Command R+
Cohere$2.50$10.00128K需翻墙
Command R
Cohere$0.15$0.60128K需翻墙
Grok 4Frontier
xAI$3.00$15.00256K需翻墙GPQA ~75%
Grok 4 Fast
xAI$0.20$1.50128K需翻墙
Sonar ProOnline + RAG
Perplexity$3.00$15.00200K需翻墙
Sonar Reasoning Pro
Perplexity$2.00$8.00127K需翻墙
Phi-4SLM
Microsoft$0.07$0.1416K直连MMLU ~80%
Phi-4 MiniCheapest
Microsoft$0.01$0.03128K直连
Llama Nemotron 405BReasoning
NVIDIA$0.50$1.50128K直连+翻墙
Llama Nemotron 70B
NVIDIA$0.12$0.36128K直连+翻墙
Qwen 3 Plus
Alibaba$0.40$1.20128K直连+翻墙
Qwen VL MaxMultimodal
Alibaba$2.70$2.7032K直连
Hunyuan Turbo
Tencent¥1.50¥5.00256K直连
Kimi K2Agentic
Moonshot AI$0.60$2.50200K直连SWE-Bench ~70%
Pangu Ultra
Huawei¥8.00¥24.0032K直连
ERNIE 5.0Frontier
Baidu¥4.00¥12.00128K直连
ERNIE Speed
Baidu¥0.40¥0.80128K直连
Doubao Pro
ByteDance¥0.80¥2.00256K直连
Doubao LiteUltra-cheap
ByteDance¥0.30¥0.60128K直连
Step-2
StepFun¥3.80¥13.0032K直连
Yi Large
01.AI$2.50$2.5032K直连+翻墙
DeepSeek V3.5 (Together)
Together AI$0.30$0.60128K需翻墙
Qwen 3 (Together)
Together AI$0.20$0.40128K需翻墙
DeepSeek V3.5 (Fireworks)
Fireworks AI$0.30$0.90128K需翻墙
DeepSeek V3.5 (Novita)Cheapest
Novita AI$0.20$0.50128K直连
OpenRouter AutoAggregator
OpenRouter~$0.30~$0.90128K需翻墙