Comparison of Open Source Models

Comparison and analysis of open source AI models across key metrics including quality, inference speed, context window, parameter count and licensing details. Models are considered open source (also commonly referred to as open weights) when their weights are available to download. This allows self-hosting on your own infrastructure and customization of the model, for example through fine-tuning. For more details on our methodology, see our FAQs.
GLM-5.2 (max) and MiniMax-M3 are the highest intelligence open source models, followed by DeepSeek V4 Pro (Max) & Kimi K2.6.

Highlights

Artificial Analysis Openness Index · Higher is better
Artificial Analysis Intelligence Index · Higher is better
Trainable parameters in billions

Openness

Artificial Analysis Openness Index: Score

Openness Index assesses model openness on a 0 to 100 normalized scale (higher is more open)
Reasoning models are indicated by a lightbulb icon

Open Source Progress

Progress in Open Weights vs. Proprietary Intelligence

Artificial Analysis Intelligence Index v4.1 incorporates 9 evaluations: GDPval-AA v2, 𝜏³-Banking, Terminal-Bench v2.1, SciCode, Humanity's Last Exam, GPQA Diamond, CritPt, AA-Omniscience, AA-LCR. See Intelligence Index methodology for further details, including a breakdown of each evaluation and how we run them.

Indicates whether the model weights are available. Models are labelled as 'Commercial Use Restricted' if the weights are available but commercial use is limited (typically requires obtaining a paid license).

Open Source Language Models Intelligence By Lab Over Time

Open Source Models Intelligence By Size Over Time

  • Tiny: Less than or equal to 4B parameters. These models typically have the lowest resource demands.
  • Small: More than 4B and less than 40B parameters.
  • Medium: Between 40B and 150B parameters.
  • Large: Over 150B parameters.
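
The size classes above can be sketched as a small lookup. This is a minimal illustration, not the site's own implementation: the function name `size_class` is hypothetical, and the handling of the exact boundaries (40B and 150B both counted as Medium) is an assumption read from "Between 40B-150B" and "Over 150B".

```python
def size_class(total_params_b: float) -> str:
    """Map a total parameter count (in billions) to a size label.

    Boundary handling is an assumption: exactly 40B and exactly 150B
    are both treated as Medium.
    """
    if total_params_b <= 0:
        raise ValueError("parameter count must be positive")
    if total_params_b <= 4:
        return "Tiny"
    if total_params_b < 40:
        return "Small"
    if total_params_b <= 150:
        return "Medium"
    return "Large"


if __name__ == "__main__":
    # Parameter counts taken from the model listing on this page.
    for name, params in [("Qwen3.5 2B", 2.27), ("Qwen3.6 27B", 27.8),
                         ("gpt-oss-120b", 117), ("GLM-5.2", 753)]:
        print(f"{name}: {size_class(params)}")
```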

Intelligence

Artificial Analysis Intelligence Index

Estimate (independent evaluation forthcoming)

Intelligence Evaluations

Intelligence evaluations measured independently by Artificial Analysis · Higher is better

  • Agentic real-world work tasks, (Elo-500)/2000
  • Agentic coding & terminal use
  • Agentic tool use
  • Long context reasoning
  • Reasoning & knowledge
  • Scientific reasoning
  • Coding
  • Instruction following
  • Physics reasoning
  • Long-horizon agentic tasks
  • Kubernetes incident root-cause analysis
  • Visual reasoning

While model intelligence generally translates across use cases, specific evaluations may be more relevant for certain use cases.
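
The agentic real-world work tasks evaluation listed above is reported as (Elo-500)/2000. A minimal sketch of that rescaling follows, assuming it is a plain linear transform with no clamping; the function name `normalize_elo` and the sample Elo values are hypothetical.

```python
def normalize_elo(elo: float) -> float:
    """Rescale an Elo rating using the (Elo - 500) / 2000 formula from the chart label.

    Assumption: no clamping is applied, so ratings below 500 give
    negative values and ratings above 2500 give values above 1.
    """
    return (elo - 500) / 2000


if __name__ == "__main__":
    for elo in (500, 1500, 2500):  # illustrative ratings, not real results
        print(elo, normalize_elo(elo))
```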

Size

Intelligence Index By Model Size

Estimate (independent evaluation forthcoming)
Large Models (>150B)
Medium Models (40B-150B)
Small Models (4B-40B)

Model Size: Total and Active Parameters

Comparison between total model parameters and parameters active during inference

Total parameters: the total number of trainable weights and biases in the model, expressed in billions. These parameters are learned during training and determine the model's ability to process and generate responses.

Active parameters: the number of parameters actually executed during each inference forward pass, expressed in billions. For Mixture of Experts (MoE) models, a routing mechanism selects a subset of experts per token, resulting in fewer active than total parameters. Dense models use all parameters, so active equals total.
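
To make the total versus active distinction concrete, the sketch below computes the share of parameters used per forward pass. It is an illustration only: the helper name `active_fraction` is hypothetical, and a missing active count is assumed to mean a dense model, matching the rule that active equals total for dense models.

```python
from typing import Optional


def active_fraction(total_b: float, active_b: Optional[float] = None) -> float:
    """Return active / total parameters (both in billions).

    If no active count is given, the model is assumed to be dense,
    so every parameter is active and the fraction is 1.0.
    """
    if total_b <= 0:
        raise ValueError("total parameter count must be positive")
    if active_b is None:
        active_b = total_b  # dense model: active equals total
    if not 0 < active_b <= total_b:
        raise ValueError("active parameters must be in (0, total]")
    return active_b / total_b


if __name__ == "__main__":
    # Figures from the model listing on this page.
    print(f"GLM-5.2 (753B total, 40B active): {active_fraction(753, 40):.1%}")
    print(f"Qwen3.6 27B (dense, 27.8B): {active_fraction(27.8):.1%}")
```

A low fraction indicates a sparse MoE model: memory requirements follow the total count, while per-token compute follows the active count.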

Intelligence vs. Active Parameters

Active parameters at inference time · Artificial Analysis Intelligence Index
Most attractive quadrant

Intelligence vs. Total Parameters

Artificial Analysis Intelligence Index · Size in parameters (billions)
Most attractive quadrant
Labs shown: Alibaba, DeepSeek, Google, Kimi, MBZUAI Institute of Foundation Models, MiniMax, Mistral, NVIDIA, OpenAI, Xiaomi, Z AI

Context Window

Context Window

Context window: token limit · Higher is better

Larger context windows are relevant to Retrieval Augmented Generation (RAG) workflows, which typically involve reasoning over and retrieving information from large amounts of data.

Context window: the maximum number of combined input and output tokens. Output tokens commonly have a significantly lower limit, which varies by model.
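
Because the context window covers input and output tokens combined, a request fits only if both together stay within the limit. The sketch below shows that budget check under stated assumptions: the function names are hypothetical, a "200k" label is treated as exactly 200,000 tokens (the displayed values may be rounded), and any separate per-model output cap is not modeled.

```python
def fits_context(input_tokens: int, output_tokens: int, context_window: int) -> bool:
    """Check that input plus requested output stays within the context window."""
    if min(input_tokens, output_tokens, context_window) < 0:
        raise ValueError("token counts must be non-negative")
    return input_tokens + output_tokens <= context_window


def max_input_tokens(context_window: int, reserved_output: int) -> int:
    """Input budget left after reserving room for the model's output."""
    return max(context_window - reserved_output, 0)


if __name__ == "__main__":
    window = 200_000  # assumed exact value for a "200k" label
    print(fits_context(190_000, 8_000, window))
    print(fits_context(195_000, 8_000, window))
    print(max_input_tokens(window, 8_000))
```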

Further details

Weights
Provider Benchmarks
GLM-5.2 (max) | Z AI | 51 | 753B (40B active) | 1.00M | $0.9 | 105 | Makora, Parasail, Parasail +6
MiniMax-M3 | MiniMax | 44 | 428B (23B active) | 1.00M | $0.2 | 63 | GMI, Together AI, Novita +4
DeepSeek V4 Pro (Reasoning, Max Effort) | DeepSeek | 44 | 1.6T (49B active) | 1.00M | $0.2 | 69 | DeepSeek, Novita, Nebius +8
Kimi K2.6 | Kimi | 43 | 1.0T (32B active) | 256k | $0.7 | 44 | SiliconFlow, Parasail, Eigen AI +12
MiMo-V2.5-Pro | Xiaomi | 42 | 1.0T (42B active) | 1.00M | $0.2 | 38 | GMI, Xiaomi, DeepInfra, Novita
Kimi K2.7 Code | Kimi | 42 | 1.0T (32B active) | 256k | $0.7 | 52 | Together AI, Parasail, CoreWeave +5
DeepSeek V4 Pro (Reasoning, High Effort) | DeepSeek | 41 | 1.6T (49B active) | 1.00M | $0.2 | 60 | DeepSeek, SiliconFlow, Fireworks +8
DeepSeek V4 Flash (Reasoning, Max Effort) | DeepSeek | 40 | 284B (13B active) | 1.00M | $0.1 | 92 | DeepInfra, Makora, SiliconFlow +4
GLM-5.1 (Reasoning) | Z AI | 40 | 744B (40B active) | 200k | $0.9 | 68 | Novita, GMI, Parasail +9
MiMo-V2.5 | Xiaomi | 40 | 310B (15B active) | 1.00M | $0.1 | 77 | Xiaomi, Novita, DeepInfra +2
GLM-5 (Reasoning) | Z AI | 40 | 744B (40B active) | 200k | $0.7 | 75 | SiliconFlow, Lightning AI, FriendliAI +9
MiniMax-M2.7 | MiniMax | 38 | 230B (10B active) | 205k | $0.2 | 44 | Novita, Together AI, GMI +3
Kimi K2.5 (Reasoning) | Kimi | 38 | 1.0T (32B active) | 256k | $0.6 | 52 | FriendliAI, Novita, DeepInfra +12
Nemotron 3 Ultra 550B A55B (Reasoning) | NVIDIA | 38 | 550B (55B active) | 262k | $0.6 | 170 | Not available | CoreWeave, Blackbox AI, Nebius +5
DeepSeek V4 Flash (Reasoning, High Effort) | DeepSeek | 37 | 284B (13B active) | 1.00M | $0.1 | - | DeepSeek, Novita, SiliconFlow +5
Qwen3.6 27B (Reasoning) | Alibaba | 37 | 27.8B | 262k | $0.9 | 55 | DeepInfra, Alibaba Cloud, SiliconFlow +2
GLM-5.1 (Non-reasoning) | Z AI | 35 | 744B (40B active) | 200k | $0.9 | 54 | Parasail, FriendliAI, Baseten +5
Kimi K2.6 (Non-reasoning) | Kimi | 35 | 1.0T (32B active) | 256k | $0.7 | 44 | Novita, SiliconFlow, DeepInfra +9
GLM-4.7 (Reasoning) | Z AI | 34 | 357B (32B active) | 200k | $0.7 | 110 | Google, Amazon Bedrock, Cerebras +7
Qwen3.5 27B (Reasoning) | Alibaba | 34 | 27.8B | 262k | $0.5 | 79 | GMI, CoreWeave, SiliconFlow +3
Qwen3.5 397B A17B (Reasoning) | Alibaba | 34 | 397B (17B active) | 262k | $0.9 | 51 | Nebius, Novita, Wafer +9
MiniMax-M2.5 | MiniMax | 34 | 230B (10B active) | 205k | $0.3 | 183 | Lightning AI, Nebius, CoreWeave +13
Hy3-preview (Reasoning) | Tencent | 34 | 295B (21B active) | 256k | $0.1 | 124 | GMI, SiliconFlow
DeepSeek V3.2 (Reasoning) | DeepSeek | 33 | 685B (37B active) | 128k | $0.2 | - | SiliconFlow, ?, DigitalOcean +12
MiMo-V2-Flash (Feb 2026) | Xiaomi | 33 | 309B (15B active) | 256k | $0.1 | 156 | Xiaomi
Kimi K2 Thinking | Kimi | 33 | 1.0T (32B active) | 256k | $0.8 | 120 | Kimi, Novita, Amazon Bedrock +3
GLM-5 (Non-reasoning) | Z AI | 32 | 744B (40B active) | 200k | $0.7 | 63 | SiliconFlow, DeepInfra, Nebius +3
Qwen3.5 122B A10B (Reasoning) | Alibaba | 32 | 125B (10B active) | 262k | $0.7 | 137 | GMI, SiliconFlow, Novita +2
Qwen3.5 397B A17B (Non-reasoning) | Alibaba | 32 | 397B (17B active) | 262k | $0.9 | 52 | Eigen AI, DeepInfra, DigitalOcean +6
Qwen3.6 35B A3B (Reasoning) | Alibaba | 32 | 36B (3B active) | 262k | $0.4 | 170 | Alibaba Cloud, GMI, SiliconFlow +6
MiniMax-M2.1 | MiniMax | 31 | 230B (10B active) | 205k | $0.4 | 201 | MiniMax, FriendliAI, Novita
DeepSeek V4 Pro (Non-reasoning) | DeepSeek | 31 | 1.6T (49B active) | 1.00M | $0.2 | 74 | Lightning AI, Makora, Nebius +2
MiMo-V2-Flash (Reasoning) | Xiaomi | 31 | 309B (15B active) | 256k | $0.1 | 155 | Xiaomi
Ring-2.6-1T | InclusionAI | 31 | 1.0T (63B active) | 262k | $0.5 | 131 | InclusionAI
Mistral Medium 3.5 | Mistral | 30 | 128B | 256k | $1.2 | 77 | Mistral
Step 3.7 Flash | StepFun | 30 | 198B (11B active) | 256k | $0.2 | 360 | StepFun
Kimi K2.5 (Non-reasoning) | Kimi | 29 | 1.0T (32B active) | 256k | $0.8 | 53 | Lightning AI, Fireworks, Microsoft Azure +6
Gemma 4 31B (Reasoning) | Google | 29 | 30.7B | 256k | - | 34 | SambaNova, DeepInfra, Novita +8
Qwen3.5 27B (Non-reasoning) | Alibaba | 29 | 27.8B | 262k | $0.5 | 89 | CoreWeave, DeepInfra, Alibaba Cloud
Command A+ | Cohere | 29 | 218B (25B active) | 192k | - | 194 | Cohere
Qwen3.6 27B (Non-reasoning) | Alibaba | 29 | 27.8B | 262k | $0.9 | 57 | Makora, DeepInfra, Alibaba Cloud, Novita
Qwen3.5 35B A3B (Reasoning) | Alibaba | 29 | 36B (3B active) | 262k | $0.4 | 155 | Novita, SiliconFlow, Alibaba Cloud +2
DeepSeek V4 Flash (Non-reasoning) | DeepSeek | 29 | 284B (13B active) | 1.00M | $0.1 | 99 | CoreWeave, GMI, DeepSeek, Makora
MiniMax-M2 | MiniMax | 28 | 230B (10B active) | 205k | $0.4 | 106 | Amazon Bedrock, MiniMax, Google, Novita
Qwen3.5 122B A10B (Non-reasoning) | Alibaba | 28 | 125B (10B active) | 262k | $0.7 | 163 | Alibaba Cloud, DeepInfra
MiMo-V2.5-Pro (Non-reasoning) | Xiaomi | 28 | 1.0T (41.7B active) | 1.00M | $0.6 | 44 | DeepInfra, GMI, Novita, Xiaomi
GLM-4.7 (Non-reasoning) | Z AI | 27 | 357B (32B active) | 200k | $0.7 | 110 | Cerebras, Together AI, SiliconFlow +6
DeepSeek V3.1 Terminus (Reasoning) | DeepSeek | 26 | 685B (37B active) | 128k | $1.7 | - | Novita, SambaNova
Hy3-preview (Non-reasoning) | Tencent | 26 | 295B (21B active) | 256k | $0.1 | 132 | SiliconFlow, GMI
Ling-2.6-1T | InclusionAI | 26 | 1.0T (63B active) | 262k | $0.5 | - | InclusionAI
Gemma 4 26B A4B (Reasoning) | Google | 26 | 25.2B (3.8B active) | 256k | $0.1 | - | DeepInfra, Parasail, Cloudflare +4
Step 3.5 Flash | StepFun | 26 | 196B (11B active) | 256k | $0.1 | 211 | SiliconFlow, StepFun
DeepSeek V3.2 Exp (Reasoning) | DeepSeek | 25 | 685B (37B active) | 128k | $0.2 | - | Novita, DeepSeek
NVIDIA Nemotron 3 Super 120B A12B (Reasoning) | NVIDIA | 25 | 120.6B (12.7B active) | 1.00M | $0.3 | 149 | Nebius, CoreWeave, Baseten +2
GLM-4.6 (Reasoning) | Z AI | 25 | 357B (32B active) | 200k | $0.7 | 43 | DeepInfra, Together AI, Novita
Qwen3.5 9B (Reasoning) | Alibaba | 25 | 9.65B | 262k | $0.1 | 61 | Together AI, SiliconFlow
Gemma 4 31B (Non-reasoning) | Google | 25 | 30.7B | 256k | $0.2 | 35 | Together AI, Parasail, DeepInfra +4
K-EXAONE (Reasoning) | LG AI Research | 25 | 236B (23B active) | 256k | - | - | -
DeepSeek V3.2 (Non-reasoning) | DeepSeek | 25 | 685B (37B active) | 128k | $0.5 | - | DeepSeek, FriendliAI, GMI +12
Trinity Large Thinking | Arcee AI | 24 | 399B (13B active) | 512k | $0.2 | 182 | Arcee AI, Parasail
Qwen3.6 35B A3B (Non-reasoning) | Alibaba | 24 | 36B (3B active) | 262k | $0.6 | 183 | DeepInfra, Clarifai, Makora +5
gpt-oss-120b (high) | OpenAI | 24 | 117B (5.1B active) | 131k | $0.2 | 338 | Makora, DeepInfra, Eigen AI +23
Kimi K2 0905 | Kimi | 24 | 1.0T (32B active) | 256k | $0.8 | 26 | Novita
Qwen3.5 35B A3B (Non-reasoning) | Alibaba | 23 | 36B (3B active) | 262k | $0.4 | 179 | DeepInfra, Alibaba Cloud
MiMo-V2-Flash (Non-reasoning) | Xiaomi | 23 | 309B (15B active) | 256k | $0.1 | 150 | Xiaomi
GLM-4.6 (Non-reasoning) | Z AI | 23 | 357B (32B active) | 200k | $0.8 | 43 | Novita, Together AI
EXAONE 4.5 33B | LG AI Research | 23 | 34.4B | 262k | - | - | -
GLM-4.7-Flash (Reasoning) | Z AI | 23 | 31.2B (3B active) | 200k | $0.1 | 86 | Novita, Amazon Bedrock, DeepInfra
Qwen3 235B A22B 2507 (Reasoning) | Alibaba | 22 | 235B (22B active) | 256k | $0.6 | 47 | Nebius, Novita, CoreWeave +3
DeepSeek V3.2 Speciale | DeepSeek | 22 | 685B (37B active) | 128k | - | - | -
HyperNova 60B 2605 | Multiverse Computing | 22 | 58.7B (4.8B active) | 131k | $0.1 | 342 | CompactifAI
Gemma 4 12B (Reasoning) | Google | 22 | 12B | 256k | $0.1 | 121 | SiliconFlow
DeepSeek V3.1 Terminus (Non-reasoning) | DeepSeek | 21 | 685B (37B active) | 128k | $0.3 | - | DeepInfra, SambaNova, Novita
DeepSeek V3.2 Exp (Non-reasoning) | DeepSeek | 21 | 685B (37B active) | 128k | $0.2 | - | Novita, DeepSeek
Nemotron Cascade 2 30B A3B | NVIDIA | 21 | 31.6B (3B active) | 1.00M | - | - | -
Apriel-v1.5-15B-Thinker | ServiceNow | 21 | 15B | 128k | - | - | Together AI
Qwen3 Coder Next | Alibaba | 21 | 79.7B (3B active) | 256k | $0.4 | 73 | Parasail, Together AI, Novita, Amazon Bedrock
DeepSeek V3.1 (Non-reasoning) | DeepSeek | 21 | 685B (37B active) | 128k | $0.7 | - | Lightning AI, SambaNova, Baseten +7
Mistral Small 4 (Reasoning) | Mistral | 21 | 119B (6.5B active) | 256k | $0.2 | 166 | Mistral
DeepSeek V3.1 (Reasoning) | DeepSeek | 21 | 685B (37B active) | 128k | $0.7 | - | Google, Novita, SambaNova, Amazon Bedrock
Qwen3 VL 235B A22B (Reasoning) | Alibaba | 21 | 235B (22B active) | 262k | $1.4 | 51 | Novita, Alibaba Cloud
North Mini Code | Cohere | 21 | 30B (3B active) | 256k | - | 174 | Not available | Cohere
Apriel-v1.6-15B-Thinker | ServiceNow | 21 | 15B | 128k | - | - | Together AI
Qwen3.5 9B (Non-reasoning) | Alibaba | 20 | 9.65B | 262k | - | - | -
Gemma 4 26B A4B (Non-reasoning) | Google | 20 | 25.2B (3.8B active) | 256k | $0.2 | 42 | Clarifai, SiliconFlow, GMI +4
Qwen3.5 4B (Reasoning) | Alibaba | 20 | 4.66B | 262k | $0.0 | 27 | DeepInfra
DeepSeek R1 0528 (May '25) | DeepSeek | 20 | 685B (37B active) | 128k | $1.6 | - | Together AI, Google, Novita +3
Qwen3 Next 80B A3B (Reasoning) | Alibaba | 20 | 80B (3B active) | 262k | $1.1 | 170 | Alibaba Cloud, Novita, GMI +5
GLM-4.5 (Reasoning) | Z AI | 19 | 355B (32B active) | 128k | $0.8 | 58 | Novita
Kimi K2 | Kimi | 19 | 1.0T (32B active) | 128k | $0.6 | 25 | Novita, Kimi
Ling 2.6 Flash | InclusionAI | 19 | 107B (7.4B active) | 262k | $0.1 | - | Novita
Seed-OSS-36B-Instruct | ByteDance Seed | 18 | 36.2B | 512k | $0.2 | 35 | SiliconFlow
Qwen3 235B A22B 2507 Instruct | Alibaba | 18 | 235B (22B active) | 256k | $0.3 | 57 | Scaleway, FriendliAI, Novita +9
Qwen3 Coder 480B A35B Instruct | Alibaba | 18 | 480B (35B active) | 262k | $0.5 | 55 | Eigen AI, Novita, CoreWeave +6
Qwen3 VL 32B (Reasoning) | Alibaba | 18 | 33.4B | 256k | $1.5 | 90 | Alibaba Cloud
gpt-oss-120b (low) | OpenAI | 18 | 117B (5.1B active) | 131k | $0.2 | 352 | Cloudflare, Fireworks, Lightning AI +19
MiniMax M1 80k | MiniMax | 18 | 456B (45.9B active) | 1.00M | $0.7 | - | Novita
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | NVIDIA | 18 | 31.6B (3.6B active) | 1.00M | $0.1 | 50 | Nebius, DeepInfra
K2 Think V2 | MBZUAI Institute of Foundation Models | 17 | 70B | 262k | - | - | -
LongCat Flash Lite | LongCat | 17 | 68.5B (3B active) | 256k | - | - | LongCat
HyperCLOVA X SEED Think (32B) | Naver | 17 | 32B | 128k | - | - | -
GLM-4.6V (Reasoning) | Z AI | 17 | 108B (12B active) | 128k | $0.4 | 88 | SiliconFlow, Novita
K-EXAONE (Non-reasoning) | LG AI Research | 17 | 236B (23B active) | 256k | - | - | -
GLM-4.5-Air | Z AI | 17 | 106B (12B active) | 128k | $0.3 | 80 | Together AI, SiliconFlow
Mistral Large 3 | Mistral | 16 | 675B (41B active) | 256k | $0.6 | 50 | Mistral, Amazon Bedrock, Microsoft Azure
Ring-1T | InclusionAI | 16 | 1.0T (50B active) | 128k | - | - | -
Qwen3.5 4B (Non-reasoning) | Alibaba | 16 | 4.66B | 262k | $0.0 | 23 | DeepInfra
Qwen3 30B A3B 2507 (Reasoning) | Alibaba | 16 | 30.5B (3.3B active) | 262k | $0.4 | 129 | Clarifai, Alibaba Cloud
DeepSeek V3 0324 | DeepSeek | 16 | 671B (37B active) | 128k | $1.2 | - | Hyperbolic, DeepInfra, Replicate +3
INTELLECT-3 | Prime Intellect | 16 | 107B (12B active) | 131k | - | - | -
GLM-4.7-Flash (Non-reasoning) | Z AI | 16 | 31.2B (3B active) | 200k | $0.1 | 144 | Novita, Amazon Bedrock
Devstral 2 | Mistral | 15 | 125B | 256k | - | 47 | Mistral
Solar Open 100B (Reasoning) | Upstage | 15 | 102B (12B active) | 128k | - | - | -
Nemotron 3 Nano Omni 30B A3B Reasoning | NVIDIA | 15 | 30B (3B active) | 256k | $0.1 | 289 | Clarifai, Nebius
gpt-oss-20B (high) | OpenAI | 15 | 21B (3.6B active) | 131k | $0.1 | 208 | Clarifai, DeepInfra, Together AI +10
MiniMax M1 40k | MiniMax | 14 | 456B (45.9B active) | 1.00M | - | - | -
gpt-oss-20B (low) | OpenAI | 14 | 21B (3.6B active) | 131k | $0.1 | 219 | Google, CoreWeave, Clarifai +9
Qwen3 VL 235B A22B Instruct | Alibaba | 14 | 235B (22B active) | 262k | $0.5 | 50 | Alibaba Cloud, Eigen AI, Fireworks +2
Llama 4 Maverick | Meta | 14 | 402B (17B active) | 1.00M | $0.3 | 93 | Microsoft Azure, SambaNova, Amazon Bedrock +6
K2-V2 (high) | MBZUAI Institute of Foundation Models | 14 | 70B | 512k | - | - | -
Qwen3 Next 80B A3B Instruct | Alibaba | 14 | 80B (3B active) | 262k | $0.7 | 173 | Hyperbolic, Parasail, Alibaba Cloud +4
Tri-21B-think Preview | Trillion Labs | 14 | 21B | 32.0k | - | - | -
Qwen3 Coder 30B A3B Instruct | Alibaba | 14 | 30.5B (3.3B active) | 262k | $0.3 | 102 | Clarifai, Amazon Bedrock, Scaleway, Alibaba Cloud
Qwen3 235B A22B (Reasoning) | Alibaba | 13 | 235B (22B active) | 32.8k | $1.5 | 56 | Alibaba Cloud
QwQ 32B | Alibaba | 13 | 32.8B | 131k | $0.7 | 30 | Cloudflare
Qwen3 VL 30B A3B (Reasoning) | Alibaba | 13 | 30B (3B active) | 256k | $0.3 | 112 | Fireworks, Eigen AI, Alibaba Cloud, Novita
Gemma 4 12B (Non-reasoning) | Google | 13 | 12B | 262k | - | - | -
Devstral Small 2 | Mistral | 13 | 24B | 256k | - | 45 | Mistral
Ling-1T | InclusionAI | 13 | 1.0T (50B active) | 128k | - | - | -
DeepSeek R1 (Jan '25) | DeepSeek | 13 | 685B (37B active) | 128k | $2.0 | - | Hyperbolic, Microsoft Azure, Novita +3
Gemma 4 E4B (Reasoning) | Google | 12 | 8B (4.5B active) | 128k | - | - | -
K2-V2 (medium) | MBZUAI Institute of Foundation Models | 12 | 70B | 512k | - | - | -
Llama Nemotron Super 49B v1.5 (Reasoning) | NVIDIA | 12 | 49B | 128k | $0.1 | 48 | DeepInfra
Mistral Small 4 (Non-reasoning) | Mistral | 12 | 119B (6.5B active) | 256k | $0.2 | 151 | Mistral
Tri-21B-Think | Trillion Labs | 12 | 21B | 32.0k | - | - | -
Llama 3.3 Nemotron Super 49B v1 (Reasoning) | NVIDIA | 12 | 49B | 128k | - | - | -
Qwen3 4B 2507 (Reasoning) | Alibaba | 12 | 4.02B | 262k | - | - | -
MiniCPM5-1B (Reasoning) | OpenBMB | 12 | 1B | 128k | - | - | -
Magistral Small 1.2 | Mistral | 12 | 24B | 128k | $0.6 | 107 | Mistral, Amazon Bedrock
Sarvam 105B (high) | Sarvam | 12 | 106B (10.3B active) | 128k | $0.0 | 108 | Sarvam
Devstral Small (May '25) | Mistral | 12 | 23.6B | 256k | - | - | -
MiniCPM5-1B (Non-reasoning) | OpenBMB | 12 | 1B | 128k | - | - | -
Qwen3 VL 32B Instruct | Alibaba | 11 | 33.4B | 256k | $0.9 | 67 | Alibaba Cloud
DeepSeek R1 Distill Qwen 32B | DeepSeek | 11 | 32B | 128k | - | - | -
GLM-4.6V (Non-reasoning) | Z AI | 11 | 108B (12B active) | 128k | $0.4 | 83 | SiliconFlow, Novita
Qwen3 235B A22B (Non-reasoning) | Alibaba | 11 | 235B (22B active) | 32.8k | $0.6 | 57 | Alibaba Cloud, Novita
Magistral Small 1 | Mistral | 11 | 23.6B | 40.0k | - | - | -
EXAONE 4.0 32B (Reasoning) | LG AI Research | 11 | 32B | 131k | - | - | -
Qwen3 VL 8B (Reasoning) | Alibaba | 11 | 8.77B | 256k | $0.4 | 110 | Alibaba Cloud
Qwen3 32B (Reasoning) | Alibaba | 10 | 32.8B | 32.8k | $0.2 | 76 | Nebius, Groq, Novita +3
DeepSeek V3 (Dec '24) | DeepSeek | 10 | 671B (37B active) | 128k | $0.4 | - | DeepInfra, Novita, Novita +2
DeepSeek R1 0528 Qwen3 8B | DeepSeek | 10 | 8.19B | 32.8k | - | - | -
Qwen3.5 2B (Reasoning) | Alibaba | 10 | 2.27B | 262k | $0.0 | 24 | DeepInfra
Qwen3 14B (Reasoning) | Alibaba | 10 | 14.8B | 32.8k | $0.4 | 63 | Alibaba Cloud, DeepInfra
Nanbeige4.1-3B | Nanbeige | 10 | 3.93B | 256k | - | - | -
Llama 4 Scout | Meta | 10 | 109B (17B active) | 10.0M | $0.2 | 106 | CoreWeave, CompactifAI, Amazon Bedrock +6
Qwen3 VL 30B A3B Instruct | Alibaba | 10 | 30B (3B active) | 256k | $0.2 | 113 | Alibaba Cloud, Novita, Eigen AI, Fireworks
Hermes 4 - Llama-3.1 70B (Reasoning) | Nous Research | 10 | 70.6B | 128k | $0.2 | 70 | Nebius
Ministral 3 14B | Mistral | 10 | 14B | 256k | $0.2 | 90 | Mistral, Amazon Bedrock
DeepSeek R1 Distill Llama 70B | DeepSeek | 10 | 70B | 128k | $0.7 | 47 | Scaleway, DeepInfra, SambaNova
DeepSeek R1 Distill Qwen 14B | DeepSeek | 10 | 14B | 128k | - | - | -
Falcon-H1R-7B | TII UAE | 10 | 7B | 256k | - | - | -
Ling-flash-2.0 | InclusionAI | 10 | 103B (6.1B active) | 128k | $0.2 | 51 | SiliconFlow
Qwen3 Omni 30B A3B (Reasoning) | Alibaba | 10 | 35.3B (3B active) | 65.5k | $0.3 | 88 | Alibaba Cloud
Qwen2.5 Instruct 72B | Alibaba | 10 | 72B | 131k | $0.2 | - | Alibaba Cloud, DeepInfra, SiliconFlow
Step3 VL 10B | StepFun | 9 | 10.2B | 65.5k | - | - | -
Qwen3 30B A3B (Reasoning) | Alibaba | 9 | 30.5B (3.3B active) | 32.8k | $0.1 | 108 | Fireworks, Eigen AI, Novita +2
Devstral Small (Jul '25) | Mistral | 9 | 24B | 256k | $0.1 | 31 | Mistral
Gemma 4 E2B (Reasoning) | Google | 9 | 5.1B (2.3B active) | 128k | - | - | -
QwQ 32B-Preview | Alibaba | 9 | 32.8B | 32.8k | - | - | -
GLM-4.5V (Reasoning) | Z AI | 9 | 108B (12B active) | 64.0k | $0.7 | 25 | Novita
Mistral Large 2 (Nov '24) | Mistral | 9 | 123B | 128k | $2.4 | 54 | Mistral
Mistral Small 3.2 | Mistral | 9 | 24B | 128k | $0.1 | 128 | Mistral, DeepInfra
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) | NVIDIA | 9 | 253B | 128k | $0.7 | 51 | Nebius
Qwen3 30B A3B 2507 Instruct | Alibaba | 9 | 30.5B (3.3B active) | 262k | $0.2 | 148 | Nebius, Alibaba Cloud, CoreWeave, Clarifai
ERNIE 4.5 300B A47B | Baidu | 9 | 300B (47B active) | 131k | $0.4 | - | Novita, SiliconFlow
Hermes 4 - Llama-3.1 405B (Reasoning) | Nous Research | 9 | 406B | 128k | $1.2 | 37 | Nebius
NVIDIA Nemotron Nano 12B v2 VL (Reasoning) | NVIDIA | 9 | 13.2B | 128k | $0.2 | 283 | DeepInfra
Ministral 3 8B | Mistral | 9 | 8B | 256k | $0.1 | 87 | Mistral, Amazon Bedrock
Gemma 4 E4B (Non-reasoning) | Google | 9 | 8B (4.5B active) | 128k | - | - | -
Granite 4.1 30B | IBM | 9 | 30B | 131k | - | - | -
NVIDIA Nemotron Nano 9B V2 (Reasoning) | NVIDIA | 9 | 9B | 131k | $0.1 | 61 | DeepInfra
Hermes 4 - Llama-3.1 405B (Non-reasoning) | Nous Research | 9 | 406B | 128k | $1.2 | 39 | Nebius
NVIDIA Nemotron 3 Nano 4B | NVIDIA | 9 | 3.97B | 262k | - | - | -
Qwen3.5 2B (Non-reasoning) | Alibaba | 9 | 2.27B | 262k | $0.0 | 26 | DeepInfra
Llama Nemotron Super 49B v1.5 (Non-reasoning) | NVIDIA | 9 | 49B | 128k | $0.1 | 48 | DeepInfra
Qwen3 32B (Non-reasoning) | Alibaba | 9 | 32.8B | 32.8k | $0.2 | 67 | Nebius, Alibaba Cloud, SambaNova +4
Llama 3.3 Instruct 70B | Meta | 9 | 70B | 128k | $0.6 | 91 | SambaNova, Lightning AI, FriendliAI +18
Mistral Small 3.1 | Mistral | 9 | 24B | 128k | $0.1 | 153 | CompactifAI, DeepInfra, Mistral, Cloudflare
K2-V2 (low) | MBZUAI Institute of Foundation Models | 9 | 70B | 512k | - | - | -
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) | NVIDIA | 9 | 4.51B | 128k | - | - | -
Kimi Linear 48B A3B Instruct | Kimi | 9 | 49.1B (3B active) | 1.00M | - | - | -
Llama 3.1 Instruct 405B | Meta | 9 | 405B | 128k | $3.1 | 48 | Microsoft Azure, Databricks, Amazon Bedrock, Amazon Bedrock
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) | NVIDIA | 8 | 49B | 128k | - | - | -
Qwen3 VL 8B Instruct | Alibaba | 8 | 8.77B | 256k | $0.2 | 120 | Alibaba Cloud
Qwen3 4B (Reasoning) | Alibaba | 8 | 4.02B | 32.0k | $0.2 | - | Alibaba Cloud
Llama 3.1 Tulu3 405B | Allen Institute for AI | 8 | 405B | 128k | - | - | -
Ring-flash-2.0 | InclusionAI | 8 | 103B (6.1B active) | 128k | $0.2 | - | SiliconFlow
Pixtral Large | Mistral | 8 | 124B | 128k | $2.4 | 50 | Mistral
Olmo 3.1 32B Think | Allen Institute for AI | 8 | 32.2B | 65.5k | - | - | Parasail
Grok 2 (Dec '24) | xAI | 8 | 270B | 131k | - | - | -
Qwen3 VL 4B (Reasoning) | Alibaba | 8 | 4.44B | 256k | - | - | -
Command A | Cohere | 8 | 111B | 256k | $3.3 | 71 | Cohere, Microsoft Azure
Llama 3.1 Nemotron Instruct 70B | NVIDIA | 8 | 70B | 128k | $1.2 | 295 | DeepInfra
Qwen2.5 Instruct 32B | Alibaba | 7 | 32B | 128k | - | - | -
Qwen3 8B (Reasoning) | Alibaba | 7 | 8.19B | 131k | $0.2 | 38 | Eigen AI, Alibaba Cloud
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning) | NVIDIA | 7 | 31.6B (3.6B active) | 1.00M | $0.1 | 61 | DeepInfra
NVIDIA Nemotron Nano 9B V2 (Non-reasoning) | NVIDIA | 7 | 9B | 131k | $0.1 | 129 | DeepInfra, Amazon Bedrock
Mistral Large 2 (Jul '24) | Mistral | 7 | 123B | 128k | $2.4 | - | Amazon Bedrock
Qwen3 4B 2507 Instruct | Alibaba | 7 | 4.02B | 262k | - | - | -
Qwen2.5 Coder Instruct 32B | Alibaba | 7 | 32B | 131k | - | - | -
Qwen3 14B (Non-reasoning) | Alibaba | 7 | 14.8B | 32.8k | $0.3 | 63 | DeepInfra, Alibaba Cloud
GLM-4.5V (Non-reasoning) | Z AI | 7 | 108B (12B active) | 64.0k | $0.7 | 19 | Novita
Mistral Small 3 | Mistral | 7 | 24B | 32.0k | $0.1 | 157 | DeepInfra, Mistral
MiniCPM-V 4.6 1.3B | OpenBMB | 7 | 1.3B | 262k | - | - | -
Hermes 4 - Llama-3.1 70B (Non-reasoning) | Nous Research | 7 | 70.6B | 128k | $0.2 | 72 | Nebius
Qwen3 30B A3B (Non-reasoning) | Alibaba | 7 | 30.5B (3.3B active) | 32.8k | $0.1 | 107 | DeepInfra, Eigen AI, Alibaba Cloud
DeepSeek-V2.5 (Dec '24) | DeepSeek | 7 | 236B (21B active) | 128k | - | - | -
Qwen3 4B (Non-reasoning) | Alibaba | 7 | 4.02B | 32.0k | $0.1 | - | Alibaba Cloud
Llama 3.1 Instruct 70B | Meta | 7 | 70B | 128k | $0.6 | 30 | Amazon Bedrock, Amazon Bedrock, DeepInfra, DeepInfra
Granite 4.1 8B | IBM | 7 | 8B | 131k | $0.1 | 120 | CoreWeave
Sarvam 30B (high) | Sarvam | 7 | 32.2B (2.4B active) | 65.5k | $0.0 | 166 | Sarvam
DeepSeek-V2.5 | DeepSeek | 7 | 236B (21B active) | 128k | - | - | -
Olmo 3.1 32B Instruct | Allen Institute for AI | 6 | 32.2B | 65.5k | - | - | -
DeepSeek R1 Distill Llama 8B | DeepSeek | 6 | 8B | 128k | - | - | -
Gemma 4 E2B (Non-reasoning)
Google logoGoogle
6
5.1B
2.3B active at inference time
128k
-
-
-
Olmo 3 32B Think
Allen Institute for AI logoAllen Institute for AI
6
32.2B
65.5k
-
-
-
R1 1776
Perplexity logoPerplexity
6
671B
37B active at inference time
128k
-
-
-
Llama 3.2 Instruct 90B (Vision)
Meta logoMeta
6
90B
128k
$1.4
57
Amazon BedrockMicrosoft Azure
Solar Mini
Upstage logoUpstage
6
10.7B
4.10k
$0.1
-
Upstage
Llama 3.1 Instruct 8B
Meta logoMeta
6
8B
128k
$0.1
154
Eigen AIFriendliAINebius
+12
Grok-1
xAI logoxAI
6
314B
78B active at inference time
8.19k
-
-
-
Qwen2 Instruct 72B
Alibaba logoAlibaba
6
72B
131k
-
-
-
EXAONE 4.0 32B (Non-reasoning)
LG AI Research logoLG AI Research
6
32B
131k
-
-
-
Ministral 3 3B
Mistral logoMistral
6
3B
256k
$0.1
184
MistralAmazon Bedrock
DeepHermes 3 - Mistral 24B Preview (Non-reasoning)
Nous Research logoNous Research
5
24B
32.0k
-
-
-
Jamba 1.7 Large
AI21 Labs logoAI21 Labs
5
398B
94B active at inference time
256k
$2.6
60
AI21 Labs
Granite 4.0 H Small
IBM
5
32B
9B active at inference time
128k
$0.1
393
Replicate
Jamba 1.5 Large
AI21 Labs
5
398B
94B active at inference time
256k
$2.6
-
Amazon Bedrock
Qwen3 Omni 30B A3B Instruct
Alibaba
5
35.3B
3B active at inference time
65.5k
$0.3
95
Alibaba Cloud
Hermes 3 - Llama-3.1 70B
Nous Research
5
70.6B
128k
$0.3
28
DeepInfra
Qwen3 8B (Non-reasoning)
Alibaba
5
8.19B
32.8k
$0.2
39
Eigen AI, Fireworks, Alibaba Cloud
DeepSeek-Coder-V2
DeepSeek
5
236B
21B active at inference time
128k
-
-
-
OLMo 2 32B
Allen Institute for AI
5
32.2B
4.10k
-
-
-
Jamba 1.6 Large
AI21 Labs
5
398B
94B active at inference time
256k
$2.6
60
AI21 Labs
Qwen3.5 0.8B (Reasoning)
Alibaba
5
0.873B
262k
$0.0
30
DeepInfra
LFM2 24B A2B
Liquid AI
5
23.8B
2.3B active at inference time
32.8k
$0.0
116
Together AI
Phi-4
Microsoft
5
14B
16.0k
$0.2
36
DeepInfra, Microsoft Azure
Gemma 3 27B Instruct
Google
5
27.4B
128k
$0.1
-
Parasail, Amazon Bedrock, Google, +3 more
Mistral Small (Sep '24)
Mistral
5
22B
32.8k
$0.2
159
Mistral
Phi-3 Mini Instruct 3.8B
Microsoft
5
3.8B
4.10k
-
-
-
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)
NVIDIA
5
13.2B
128k
$0.2
212
Amazon Bedrock, DeepInfra
Gemma 3n E4B Instruct Preview (May '25)
Google
5
8.39B
4B active at inference time
32.0k
-
-
-
Phi-4 Multimodal Instruct
Microsoft
5
5.6B
128k
-
15
Microsoft Azure
Qwen2.5 Coder Instruct 7B
Alibaba
4
7.62B
131k
-
-
-
Qwen3.5 0.8B (Non-reasoning)
Alibaba
4
0.873B
262k
$0.0
22
DeepInfra
Mixtral 8x22B Instruct
Mistral
4
141B
39B active at inference time
65.4k
-
-
-
Llama 2 Chat 7B
Meta
4
7B
4.10k
$0.1
-
Replicate
Llama 3.2 Instruct 3B
Meta
4
3B
128k
$0.1
52
Amazon Bedrock
Jamba Reasoning 3B
AI21 Labs
4
3B
262k
-
-
-
Qwen3 VL 4B Instruct
Alibaba
4
4.44B
256k
-
-
-
Qwen1.5 Chat 110B
Alibaba
4
110B
32.0k
-
-
-
Reka Flash 3
Reka AI
4
21B
128k
$0.3
-
Reka AI
Olmo 3 7B Think
Allen Institute for AI
4
7B
65.5k
-
-
-
OLMo 2 7B
Allen Institute for AI
4
7.3B
4.10k
-
-
-
Molmo 7B-D
Allen Institute for AI
4
8.02B
4.10k
-
-
-
Ling-mini-2.0
InclusionAI
4
16.3B
1.4B active at inference time
131k
-
-
-
DeepSeek R1 Distill Qwen 1.5B
DeepSeek
4
1.5B
128k
-
-
-
DeepSeek-V2-Chat
DeepSeek
4
236B
21B active at inference time
128k
-
-
-
Llama 3 Instruct 70B
Meta
3
70B
8.19k
$0.9
-
Replicate, Novita, Amazon Bedrock
Arctic Instruct
Snowflake
3
480B
17B active at inference time
4.00k
-
-
-
Qwen Chat 72B
Alibaba
3
72B
33.8k
-
-
-
Gemma 3 12B Instruct
Google
3
12.2B
128k
$0.1
-
Amazon Bedrock, Google, DeepInfra, +2 more
Llama 3.2 Instruct 11B (Vision)
Meta
3
11B
128k
$0.2
50
Microsoft Azure, Amazon Bedrock, DeepInfra
Granite 4.1 3B
IBM
3
3B
131k
-
-
-
DeepSeek Coder V2 Lite Instruct
DeepSeek
3
16B
2.4B active at inference time
128k
-
-
-
Sarvam M (Reasoning)
Sarvam
3
23.6B
32.8k
-
-
Sarvam
Phi-4 Mini Instruct
Microsoft
3
3.84B
128k
-
43
Microsoft Azure, CoreWeave
Llama 2 Chat 70B
Meta
3
70B
4.10k
-
-
-
DeepSeek LLM 67B Chat (V1)
DeepSeek
3
67B
4.10k
-
-
-
Llama 2 Chat 13B
Meta
3
13B
4.10k
-
-
-
Command-R+ (Apr '24)
Cohere
3
104B
128k
$4.2
-
Amazon Bedrock
OpenChat 3.5 (1210)
OpenChat
3
7B
8.19k
-
-
-
DBRX Instruct
Databricks
3
132B
36B active at inference time
32.8k
-
-
-
Exaone 4.0 1.2B (Reasoning)
LG AI Research
3
1.28B
64.0k
-
-
-
Olmo 3 7B Instruct
Allen Institute for AI
3
7B
65.5k
$0.1
-
Parasail
Exaone 4.0 1.2B (Non-reasoning)
LG AI Research
3
1.28B
64.0k
-
-
-
LFM2.5-1.2B-Thinking
Liquid AI
3
1.17B
32.0k
-
-
-
Jamba 1.7 Mini
AI21 Labs
3
52B
12B active at inference time
258k
-
-
-
LFM2 2.6B
Liquid AI
3
2.57B
32.8k
-
339
Liquid AI
LFM2.5-1.2B-Instruct
Liquid AI
3
1.17B
32.0k
-
492
Liquid AI
Jamba 1.5 Mini
AI21 Labs
3
52B
12B active at inference time
256k
$0.2
-
Amazon Bedrock
Granite 4.0 H 1B
IBM
3
1.5B
128k
-
-
-
Qwen3 1.7B (Reasoning)
Alibaba
3
2.03B
32.0k
$0.2
-
Alibaba Cloud
Jamba 1.6 Mini
AI21 Labs
3
52B
12B active at inference time
256k
$0.2
181
AI21 Labs
Mixtral 8x7B Instruct
Mistral
2
46.7B
12.9B active at inference time
32.8k
$0.5
-
Amazon Bedrock
Gemma 3 270M
Google
2
0.268B
32.0k
-
-
-
Apertus 70B Instruct
Swiss AI Initiative
2
70B
65.5k
$1.0
-
Public AI
Granite 4.0 Micro
IBM
2
3B
128k
-
-
-
DeepHermes 3 - Llama-3.1 8B Preview (Non-reasoning)
Nous Research
2
8B
128k
-
-
-
Llama 65B
Meta
2
65B
2.05k
-
-
-
Qwen Chat 14B
Alibaba
2
14B
8.19k
-
-
-
Mistral 7B Instruct
Mistral
2
7B
8.19k
$0.2
104
Amazon Bedrock, Mistral
Command-R (Mar '24)
Cohere
2
35B
128k
$0.6
-
Amazon Bedrock
Granite 4.0 1B
IBM
2
1.6B
128k
-
-
-
Molmo2-8B
Allen Institute for AI
2
8.66B
36.9k
-
-
-
LFM2 8B A1B
Liquid AI
2
8.34B
1.5B active at inference time
32.8k
-
-
Liquid AI
Granite 3.3 8B (Non-reasoning)
IBM
2
8.17B
128k
$0.1
328
Replicate
Qwen3 1.7B (Non-reasoning)
Alibaba
2
2.03B
32.0k
$0.1
-
Alibaba Cloud
Qwen3 0.6B (Reasoning)
Alibaba
1
0.752B
32.0k
$0.2
-
Alibaba Cloud
Llama 3 Instruct 8B
Meta
1
8B
8.19k
$0.1
-
Novita, DeepInfra, Replicate, Amazon Bedrock
Gemma 3n E4B Instruct
Google
1
8.39B
4B active at inference time
32.0k
$0.0
50
Together AI
LFM2 1.2B
Liquid AI
1
1.17B
32.8k
-
476
Liquid AI
Gemma 3 4B Instruct
Google
1
4.3B
128k
$0.0
-
Amazon Bedrock, DeepInfra, Google
Llama 3.2 Instruct 1B
Meta
1
1B
128k
$0.1
84
Novita, Amazon Bedrock
LFM2.5-VL-1.6B
Liquid AI
1
1.6B
32.0k
-
493
Liquid AI
Granite 4.0 350M
IBM
1
0.35B
32.8k
-
-
-
Granite 4.0 H 350M
IBM
1
0.34B
32.8k
-
-
-
Apertus 8B Instruct
Swiss AI Initiative
1
8B
65.5k
$0.1
-
Public AI
Tiny Aya Global
Cohere
1
3.35B
8.19k
-
-
Cohere
Gemma 3n E2B Instruct
Google
1
5.98B
2B active at inference time
32.0k
-
-
Google
Gemma 3 1B Instruct
Google
1
1B
32.0k
-
-
Google
Qwen3 0.6B (Non-reasoning)
Alibaba
1
0.752B
32.0k
$0.1
-
Alibaba Cloud
EXAONE 4.5 33B (Non-reasoning)
LG AI Research
-
34.4B
262k
-
-
-
Cogito v2.1 (Reasoning)
Deep Cogito
-
671B
37B active at inference time
128k
$1.3
91
Together AI