Inference Providers · Metrics for top trending models

| Model | Provider | Input ($/1M tokens) | Output ($/1M tokens) | Context (tokens) | Latency (s) | Throughput (t/s) | Tools | Structured output |
|---|---|---|---|---|---|---|---|---|
| Qwen/Qwen3-32B | groq (fastest) | $0.29 | $0.59 | 131,072 | 0.22 | 326 | Yes | No |
| Qwen/Qwen3-32B | novita | $0.10 | $0.45 | 40,960 | 4.35 | 13 | No | No |
| Qwen/Qwen3-32B | cerebras | - | - | - | - | - | - | - |
| Qwen/Qwen3-32B | sambanova | $0.40 | $0.80 | 32,768 | 3.29 | 166 | Yes | Yes |
| Qwen/Qwen3-32B | nscale (cheapest) | $0.08 | $0.25 | 40,960 | 0.99 | 28 | Yes | No |
| Qwen/Qwen3-32B | ovhcloud | $0.09 | $0.25 | 32,768 | 0.39 | 41 | Yes | Yes |
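
To make the per-million-token prices and speed figures listed above easier to compare, here is a minimal Python sketch that estimates the cost and rough duration of a single request for each provider. The `Offer` class, the helper names, and the example token counts are all hypothetical and not part of any Hugging Face API. It also assumes that the latency column is time to first token and that throughput is output tokens per second, which the page does not state explicitly. Cerebras is left out because the page lists no metrics for it.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Offer:
    """One provider row from the page (hypothetical container, not an HF type)."""
    provider: str
    input_usd_per_m: float   # price per 1M input tokens, in USD
    output_usd_per_m: float  # price per 1M output tokens, in USD
    latency_s: float         # assumed: seconds until the first token
    throughput_tps: float    # assumed: output tokens per second


# Figures copied from the page for Qwen/Qwen3-32B.
OFFERS = [
    Offer("groq", 0.29, 0.59, 0.22, 326),
    Offer("novita", 0.10, 0.45, 4.35, 13),
    Offer("sambanova", 0.40, 0.80, 3.29, 166),
    Offer("nscale", 0.08, 0.25, 0.99, 28),
    Offer("ovhcloud", 0.09, 0.25, 0.39, 41),
]


def request_cost(offer: Offer, input_tokens: int, output_tokens: int) -> float:
    """Estimated USD cost of one request at the listed per-1M-token prices."""
    return (
        input_tokens * offer.input_usd_per_m
        + output_tokens * offer.output_usd_per_m
    ) / 1_000_000


def estimated_seconds(offer: Offer, output_tokens: int) -> float:
    """Rough wall-clock estimate: latency plus generation time (an assumption)."""
    return offer.latency_s + output_tokens / offer.throughput_tps


# Example workload (made-up numbers): 2,000 input tokens, 500 output tokens.
INPUT_TOKENS, OUTPUT_TOKENS = 2_000, 500

cheapest = min(OFFERS, key=lambda o: request_cost(o, INPUT_TOKENS, OUTPUT_TOKENS))
fastest = min(OFFERS, key=lambda o: estimated_seconds(o, OUTPUT_TOKENS))

for offer in OFFERS:
    cost = request_cost(offer, INPUT_TOKENS, OUTPUT_TOKENS)
    secs = estimated_seconds(offer, OUTPUT_TOKENS)
    print(f"{offer.provider:<10} ${cost:.6f}  ~{secs:.1f}s")
```

For this example workload the sketch selects nscale on cost and groq on estimated time, which agrees with the page's "cheapest" and "fastest" labels, although the page may compute those labels differently. Context length and tool or structured-output support are not considered here and would need to be filtered separately.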