Proof, not assertion

Live

What the InferAll gateway actually serves, right now. All counts are fetched server-side from /ai/v1/models and /ai/v1/health and refreshed at most once per hour — same numbers a developer can verify with curl.

Gateway status

Healthy

Models

207

Total

118

Free

89

Paid

“Free” means the model's pricing fields are explicitly $0 — for token models that's inputPerM and outputPerM both 0. Missing pricing metadata is not counted as free.

By vendor

VendorModelsFree
google5111
nvidia4343
openai212
runway12
meta1111
mistralai1111
anthropic9
microsoft55
ibm44
qwen44
writer44
deepseek-ai33
other2
stability-ai2
black-forest-labs2
minimax2
stepfun-ai22
kwaivgi1
01-ai11
abacusai11
adept11
ai21labs11
aisingapore11
baai11
bigcode11
bytedance11
databricks11
minimaxai11
moonshotai11
nv-mistralai11
sarvamai11
snowflake11
stockmark11
upstage11
z-ai11
zyphra11

Vendor is the namespace in the model id (e.g. meta/llama-… meta). Many OSS models are routed through NVIDIA NIM regardless of vendor — see /docs for the routing details.

Beyond what's enumerated above: the gateway also passes any Replicate model id (owner/name) directly through to Replicate, and any NVIDIA NIM model id through to NIM — so the routable catalog is much larger than the number listed here. The counts on this page reflect what's explicitly enumerated in the registry today.

By source

169

Live (dynamic pricing pulled from upstream APIs)

38

Static (pricing pinned in the gateway registry)

Verify

Want to count for yourself? curl https://api.inferall.ai/ai/v1/models | jq 'keys | length' should equal 207 within the hour.