Proof, not assertion

Live

What the InferAll gateway actually serves, right now. All counts are fetched server-side from /ai/v1/models and /ai/v1/health and refreshed at most once per hour. They are the same numbers a developer can verify with curl.

Gateway status

Healthy

Models

Total: 217

Free: 119

Paid

“Free” means the model's pricing fields are explicitly $0. For token-priced models, that means inputPerM and outputPerM are both 0. A model with missing pricing metadata is not counted as free.
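The rule above can be sketched as a small predicate. This is an illustration only, not the gateway's implementation: the field names inputPerM and outputPerM come from this page, while the function name is_free and the assumption that pricing arrives as a plain mapping (or is absent entirely) are hypothetical. Models priced in other units are outside this sketch.

```python
from typing import Any, Mapping, Optional


def _is_zero(value: Any) -> bool:
    """True only for a real numeric zero; None, strings, and booleans do not count."""
    return isinstance(value, (int, float)) and not isinstance(value, bool) and value == 0


def is_free(pricing: Optional[Mapping[str, Any]]) -> bool:
    """Hypothetical helper: a token model is free only if both prices are explicitly 0."""
    if pricing is None:
        return False  # missing pricing metadata is never treated as free
    # .get() returns None for an absent field, which _is_zero rejects
    return _is_zero(pricing.get("inputPerM")) and _is_zero(pricing.get("outputPerM"))
```

Under these assumptions, a model with only one of the two fields set to 0 is still classified as not free, which matches the "explicitly $0" wording.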

Usage · trailing 30 days

Requests: 15,500

Tokens routed: 31.6M

Upstream providers used

Values are rounded to stable buckets server-side and refreshed every 5 minutes; see /api/public/stats. User and signup counts are deliberately not exposed because the 2026-06-08 bot burst poisoned them; consumption metrics are bot-resistant by construction.
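The page does not say how the buckets are chosen. As one possible reading, the sketch below rounds a raw count down to three significant figures, so a hypothetical raw value of 15,537 would be published as 15,500. The function name to_bucket, the choice of three significant figures, and rounding down rather than to nearest are all assumptions, not documented gateway behavior.

```python
def to_bucket(value: int, sig_figs: int = 3) -> int:
    """Round a non-negative count down to `sig_figs` significant figures (assumed scheme)."""
    if value <= 0:
        return 0
    digits = len(str(value))
    if digits <= sig_figs:
        return value  # already short enough; nothing to round away
    step = 10 ** (digits - sig_figs)
    # Integer floor division keeps the result exact for large token counts
    return (value // step) * step
```

Rounding like this keeps the displayed figure steady between refreshes: small changes in the raw count do not move the published number until a bucket boundary is crossed.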

By vendor

Vendor	Models	Free
google	53	12
nvidia	44	44
openai	21	2
runway	14	—
meta	11	11
mistralai	11	11
anthropic	9	—
other	5	—
ibm	4	4
writer	4	4
deepseek-ai	3	3
microsoft	3	3
qwen	3	3
stability-ai	2	—
black-forest-labs	2	—
minimax	2	—
bytedance	2	1
minimaxai	2	2
stepfun-ai	2	2
kwaivgi	1	—
sync	1	—
devxpy	1	—
01-ai	1	1
abacusai	1	1
adept	1	1
ai21labs	1	1
aisingapore	1	1
baai	1	1
bigcode	1	1
databricks	1	1
moonshotai	1	1
nv-mistralai	1	1
poolside	1	1
sarvamai	1	1
snowflake	1	1
thinkingmachines	1	1
upstage	1	1
z-ai	1	1
zyphra	1	1

Vendor is the namespace prefix of the model id (e.g. meta/llama-… → meta). Many open-source models are routed through NVIDIA NIM regardless of vendor; see /docs for the routing details.
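A minimal sketch of how the vendor column could be derived from a list of model ids follows. The helper names vendor_of and count_by_vendor are hypothetical, and placing ids without a namespace into "other" is an assumption; the page does not say how the "other" row is populated.

```python
from collections import Counter
from typing import Iterable


def vendor_of(model_id: str) -> str:
    """Return the namespace before the first '/', or 'other' if there is none (assumed)."""
    namespace, sep, _ = model_id.partition("/")
    return namespace if sep and namespace else "other"


def count_by_vendor(model_ids: Iterable[str]) -> Counter:
    """Tally models per vendor namespace."""
    return Counter(vendor_of(model_id) for model_id in model_ids)
```

Called with made-up ids such as "meta/example-a", "meta/example-b", and "standalone-model", count_by_vendor would report two models under meta and one under other.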

Beyond the models enumerated above, the gateway also passes any Replicate model id (owner/name) directly through to Replicate, and any NVIDIA NIM model id through to NIM, so the routable catalog is much larger than the count listed here. The counts on this page reflect only what is explicitly enumerated in the registry today.

By source

Live (dynamic pricing pulled from upstream APIs): 173

Static (pricing pinned in the gateway registry)

Verify

Want to count for yourself? Run curl https://api.inferall.ai/ai/v1/models | jq 'keys | length'. The result should equal 217, allowing up to an hour of lag for this page's refresh.
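The same check can be scripted without jq. The sketch below uses only the Python standard library and the URL from the curl command above; the function names are hypothetical, and it assumes the endpoint answers an unauthenticated GET with a JSON object or array, as the jq expression implies.

```python
import json
from typing import Any
from urllib.request import urlopen

MODELS_URL = "https://api.inferall.ai/ai/v1/models"


def count_models(payload: Any) -> int:
    """Mirror jq's `keys | length`: count object keys or array entries."""
    if isinstance(payload, (dict, list)):
        return len(payload)
    raise TypeError("expected a JSON object or array at the top level")


def fetch_model_count(url: str = MODELS_URL, timeout: float = 10.0) -> int:
    """Download the model listing and return how many entries it contains."""
    with urlopen(url, timeout=timeout) as response:
        return count_models(json.load(response))
```

Calling fetch_model_count() and comparing the result with the total shown on this page gives the same verification as the curl one-liner.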