No API key for first 100k tokens
Sign in to InferAll and start coding. The free tier is enough to evaluate the workflow end-to-end before any billing relationship begins.
VS Code extension
InferAll for VS Code is a Cline-based agent with InferAll's Anthropic-compatible gateway wired in. Sign in once and the first 100k tokens are free — no API keys to manage, no providers to configure, no separate bills.
Install
# 1. Install from the VS Code Marketplace (listing coming soon).
# 2. Open the InferAll sidebar in VS Code.
# 3. Sign in. First 100k tokens are on the house.Why use it
Sign in to InferAll and start coding. The free tier is enough to evaluate the workflow end-to-end before any billing relationship begins.
The InferAll gateway speaks Anthropic's API format. The extension slots into Claude Code-style flows and any tooling chain that already accepts Anthropic-format endpoints.
One vendor relationship for procurement, one billing surface. Every gateway call hits api.inferall.ai — one endpoint to log, govern, and reason about.
When a provider rate-limits or has an incident, the gateway transparently retries against another upstream that can serve the model class. Your editor session keeps moving.
Procurement
Many engineering teams work inside organizations where ad-hoc ChatGPT use is being restricted and where every new SaaS vendor triggers a security review. InferAll is intentionally positioned to fit that constraint: one vendor, one endpoint, one billing relationship, format-compatible with tooling that's already been approved.
We do not claim FedRAMP, SOC 2, HIPAA, FISMA, or any other certification — those require real audits, and we will only publish them when they exist. See /security for the current state.
One contract, one DPA, one endpoint — instead of separately evaluating Anthropic plus OpenAI plus Google plus the agent vendor.
The extension scaffolds an inferall.auditLog.enabled VS Code setting for compliance buyers. (The local audit-log writer is on the near-term roadmap; the gateway already centralizes every call through one endpoint that can be logged today.)
If a team has already cleared Claude Code or Anthropic SDK-based tooling, InferAll drops into that same chain without onboarding a brand new vendor.
Requests leave a single trust boundary. The gateway handles upstream provider selection so the client never talks directly to multiple AI vendors.
Screenshots
Already using Cline?
InferAll for VS Code is the canonical InferAll-branded experience — audit-log scaffolding, single-vendor positioning, no-key first run. If you already use upstream Cline, you can still route it through InferAll by setting the Anthropic base URL to https://api.inferall.ai in Cline's API settings. Both paths remain supported.