Forge LLMs is now available as preview feature.
Preview features are deemed stable; however, they remain under active development and may be subject to shorter deprecation windows. Preview features are suitable for early adopters in production environments.
We release preview features so partners and developers can study, test, and integrate them prior to General Availability (GA). For more information, see Forge release phases: EAP, Preview, and GA.
No free usage allowance: The Forge LLMs API does not include a free monthly usage quota. All token usage is billed.
Forge LLM usage is charged to the developer of the Forge app and counted toward your Forge monthly bill.
Forge LLM usage is tracked in credits, which correspond to model input and output tokens. Each model has a token-to-credit conversion ratio, and more powerful models use more credits per token. On your bill you'll see two line items: input credits and output credits. You can also see a detailed breakdown of usage per model in the developer console.
The model names in this table correspond to the Claude variants listed in Forge LLMs models.
| Model | Credits per 1M tokens | Price per 1M input tokens ($USD) | Price per 1M output tokens ($USD) |
|---|---|---|---|
| Opus 4.6 | 50 credits | $5 | $25 |
| Sonnet 4.5 | 30 credits | $3 | $15 |
| Haiku 4.5 | 10 credits | $1 | $5 |
Forge LLM credits are priced at $0.10 per 1M input credits and $0.50 per 1M output credits.
For example, suppose your app consumes the following tokens in a given month:
Converting the tokens to LLM credits for billing:
Rate this page: