diff --git a/src/content/docs/ai-gateway/features/unified-billing.mdx b/src/content/docs/ai-gateway/features/unified-billing.mdx index 138f9fed9973fb3..4d75aea5748a029 100644 --- a/src/content/docs/ai-gateway/features/unified-billing.mdx +++ b/src/content/docs/ai-gateway/features/unified-billing.mdx @@ -15,6 +15,8 @@ import { DashButton, TabItem, Tabs } from "~/components"; Unified Billing allows users to connect to various AI providers (such as OpenAI, Anthropic, and Google AI Studio) and receive a single Cloudflare bill. To use Unified Billing, you must purchase and load credits into your Cloudflare account in the Cloudflare dashboard, which you can then spend with AI Gateway. +A 5% fee is applied to all credits purchased through Unified Billing. For example, a $100 credit purchase will result in a $105 charge. Inference pricing from providers is passed through with no markup — you pay the same per-token rates as you would directly with the provider. + ## Pre-requisites - Ensure your Cloudflare account has [sufficient credits loaded](#load-credits). diff --git a/src/content/docs/ai-gateway/reference/pricing.mdx b/src/content/docs/ai-gateway/reference/pricing.mdx index 692f63b8fab2663..36dfc006c00a2af 100644 --- a/src/content/docs/ai-gateway/reference/pricing.mdx +++ b/src/content/docs/ai-gateway/reference/pricing.mdx @@ -39,6 +39,10 @@ DLP profiles are shared at the account level with [Cloudflare One](/cloudflare-o [Guardrails](/ai-gateway/features/guardrails/) evaluates prompts and responses using [`@cf/meta/llama-guard-3-8b`](/workers-ai/models/llama-guard-3-8b/) on Workers AI. Usage is billed as [Workers AI](/workers-ai/platform/pricing/) token-based inference — cost scales with the length of the prompts and responses being evaluated. +## Unified Billing + +A 5% fee is applied to all credits purchased through [Unified Billing](/ai-gateway/features/unified-billing/). For example, a $100 credit purchase will result in a $105 charge. Inference pricing from providers is passed through with no markup — you pay the same per-token rates as you would directly with the provider. + ## Logpush Logpush is only available on the Workers Paid plan.