Billing & Credits

VibeFlow uses a usage-based billing model. You pay for the resources you use — projects, features, API calls, and optional add-ons.

Pricing Overview

Core VibeFlow Usage

Resource	Cost
Projects	$4/project/month
Features	$0.20/feature/month

API Calls

Service	Cost Per Call
LLM Gateway	$0.004/call
MCP Gateway	$0.004/call

Add-Ons

Add-On	Cost
AI Studio Agent Runs	$0.007/run
AI Studio Instances	$99/instance/month
Workspaces	$1/workspace/month
Blueprints	$1.50/blueprint/month
Blueprint Nodes	$0.05/node/month
CI/CD Pipelines	$1/pipeline/month
Studio Clusters	$199/cluster/month
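
As a rough worked example, the rates in the three tables above can be combined into a monthly estimate. This is a minimal sketch, not VibeFlow's billing code: the dictionary keys and the `estimate_monthly_cost` helper are hypothetical names chosen for illustration, and the calculation ignores plan allocations, taxes, and discounts.

```python
from decimal import Decimal

# Monthly rates (USD) copied from the tables above; the key names are illustrative.
MONTHLY_RATES = {
    "project": Decimal("4.00"),
    "feature": Decimal("0.20"),
    "workspace": Decimal("1.00"),
    "blueprint": Decimal("1.50"),
    "blueprint_node": Decimal("0.05"),
    "cicd_pipeline": Decimal("1.00"),
    "ai_studio_instance": Decimal("99.00"),
    "studio_cluster": Decimal("199.00"),
}

# Per-use rates (USD) copied from the tables above.
PER_USE_RATES = {
    "llm_gateway_call": Decimal("0.004"),
    "mcp_gateway_call": Decimal("0.004"),
    "ai_studio_agent_run": Decimal("0.007"),
}


def estimate_monthly_cost(usage: dict[str, int]) -> Decimal:
    """Multiply each quantity by its listed rate and round the sum to cents."""
    rates = {**MONTHLY_RATES, **PER_USE_RATES}
    unknown = set(usage) - set(rates)
    if unknown:
        raise KeyError(f"no rate for: {sorted(unknown)}")
    total = sum((rates[name] * qty for name, qty in usage.items()), Decimal("0"))
    return total.quantize(Decimal("0.01"))


example_usage = {
    "project": 3,
    "feature": 40,
    "llm_gateway_call": 10_000,
    "ai_studio_agent_run": 500,
}
print(estimate_monthly_cost(example_usage))  # prints 63.50
```

Here 3 projects ($12.00), 40 features ($8.00), 10,000 LLM Gateway calls ($40.00), and 500 agent runs ($3.50) add up to $63.50.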

Managing Your Subscription

Viewing Usage

Go to Billing in the sidebar
The Usage tab shows current consumption for each resource
Compare usage against your plan limits

Payment Methods

Go to Billing → Payment Methods
Add a credit card via Stripe
Set a default payment method
Remove old payment methods

Invoices

Go to Billing → Invoices
The first 20 invoices load by default, sorted newest first
Click Load more at the bottom of the table to fetch the next page
Each row links to the Stripe-hosted invoice page (View) and the PDF receipt (PDF)
Each status badge (paid / open / void) is color-coded for at-a-glance scanning

Pagination uses Stripe's cursor model: only the page you're viewing is fetched, and older invoices are loaded only when you click Load more. This keeps the page responsive even on accounts with hundreds of invoices.
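
The cursor pattern described above can be sketched as follows. Stripe's list endpoints accept `limit` and `starting_after` and return `data` plus a `has_more` flag; everything else here (`iter_pages`, `make_fake_fetch`, the in-memory invoice list) is a hypothetical stand-in so the example runs without network access. It is not VibeFlow's actual implementation.

```python
from typing import Callable, Iterator, Optional

# A page mirrors the shape of a Stripe list response: {"data": [...], "has_more": bool}.
Page = dict


def iter_pages(
    fetch_page: Callable[[Optional[str], int], Page],
    page_size: int = 20,
) -> Iterator[list[dict]]:
    """Lazily yield one page at a time; nothing is fetched until the caller asks."""
    cursor: Optional[str] = None
    while True:
        page = fetch_page(cursor, page_size)
        rows = page["data"]
        if rows:
            yield rows
        if not rows or not page["has_more"]:
            return
        # The ID of the last row becomes the `starting_after` cursor for the next page.
        cursor = rows[-1]["id"]


def make_fake_fetch(total: int) -> Callable[[Optional[str], int], Page]:
    """Hypothetical in-memory backend standing in for the invoice list API."""
    invoices = [{"id": f"in_{i:03d}"} for i in range(total)]

    def fetch(starting_after: Optional[str], limit: int) -> Page:
        start = 0
        if starting_after is not None:
            ids = [inv["id"] for inv in invoices]
            start = ids.index(starting_after) + 1
        chunk = invoices[start:start + limit]
        return {"data": chunk, "has_more": start + limit < len(invoices)}

    return fetch


pages = iter_pages(make_fake_fetch(45))
first = next(pages)   # initial page load
second = next(pages)  # equivalent to one click on "Load more"
print(len(first), first[0]["id"], second[0]["id"])
```

Because `iter_pages` is a generator, each `next()` call corresponds to one request, which matches the behavior of fetching older invoices only on demand.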

📷 Screenshot needed: Invoices tab showing 20 paid rows + the "Load more" button below the table. Caption: "Invoice history paginates 20 at a time; click Load more for older invoices."

Per-Change vs. Cycle-End Invoicing

When VibeFlow's reconciler updates your Stripe subscription quantities (e.g., when your monthly resource counts change), the resulting proration is bundled into your next scheduled invoice rather than emitted as a standalone per-change invoice. You see one consolidated invoice per billing cycle instead of dozens of small per-event invoices.

User-initiated plan changes — upgrades, add-on purchases, cancellations — still produce an immediate invoice so you have a clear receipt for the action you took.
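
One way to express this split is through the `proration_behavior` parameter that Stripe accepts on subscription updates: `create_prorations` adds proration items to the next scheduled invoice, while `always_invoice` bills them immediately. The sketch below assumes that mapping; `ChangeSource`, `proration_behavior_for`, and `build_subscription_update` are hypothetical names, and whether VibeFlow's reconciler is wired exactly this way is an assumption.

```python
from enum import Enum


class ChangeSource(Enum):
    RECONCILER = "reconciler"  # automated quantity sync
    USER = "user"              # upgrade, add-on purchase, cancellation


def proration_behavior_for(source: ChangeSource) -> str:
    """Pick a Stripe proration_behavior value based on who triggered the change."""
    if source is ChangeSource.USER:
        # Invoice right away so the user gets a receipt for the action.
        return "always_invoice"
    # Defer prorations to the next scheduled invoice.
    return "create_prorations"


def build_subscription_update(source: ChangeSource, item_id: str, quantity: int) -> dict:
    """Assemble the parameters for a subscription update call (no network I/O here)."""
    return {
        "items": [{"id": item_id, "quantity": quantity}],
        "proration_behavior": proration_behavior_for(source),
    }


# "si_example" is a placeholder subscription item ID.
print(build_subscription_update(ChangeSource.RECONCILER, "si_example", 12))
print(build_subscription_update(ChangeSource.USER, "si_example", 12))
```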

Changing Plans

Go to Billing → Subscription
View your current plan
Click Upgrade to see available plans
Select a new plan and confirm

Cancelling

Go to Billing → Subscription
Click Cancel Subscription
Provide a reason (helps us improve)
Confirm cancellation

Your access continues until the end of the current billing period.

Overage

If you exceed your plan limits:

Overage charges apply at the per-unit rates listed above
You'll receive a notification when approaching limits
Overage is billed at the end of the billing cycle
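
The overage rule above amounts to charging the per-unit rate only on the units beyond the plan limit. A minimal sketch, assuming a hypothetical plan limit of 10,000 LLM Gateway calls (actual limits depend on your plan) and a hypothetical helper name:

```python
from decimal import Decimal


def overage_charge(used: int, limit: int, unit_rate: Decimal) -> Decimal:
    """Charge the per-unit rate only for usage above the plan limit."""
    over_units = max(0, used - limit)
    return (over_units * unit_rate).quantize(Decimal("0.01"))


# Hypothetical limit of 10,000 calls; $0.004/call is the LLM Gateway rate listed above.
charge = overage_charge(used=12_500, limit=10_000, unit_rate=Decimal("0.004"))
print(charge)  # prints 10.00
```

In this example, 2,500 calls over the limit at $0.004 each come to $10.00, which would appear on the invoice at the end of the billing cycle.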

Promo Codes

Apply a promotional code during a plan upgrade to receive a discount:

Go to Billing → Subscription → Upgrade
In the checkout step, enter your promo code
The discount is applied to your invoice automatically

Promo codes may provide percentage discounts, fixed amount reductions, or free trial periods depending on the promotion.
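
To illustrate how the two monetary discount types affect an invoice, here is a small sketch. The `apply_promo` helper and its parameter names are hypothetical (they loosely mirror the percent-off and amount-off coupon concepts), and free-trial promotions are not modeled.

```python
from decimal import Decimal
from typing import Optional


def apply_promo(
    subtotal: Decimal,
    percent_off: Optional[Decimal] = None,
    amount_off: Optional[Decimal] = None,
) -> Decimal:
    """Apply exactly one discount type and never let the total go below zero."""
    if (percent_off is None) == (amount_off is None):
        raise ValueError("provide exactly one of percent_off or amount_off")
    if percent_off is not None:
        discount = subtotal * percent_off / Decimal("100")
    else:
        discount = amount_off
    return max(Decimal("0"), subtotal - discount).quantize(Decimal("0.01"))


print(apply_promo(Decimal("63.50"), percent_off=Decimal("20")))  # prints 50.80
print(apply_promo(Decimal("10.00"), amount_off=Decimal("25")))   # prints 0.00
```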

Organization-Level Credit Enforcement

For Team and Enterprise plans, credits are tracked at the organization level:

Org-level credit pool — All team members share a combined credit allocation from the plan tier
Per-user usage tracking — Individual usage is tracked within the org pool
Soft enforcement — When credits are exhausted, overage charges apply at per-unit rates (no hard cutoff for paid plans)
Addon credits — Purchase additional credit packs that stack on top of the tier allocation

The Organization Usage tab in Billing shows a table view with columns for Product, Quantity Used, Limit, Overage, and Overage Rate.
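
The pooling rules above can be modeled roughly as follows. This is an illustrative sketch only: the `OrgCreditPool` class, its field names, and the credit amounts are assumptions, not VibeFlow's data model.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class OrgCreditPool:
    tier_credits: int        # allocation from the plan tier
    addon_credits: int = 0   # purchased packs stack on top of the tier allocation
    usage_by_user: dict[str, int] = field(default_factory=lambda: defaultdict(int))

    @property
    def limit(self) -> int:
        return self.tier_credits + self.addon_credits

    @property
    def used(self) -> int:
        return sum(self.usage_by_user.values())

    @property
    def overage(self) -> int:
        return max(0, self.used - self.limit)

    def record(self, user: str, credits: int) -> None:
        """Soft enforcement: usage is always recorded, never rejected."""
        self.usage_by_user[user] += credits


# Hypothetical numbers for illustration.
pool = OrgCreditPool(tier_credits=1_000, addon_credits=200)
pool.record("alice", 900)
pool.record("bob", 450)
print(pool.used, pool.limit, pool.overage)  # prints 1350 1200 150
```

The `used`, `limit`, and `overage` values correspond to the Quantity Used, Limit, and Overage columns of the Organization Usage table, while `usage_by_user` keeps the per-user attribution inside the shared pool.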

LLM Gateway — Per-Class Token Billing

When the LLM Gateway forwards Anthropic /v1/messages requests, four token classes are tracked separately because each prices differently:

Token class	Multiplier	Why
`input_tokens` (uncached prompt)	× 1.0 base	Standard input rate
`cache_creation_input_tokens`	× 1.25 base	Cost of writing to Anthropic's prompt cache
`cache_read_input_tokens`	× 0.1 base	Discounted reads from a previously cached prompt
`output_tokens`	× output rate	Generated tokens

Service tier multipliers apply to the request total:

Tier	Multiplier	When to expect
`standard`	× 1.0	Default for direct API calls
`batch`	× 0.5	Anthropic Batch API submissions
`priority`	per-contract	Reserved capacity / Priority Tier customers

For high-cache-hit workloads (long system prompts, many turns), cache reads dominate the bill. At the discounted 0.1× multiplier, the per-request cost can be substantially lower than what a naive ledger would show by billing every prompt token at the full input rate (total prompt tokens × base rate, plus output_tokens × output rate). The dashboards reflect the captured per-class breakdown, so you can see exactly where each dollar went.
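
The arithmetic behind the two tables can be sketched like this. The multipliers come from the tables above; the per-token rates (`BASE_INPUT_RATE`, `OUTPUT_RATE`) are placeholder values for illustration, and the function names are hypothetical. The `priority` tier is left out because its multiplier is set per contract.

```python
from decimal import Decimal

# Multipliers on the base input rate, copied from the token-class table above.
INPUT_CLASS_MULTIPLIERS = {
    "input_tokens": Decimal("1.0"),
    "cache_creation_input_tokens": Decimal("1.25"),
    "cache_read_input_tokens": Decimal("0.1"),
}

# Tier multipliers from the service-tier table ("priority" is per-contract, so omitted).
TIER_MULTIPLIERS = {"standard": Decimal("1.0"), "batch": Decimal("0.5")}

# Placeholder per-token rates in USD; real rates depend on the model.
BASE_INPUT_RATE = Decimal("3") / Decimal("1000000")
OUTPUT_RATE = Decimal("15") / Decimal("1000000")


def request_cost(usage: dict[str, int], tier: str = "standard") -> Decimal:
    """Price each token class separately, then apply the tier multiplier to the total."""
    if tier not in TIER_MULTIPLIERS:
        raise ValueError(f"no fixed multiplier for tier {tier!r}")
    input_cost = sum(
        (Decimal(usage.get(name, 0)) * mult * BASE_INPUT_RATE
         for name, mult in INPUT_CLASS_MULTIPLIERS.items()),
        Decimal("0"),
    )
    output_cost = Decimal(usage.get("output_tokens", 0)) * OUTPUT_RATE
    return (input_cost + output_cost) * TIER_MULTIPLIERS[tier]


def naive_cost(usage: dict[str, int]) -> Decimal:
    """Naive ledger: every prompt token billed at the full base input rate."""
    prompt_tokens = sum(usage.get(name, 0) for name in INPUT_CLASS_MULTIPLIERS)
    output_cost = Decimal(usage.get("output_tokens", 0)) * OUTPUT_RATE
    return Decimal(prompt_tokens) * BASE_INPUT_RATE + output_cost


usage = {
    "input_tokens": 1_000,
    "cache_creation_input_tokens": 0,
    "cache_read_input_tokens": 50_000,
    "output_tokens": 500,
}
print(request_cost(usage), naive_cost(usage))
```

With these placeholder rates, the per-class cost is $0.0255 against a naive $0.1605, and the same request on the `batch` tier would come to $0.01275.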

📷 Screenshot needed: LLM Gateway billing dashboard showing a row with non-zero cache_creation_input_tokens and cache_read_input_tokens columns + the corresponding USD cost. Caption: "Per-class token breakdown — cache reads discount visible at the per-request level."

📷 Screenshot needed: Tier-grouped usage breakdown (standard / batch / priority) for an account that has used at least two tiers, so the difference in effective rate is visible. Caption: "Service tier multipliers shown side-by-side."

Note for Bifrost-routed models

Anthropic models routed via the OpenAI-compatible chat-completions endpoint (/v1/chat/completions) only expose cache reads through the upstream Bifrost SDK — cache_creation_input_tokens is collapsed into input_tokens by Bifrost's normalization layer and isn't recoverable through that path. Use the native Anthropic endpoint (/v1/messages) for the full per-class breakdown.

Enterprise / Contact Sales

For enterprise plans with custom pricing, volume discounts, or dedicated support:

Visit the Contact Sales section in Billing
Select your interests (Custom AI Agents, Vibe code to deploy enterprise design, etc.)
A sales representative will contact you

If you're already on the Team tier and try to access an enterprise-only feature (e.g., LLM Gateway FinOps), the Upgrade button on the feature's nudge card is replaced with a disabled Enterprise button. Its tooltip reads "Contact sales to upgrade to the Enterprise tier." Team is the highest self-serve tier; Enterprise upgrades go through sales rather than self-serve checkout.

📷 Screenshot needed: FinOps tab on the LLM Gateway page for a Team-tier user, showing the disabled "Enterprise" button (with tooltip) where the active "Upgrade" button used to be. Caption: "Team-tier users see a disabled Enterprise button on enterprise-only features — Enterprise upgrades go through sales, not self-serve checkout."