StrategyFREEAI Pricing Framework22 min read

AI Pricing Models: How to Price Token-Based, Outcome-Based, and Usage-Based AI Products

A 6-step framework for pricing AI products. Covers token-based, outcome-based, and usage-based models with real examples, margin analysis, and strategies for managing inference cost variability.

By Tim Adair6 steps• Published 2026-02-09

Quick Answer (TL;DR)

Pricing AI products is fundamentally harder than pricing traditional SaaS because the cost of serving each customer varies dramatically based on usage patterns, query complexity, and model selection. A heavy user of your AI features might cost you 50x more to serve than a light user, but traditional per-seat pricing charges them the same amount. This guide presents a 6-step framework for pricing AI products: understanding your AI cost structure, choosing the right pricing model (token-based, outcome-based, usage-based, or hybrid), setting price points that protect margins while maximizing adoption, managing the cost variability problem, communicating AI pricing to customers, and iterating on pricing as your AI capabilities evolve. Product managers who follow this framework avoid the two most common AI pricing mistakes: underpricing (giving away expensive inference for free and losing money on every heavy user) and overpricing (creating so much friction that users never adopt the AI features).


Why AI Pricing Is Different

Traditional SaaS pricing is relatively simple: your marginal cost per additional user is near zero (hosting, bandwidth), so you price based on value and willingness to pay. The cost to serve your heaviest user is roughly the same as your lightest user.

AI products break this model:

  • Variable marginal costs: Every AI query costs money (inference compute, API fees), and costs vary by query complexity. A simple classification might cost $0.001. A complex multi-step reasoning chain might cost $0.50.
  • Unpredictable usage patterns: Users who discover the AI is valuable use it exponentially more. Your best customers become your most expensive customers.
  • Rapid cost changes: Model providers regularly change pricing. A model that costs $X per token today might cost $X/10 in six months or $X*3 if you need to upgrade to a more capable model.
  • Quality-cost tradeoffs: Cheaper models produce lower quality output. Pricing must account for the quality level customers expect.
  • These dynamics create a pricing challenge that per-seat, per-month SaaS pricing was not designed to handle.


    The 6-Step AI Pricing Framework

    Step 1: Understand Your AI Cost Structure

    What to do: Build a detailed model of your AI costs, broken down by feature, query type, and user segment, so you know exactly what it costs to serve each customer.

    Why it matters: You cannot price what you do not understand. Most AI product teams have a rough sense of their monthly AI spend but do not know the cost per query, per feature, or per user segment. Without this granularity, you will inevitably underprice heavy users and overprice light users.

    Cost components to track:

    Cost ComponentDescriptionTypical Range
    Inference computeThe cost of running the model (API fees or self-hosted GPU)$0.001 - $1.00 per query
    Embedding and retrievalCost of vector search, RAG pipeline, knowledge base queries$0.0001 - $0.01 per query
    Data processingCost of preprocessing inputs and postprocessing outputs$0.0001 - $0.005 per query
    StorageCost of storing user context, conversation history, fine-tuning data$0.01 - $0.10 per user/month
    Monitoring and evaluationCost of quality monitoring, drift detection, human evaluation10-20% of inference cost
    Model training/fine-tuningAmortized cost of training or fine-tuning modelsVaries widely

    Building your cost model:

  • Log everything: Instrument your AI pipeline to log the cost of every query (tokens consumed, model used, latency, retrieval steps).
  • Segment by query type: Group queries by complexity, feature, and cost profile. You will likely find that 20% of query types account for 80% of cost.
  • Segment by user: Identify your cost distribution across users. Heavy AI users may cost 10-50x more than light users.
  • Project at scale: Model your costs at 2x, 5x, and 10x current usage. AI costs often do not scale linearly — bulk discounts, caching, and model optimization can reduce per-query cost at scale.
  • Real-world example: A B2B SaaS company added an AI summarization feature. Average cost per summary was $0.03. But they discovered that 15% of their users were running summaries on very long documents that cost $0.40 each — 13x the average. Those heavy users represented 70% of total AI cost. This insight drove them to implement usage-based pricing for the AI feature.


    Step 2: Choose Your Pricing Model

    What to do: Select the pricing model (or combination of models) that best aligns your revenue with your costs while maximizing customer adoption and perceived value.

    Why it matters: The wrong pricing model either leaves money on the table (undercharging heavy users) or kills adoption (overcharging everyone). The right model aligns your incentives with the customer's: you make more money when they get more value.

    AI pricing model comparison:

    ModelHow It WorksBest ForRisks
    Included in subscriptionAI features bundled into existing per-seat pricing at no extra costLow-cost AI features, competitive differentiation, adoption maximizationMargin erosion if AI usage is heavy
    Token/credit-basedCustomers buy credits that are consumed by AI queries, charged per token or per actionVariable-cost AI features, transparent cost pass-throughAdoption friction, customer confusion about credit consumption
    Usage-basedCustomers pay based on actual AI usage (queries, documents processed, outputs generated)High-variance usage patterns, expensive AI featuresRevenue unpredictability, customer budget anxiety
    Outcome-basedCustomers pay based on the results the AI delivers (leads generated, tickets resolved, time saved)High-value AI outputs with measurable outcomesDifficult to attribute outcomes, revenue tied to model performance
    TieredDifferent subscription tiers include different AI usage limits or capabilitiesSegmenting by willingness to pay, upsell pathTier boundaries may not match actual usage patterns
    HybridCombination of subscription base + usage-based AI componentMost SaaS products with AI featuresComplexity in communication and billing

    Decision framework for choosing your model:

    Choose "included in subscription" when:

  • AI inference cost per user is less than 5% of subscription revenue
  • AI features are a competitive necessity (table stakes)
  • Adoption is more important than margin optimization
  • You are in a land-and-expand motion where AI drives initial adoption
  • Choose "token/credit-based" when:

  • Cost per query varies significantly (10x+ between simple and complex queries)
  • Users understand the concept of credits (developer-focused products)
  • You need transparent cost pass-through for high-cost AI operations
  • Customers want granular control over their AI spend
  • Choose "usage-based" when:

  • AI usage varies dramatically between customers (50x+ between lightest and heaviest)
  • AI cost is a significant portion of your COGS
  • Customers have variable workloads (seasonal, project-based)
  • You can clearly define the "unit" of usage (documents processed, queries run)
  • Choose "outcome-based" when:

  • The AI delivers measurable, high-value outcomes (revenue generated, costs saved)
  • You can reliably attribute outcomes to AI actions
  • Customers prefer paying for results rather than inputs
  • You have high confidence in your AI's performance
  • Choose "hybrid" when:

  • You have both low-cost AI features (include in subscription) and high-cost AI features (usage-based)
  • Different customer segments have different usage patterns
  • You want a predictable base revenue + variable upside

  • Step 3: Set Price Points That Protect Margins

    What to do: Calculate your target margins for AI features and set prices that maintain profitability across your customer distribution, including your heaviest users.

    Why it matters: AI features that are priced below cost attract heavy users who generate negative margin. At small scale, this is manageable. At large scale, it can threaten the business. Setting prices that protect margins from day one prevents painful price increases later.

    Margin calculation framework:

    Target AI gross margin: 60-75% (consistent with SaaS benchmarks)

    MetricFormulaExample
    Average cost per AI queryTotal AI cost / Total queries$0.05
    P95 cost per queryCost at 95th percentile query$0.35
    Cost per user per monthTotal AI cost / Active AI users$2.50
    P95 cost per user per monthCost for 95th percentile user$18.00
    Revenue per user per monthAI pricing revenue / Active AI users$8.00
    Gross margin(Revenue - Cost) / Revenue69% at average, negative at P95

    The "whale problem": In usage-based AI pricing, a small number of "whale" users can consume a disproportionate share of resources. Strategies for managing whales:

  • Soft caps: Include a generous AI usage allowance in the subscription, then charge overage rates that increase with volume.
  • Fair use policies: Define reasonable usage limits and require enterprise agreements for users who exceed them.
  • Tiered pricing: Higher tiers include more AI usage, naturally segmenting heavy users into higher-revenue plans.
  • Cost optimization for heavy users: Route heavy users' queries through cheaper models where quality is acceptable, reserving premium models for complex queries.

  • Step 4: Manage Cost Variability

    What to do: Build systems and strategies to manage the inherent variability in AI costs, protecting your margins from spikes while maintaining consistent quality for users.

    Why it matters: AI costs can spike unpredictably due to increased usage, complex queries, model provider price changes, or changes in usage patterns. Without cost management strategies, a viral feature launch or a single heavy customer can blow through your AI budget in days.

    Cost management strategies:

    1. Model routing and fallback

  • Route simple queries to smaller, cheaper models and reserve large, expensive models for complex queries that require them
  • Example: Use a small model for classification, medium for summarization, and large only for complex reasoning tasks
  • Potential savings: 40-70% reduction in average inference cost
  • 2. Caching and deduplication

  • Cache common AI outputs and serve cached results for repeated or similar queries
  • Example: If 100 users ask similar questions about your product, cache the first response and serve it (with slight personalization) to the rest
  • Potential savings: 20-50% reduction for products with repetitive queries
  • 3. Prompt optimization

  • Optimize prompts to use fewer tokens while maintaining output quality
  • Example: Reduce system prompt length, use concise instructions, strip unnecessary context
  • Potential savings: 15-30% reduction in token consumption
  • 4. Batch processing

  • Where latency is not critical, batch AI operations and run them during off-peak hours at lower compute costs
  • Example: Generate weekly AI reports in batch rather than on-demand
  • Potential savings: 20-40% reduction for batch-eligible operations
  • 5. Usage alerts and throttling

  • Implement alerts when individual users or features exceed cost thresholds
  • Apply graceful throttling (reduced quality or speed, not hard cutoffs) when costs spike
  • Protection: Prevents unexpected cost overruns from destroying margins

  • Step 5: Communicate AI Pricing to Customers

    What to do: Design pricing communication that is transparent, easy to understand, and focuses on value rather than cost mechanics.

    Why it matters: AI pricing is inherently more complex than traditional SaaS pricing. If customers cannot understand what they are paying for or predict their bill, they will either avoid the AI features (reducing adoption) or get bill shock (increasing churn). Clear communication is a competitive advantage.

    Pricing communication principles:

    1. Lead with value, not mechanics

  • Bad: "Each AI query consumes 1-50 credits depending on model selection, token count, and retrieval operations."
  • Good: "Analyze up to 500 documents per month. Each additional document is $0.10."
  • Users care about what they can do, not how the billing engine works.
  • 2. Make costs predictable

  • Provide usage estimates based on the customer's profile: "Based on your team size and usage patterns, we estimate your AI costs will be approximately $X/month."
  • Offer budget caps: "Set a monthly AI budget. We will alert you at 80% and pause AI features at 100% unless you choose to continue."
  • 3. Show the ROI

  • Frame AI pricing in terms of value created: "This AI feature saves your team an average of 15 hours per month. At your team's loaded cost, that is $X in savings for $Y in AI fees."
  • Provide ROI calculators that let customers model the value vs. cost for their specific situation.
  • 4. Avoid surprise bills

  • Send usage alerts at meaningful thresholds (50%, 80%, 100% of expected usage)
  • Provide real-time usage dashboards
  • Default to graceful degradation (slower or simpler AI) rather than hard cutoffs or surprise overage charges
  • Pricing page best practices for AI features:

    ElementWhat to Include
    Clear tier comparisonWhat AI features are in each tier, with usage limits
    Usage calculatorInteractive tool that estimates monthly cost based on team size and expected usage
    Overage pricingTransparent per-unit pricing for usage beyond included limits
    ROI examplesCase studies showing value created relative to AI cost
    Free trial of AI featuresLet users experience the value before committing to AI pricing

    Step 6: Iterate on Pricing as AI Evolves

    What to do: Build a pricing iteration cadence that adjusts to changes in AI costs, capabilities, competitive landscape, and customer usage patterns.

    Why it matters: AI costs are dropping 50-80% per year as models become more efficient and competition increases. If you set AI pricing today and never revisit it, you will be overcharging within 12 months. Conversely, new capabilities may justify premium pricing that did not exist when you launched.

    Pricing iteration cadence:

    FrequencyWhat to Review
    MonthlyCost per query trends, margin by customer segment, usage distribution
    QuarterlyPricing model effectiveness, competitive pricing changes, customer feedback on pricing
    AnnuallyFull pricing model review, cost structure changes, new capability pricing

    Triggers for pricing changes:

  • Model cost drops 50%+: Pass savings to customers or reinvest in higher-quality models at the same price
  • New capability launches: Price new AI features separately or include in higher tiers
  • Margin drops below target: Raise prices, optimize costs, or adjust model routing
  • Competitive pressure: Match or differentiate on pricing based on your moat strength
  • Customer feedback: If customers consistently complain about pricing complexity, simplify the model
  • The "getting cheaper" advantage: As AI costs decline, product teams have a strategic choice: lower prices (grow adoption), maintain prices (improve margins), or reinvest savings (use more expensive, higher-quality models at the same price point). The best AI product companies do all three strategically: lower prices for entry-level features to drive adoption, maintain prices for premium features that deliver high value, and reinvest savings into quality improvements that justify the pricing.


    AI Pricing Models in Practice

    Example 1: AI Feature Included in Subscription

    Company profile: B2B SaaS with $50/user/month pricing

    AI feature: Smart search that uses semantic understanding to find relevant documents

    AI cost: $0.005 per search query, average 50 queries per user per month = $0.25/user/month

    Decision: Include in subscription (0.5% of revenue, high adoption value)

    Example 2: Usage-Based AI Pricing

    Company profile: Content platform with $200/month team plan

    AI feature: AI content generation (blog posts, social media, email campaigns)

    AI cost: $0.15 per generation on average, with heavy users generating 200+ per month

    Decision: Include 50 AI generations in base plan, $0.20 per additional generation

    Result: Light users get enough included value, heavy users pay for what they consume

    Example 3: Outcome-Based AI Pricing

    Company profile: Sales intelligence platform with $500/month plan

    AI feature: AI that identifies high-intent leads from web behavior

    AI cost: $0.50 per lead scored

    Decision: Charge $5 per qualified lead delivered (10x cost), with minimum monthly commitment

    Result: Customers pay only for leads they can act on, aligning price with value


    Key Takeaways

  • AI pricing must account for variable marginal costs that traditional per-seat SaaS pricing ignores
  • Understand your cost structure at the per-query, per-feature, and per-user level before setting prices
  • Choose a pricing model (included, token-based, usage-based, outcome-based, or hybrid) based on your cost variance, user value, and strategic goals
  • Protect margins by pricing for your P95 user, not your average user
  • Manage cost variability through model routing, caching, prompt optimization, and usage alerts
  • Communicate pricing by leading with value, making costs predictable, and avoiding surprise bills
  • Iterate on pricing as AI costs decline and capabilities expand — the right price today will not be the right price in 12 months
  • Next Steps:

  • Build a comprehensive AI product strategy
  • Develop the data strategy that fuels your AI
  • Evaluate AI vendors and models for your product

  • Citation: Adair, Tim. "AI Pricing Models: How to Price Token-Based, Outcome-Based, and Usage-Based AI Products." IdeaPlan, 2026. https://ideaplan.io/strategy/ai-pricing-models

    Turn Strategy Into Action

    Use our AI-enhanced roadmap templates to execute your product strategy