gekro
GitHub LinkedIn
AI

Hyperscaler AI Pricing - Bedrock vs Foundry vs Vertex

Compare on-demand inference pricing for the same model across AWS Bedrock, Azure AI Foundry, and Google Vertex AI

Models tracked

23

Platforms

3

Last verified

2026-05-07

Your daily token volume

500K / day → 15M / month

100K / day → 3M / month

Quick presets:

Find a model

Filter by vendor

Sort:

23 models

Llama 4 Maverick 17B

Meta

AWS Bedrock

In: $0.2/1M

Out: $0.6/1M

Monthly cost

Azure AI Foundry

In: $0.2/1M

Out: $0.6/1M

Monthly cost

Google Vertex AI

In: $0.2/1M

Out: $0.6/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Azure AI Foundry: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 meta/llama-4-maverick-17b

Llama 4 Scout 17B

Meta

AWS Bedrock

In: $0.18/1M

Out: $0.55/1M

Monthly cost

Azure AI Foundry

In: $0.18/1M

Out: $0.55/1M

Monthly cost

Google Vertex AI

In: $0.18/1M

Out: $0.55/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Azure AI Foundry: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 meta/llama-4-scout-17b

Llama 3.3 70B Instruct

Meta

AWS Bedrock

In: $0.72/1M

Out: $0.72/1M

Monthly cost

Azure AI Foundry

In: $0.72/1M

Out: $0.72/1M

Monthly cost

Google Vertex AI

In: $0.72/1M

Out: $0.72/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Azure AI Foundry: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 meta/llama-3-3-70b

DeepSeek V3.2

DeepSeek

AWS Bedrock

In: $0.27/1M

Out: $1.1/1M

Monthly cost

Azure AI Foundry

In: $0.27/1M

Out: $1.1/1M

Monthly cost

Google Vertex AI

In: $0.27/1M

Out: $1.1/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Azure AI Foundry: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 deepseek/v3-2

DeepSeek R1

DeepSeek

AWS Bedrock

In: $0.55/1M

Out: $2.19/1M

Monthly cost

Azure AI Foundry

In: $0.55/1M

Out: $2.19/1M

Monthly cost

Google Vertex AI

In: $0.55/1M

Out: $2.19/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Azure AI Foundry: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 deepseek/r1

Mistral Large 3

Mistral

AWS Bedrock

In: $2/1M

Out: $6/1M

Monthly cost

Azure AI Foundry

In: $2/1M

Out: $6/1M

Monthly cost

Google Vertex AI

In: $2/1M

Out: $6/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Azure AI Foundry: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 mistral/large-3

Amazon Nova 2.0 Pro

Amazon

AWS Bedrock

In: $0.8/1M

Out: $3.2/1M

Monthly cost

Azure AI Foundry

N/A

Amazon Nova is Bedrock-exclusive

Google Vertex AI

N/A

Amazon Nova is Bedrock-exclusive

AWS Bedrock: verified 2026-05-07 amazon/nova-2-0-pro

Amazon Nova Premier

Amazon

AWS Bedrock

In: $2.5/1M

Out: $12.5/1M

Monthly cost

Azure AI Foundry

N/A

Bedrock-exclusive

Google Vertex AI

N/A

Bedrock-exclusive

AWS Bedrock: verified 2026-05-07 amazon/nova-premier

Amazon Nova Micro

Amazon

AWS Bedrock

In: $0.035/1M

Out: $0.14/1M

Monthly cost

Azure AI Foundry

N/A

Bedrock-exclusive

Google Vertex AI

N/A

Bedrock-exclusive

AWS Bedrock: verified 2026-05-07 amazon/nova-micro

GLM 5

Z AI

AWS Bedrock

In: $0.5/1M

Out: $1.5/1M

Monthly cost

Azure AI Foundry

In: $0.5/1M

Out: $1.5/1M

Monthly cost

Google Vertex AI

In: $0.5/1M

Out: $1.5/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Azure AI Foundry: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 zai/glm-5

GPT-OSS 120B (open-weights)

OpenAI

AWS Bedrock

In: $0.5/1M

Out: $1.5/1M

Monthly cost

Azure AI Foundry

In: $0.5/1M

Out: $1.5/1M

Monthly cost

Google Vertex AI

In: $0.5/1M

Out: $1.5/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Azure AI Foundry: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 openai/gpt-oss-120b

Qwen3 32B

Alibaba (Qwen)

AWS Bedrock

In: $0.2/1M

Out: $0.6/1M

Monthly cost

Azure AI Foundry

In: $0.2/1M

Out: $0.6/1M

Monthly cost

Google Vertex AI

In: $0.2/1M

Out: $0.6/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Azure AI Foundry: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 qwen/qwen3-32b

GPT-5

OpenAI

AWS Bedrock

N/A

OpenAI closed models are Foundry-exclusive

Azure AI Foundry

In: $1.25/1M

Out: $10/1M

Monthly cost

Google Vertex AI

N/A

OpenAI closed models not on Vertex

Azure AI Foundry: verified 2026-05-07 openai/gpt-5

GPT-5 mini

OpenAI

AWS Bedrock

N/A

OpenAI closed models are Foundry-exclusive

Azure AI Foundry

In: $0.25/1M

Out: $2/1M

Monthly cost

Google Vertex AI

N/A

OpenAI closed models not on Vertex

Azure AI Foundry: verified 2026-05-07 openai/gpt-5-mini

GPT-5.2 chat

OpenAI

AWS Bedrock

N/A

OpenAI closed models are Foundry-exclusive

Azure AI Foundry

In: $1.25/1M

Out: $10/1M

Monthly cost

Google Vertex AI

N/A

OpenAI closed models not on Vertex

Azure AI Foundry: verified 2026-05-07 openai/gpt-5-2-chat

Grok 4.2

xAI

AWS Bedrock

N/A

xAI Grok is Azure-exclusive among the three hyperscalers

Azure AI Foundry

In: $3/1M

Out: $15/1M

Monthly cost

Google Vertex AI

N/A

xAI Grok not on Vertex

Azure AI Foundry: verified 2026-05-07 xai/grok-4-2

Gemini 3.0 Pro

Google

AWS Bedrock

N/A

Google models are Vertex-exclusive

Azure AI Foundry

N/A

Google models are Vertex-exclusive

Google Vertex AI

In: $1.25/1M

Out: $10/1M

Monthly cost

Google Vertex AI: verified 2026-05-07 google/gemini-3-0-pro

Gemini 3.0 Flash

Google

AWS Bedrock

N/A

Google models are Vertex-exclusive

Azure AI Foundry

N/A

Google models are Vertex-exclusive

Google Vertex AI

In: $0.1/1M

Out: $0.4/1M

Monthly cost

Google Vertex AI: verified 2026-05-07 google/gemini-3-0-flash

Gemini 2.5 Pro

Google

AWS Bedrock

N/A

Google models are Vertex-exclusive

Azure AI Foundry

N/A

Google models are Vertex-exclusive

Google Vertex AI

In: $1.25/1M

Out: $10/1M

Monthly cost

Google Vertex AI: verified 2026-05-07 google/gemini-2-5-pro

Gemini 2.5 Flash

Google

AWS Bedrock

N/A

Google models are Vertex-exclusive

Azure AI Foundry

N/A

Google models are Vertex-exclusive

Google Vertex AI

In: $0.1/1M

Out: $0.4/1M

Monthly cost

Google Vertex AI: verified 2026-05-07 google/gemini-2-5-flash

Claude Opus 4

Anthropic

AWS Bedrock

In: $15/1M

Out: $75/1M

Monthly cost

Azure AI Foundry

N/A

Anthropic models not on Foundry

Google Vertex AI

In: $15/1M

Out: $75/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 anthropic/claude-opus-4

Claude Sonnet 4

Anthropic

AWS Bedrock

In: $3/1M

Out: $15/1M

Monthly cost

Azure AI Foundry

N/A

Anthropic models not on Foundry

Google Vertex AI

In: $3/1M

Out: $15/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 anthropic/claude-sonnet-4

Claude Haiku 4

Anthropic

AWS Bedrock

In: $0.8/1M

Out: $4/1M

Monthly cost

Azure AI Foundry

N/A

Anthropic models not on Foundry

Google Vertex AI

In: $0.8/1M

Out: $4/1M

Monthly cost

AWS Bedrock: verified 2026-05-07 · Google Vertex AI: verified 2026-05-07 anthropic/claude-haiku-4
© 2026 Rohit Burani · MIT · Built at gekro.com · View source ↗

Guide

What It Does

Compares the on-demand inference price of the same foundation model across the three managed AI platforms:

  • AWS Bedrock (us-east-1)
  • Azure AI Foundry (eastus, global deployment tier)
  • Google Vertex AI (us-east5)

For each model in the catalogue, you see input + output prices on each platform, the cheapest option highlighted, and “N/A” for platforms where the model isn’t offered.

The cost calculator at the top lets you input your daily token volumes - the comparison rebuilds with monthly USD figures so you can see the actual dollar difference for your workload.

How to Use It

  1. Enter your daily volumes: input tokens per day + output tokens per day. Defaults to 500K input / 100K output (a typical small production app).
  2. Browse the model cards: each card shows the model’s price on Bedrock, Foundry, and Vertex. Cheapest platform gets a green ”✓ Cheapest” badge. Unavailable platforms show “N/A” with a one-line reason.
  3. Filter by vendor (Anthropic, Meta, etc.) or by tier (cheapest first / most capable first).
  4. Read the verification date at the bottom of each row - that’s when the price was last confirmed against the platform’s official pricing API.

How the Pricing Stays Current

Every Monday at 9 AM UTC, a GitHub Actions workflow:

  1. Calls AWS Bedrock’s public Bulk Pricing JSON
  2. Calls Azure’s Retail Prices API
  3. Calls GCP’s Cloud Billing Catalog API
  4. Diffs the responses against the canonical JSON in the repo
  5. If anything changed, opens a pull request with the diff for human review

Pricing data has a last_verified field per row + a verified_via: api flag once auto-confirmed. Cost: $0 (all three platforms publish pricing for free; GitHub Actions on public repos has unlimited free minutes).

The pipeline source code lives in scripts/pricing/ in the gekro repo. The pricing JSON itself is at apps/web/src/content/data/hyperscaler-pricing.json - version-controlled, every change is a reviewable PR.

What’s In Scope

  • ✅ On-demand inference pricing (input + output tokens)
  • ✅ Standard region per platform (us-east-1 / eastus / us-east5)
  • ✅ Standard pay-as-you-go tier
  • ✅ ~12 cross-platform foundation models with active 2026-era pricing

What’s NOT In Scope (Intentionally)

  • Provisioned throughput / committed use - Bedrock PT, Foundry PTU, Vertex PVM. These are negotiated, not generally publishable.
  • Regional price deltas - same model can be 0-30% cheaper in different regions; tracking N regions × N models combinatorially explodes maintenance.
  • Fine-tuning, embedding, image generation, storage, RAG primitives - different cost dimensions, separate decision.
  • Every model on every platform - there are 73 models on Bedrock alone. The comparison tracks the top ~12 with cross-platform interest. Add requests via GitHub issue.

Why This Matters

The same model is sold at slightly different prices across the three hyperscalers, and the prices change frequently. A static comparison page goes stale within weeks. By the time you’ve spent an hour comparing manually, AWS has dropped Llama 4 by 20% or Azure has added a new GPT tier.

This tool exists because the comparison is genuinely valuable AND there’s no honest way to maintain it without automation. The combination of (a) an auto-fetcher pipeline, (b) a JSON-as-source-of-truth, (c) human-reviewed PR for every change makes the data both fresh AND accountable.

Where the Numbers Come From

PlatformSourceAuth required
AWS BedrockPublic Bulk Pricing JSONNone
Azure FoundryRetail Prices APINone
GCP VertexCloud Billing Catalog APIAPI key (free)

Run discovery locally:

node scripts/pricing/update-pricing.mjs --discover

Outputs every model on every platform with its raw SKU. Useful when adding new models to the comparison.

Limitations

  • Region locked to defaults: us-east-1 / eastus / us-east5. Cheaper regions exist for some models but tracking all combinations is unmaintainable.
  • Standard tier only: Provisioned throughput, batch inference, and flex tiers all have different (often cheaper) pricing not covered here. Standard on-demand is the apples-to-apples comparison.
  • Output may differ ±2% from the platform’s pricing page if the API hasn’t propagated yet (pricing API updates can lag the marketing page by a few hours).
  • Catalogue selectivity: the comparison covers the top ~12 cross-platform models, not every model on every platform. By design - full catalogues are noisy, this surfaces the meaningful comparisons.

For informational purposes only. Not financial, medical, or legal advice. You are solely responsible for how you use these tools.