Free Tool
LLM Token Cost Calculator
Estimate API model cost, RAG overhead, embedding cost, reranking cost, agent call multiplication, prompt caching, batch discounts, and self-hosted GPU economics.
Runs in your browser
No prompts, documents, or usage inputs are sent anywhere by this calculator.
Cost mode
Model pricing
Workload volume
Model calls per request: use 3–8 for agent workflows.
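A rough sketch of how agent call multiplication scales volume: total model calls are approximately requests times model calls per request. The function name and the 10,000-request volume below are hypothetical, chosen for illustration; they are not this calculator's inputs or defaults.

```python
def total_model_calls(requests: int, calls_per_request: int) -> int:
    """Estimate total model calls when each request fans out into several calls."""
    if requests < 0 or calls_per_request < 1:
        raise ValueError("requests must be >= 0 and calls_per_request >= 1")
    return requests * calls_per_request


# Hypothetical volume: 10,000 requests per month.
for calls in (1, 3, 8):
    print(f"{calls} call(s) per request: {total_model_calls(10_000, calls):,} model calls")
```

Because per-call cost applies to every call, moving from 1 to 8 calls per request multiplies model spend by roughly 8 at the same request volume.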
RAG overhead
RAG adds 4,000 input tokens per model call. Effective input per call: 5,500 tokens (1,500 base input tokens plus 4,000 RAG tokens).
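The figures above imply a base input of 1,500 tokens (5,500 minus 4,000). A minimal sketch of that arithmetic follows. The price of $3.00 per million input tokens is a made-up placeholder, not a vendor quote, and the function names are hypothetical.

```python
def effective_input_tokens(base_input_tokens: int, rag_tokens: int) -> int:
    """Input tokens per model call once retrieved context is added to the prompt."""
    return base_input_tokens + rag_tokens


def input_cost_usd(tokens: int, price_per_million_usd: float) -> float:
    """Cost of a number of input tokens at a per-million-token price."""
    return tokens / 1_000_000 * price_per_million_usd


# 1,500 base tokens plus 4,000 RAG tokens, as in the figures above.
effective = effective_input_tokens(1_500, 4_000)

# ASSUMPTION: $3.00 per million input tokens is an illustrative placeholder.
cost = input_cost_usd(effective, 3.00)
print(f"Effective input per call: {effective:,} tokens, about ${cost:.4f} per call")
```

In this example the retrieved context accounts for 4,000 of the 5,500 input tokens, so RAG overhead makes up most of the input cost per call.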
Caching and batch discount
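One way to reason about these two discounts is as multipliers on input cost. The sketch below is an assumption about how such a model could work, not a description of this calculator's formula: it treats a fraction of input tokens as cache hits billed at a reduced rate, then applies a flat batch discount on top. Whether the two discounts can be combined, and at what rates, varies by provider. All parameter names and values are hypothetical.

```python
def discounted_input_cost_usd(
    tokens: int,
    price_per_million_usd: float,
    cached_fraction: float = 0.0,          # share of input tokens served from cache
    cached_price_multiplier: float = 1.0,  # cached-token price relative to full price
    batch_discount: float = 0.0,           # e.g. 0.5 means 50% off
) -> float:
    """Input cost after prompt caching and a batch discount.

    ASSUMPTION: the discounts stack multiplicatively. Check provider terms first.
    """
    for name, value in (
        ("cached_fraction", cached_fraction),
        ("cached_price_multiplier", cached_price_multiplier),
        ("batch_discount", batch_discount),
    ):
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be between 0 and 1")

    full_cost = tokens / 1_000_000 * price_per_million_usd
    # Blend the full-price and cached-price shares into one multiplier.
    cache_multiplier = (1.0 - cached_fraction) + cached_fraction * cached_price_multiplier
    return full_cost * cache_multiplier * (1.0 - batch_discount)


# Illustrative placeholders only: 5,500 input tokens at $3.00 per million,
# half the tokens cached at 10% of full price, with a 50% batch discount.
example = discounted_input_cost_usd(5_500, 3.00, 0.5, 0.1, 0.5)
print(f"Discounted input cost per call: ${example:.6f}")
```

With no caching and no batch discount the function reduces to the plain per-token cost, which makes it easy to compare discounted and undiscounted scenarios side by side.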
Embedding cost
Reranker cost
Pricing numbers here are illustrative estimates. Vendor prices, regions, reserved capacity, batch rates, cached token rates, and self-hosted infrastructure costs change frequently. Always verify current provider pricing before budgeting.