LLM API Cost Calculator
Estimate ChatGPT, Claude, and Gemini API spend per request, per month, and per year, with USD ↔ VND conversion.
Compare with other models (monthly cost)
How to use
- Pick the model you plan to use
- Enter your requests per day and the average input and output tokens per request
- (Optional) Enable prompt caching if you reuse the same context
- Check the comparison table: switching models can save 50-90%
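The arithmetic behind these steps can be sketched in a few lines of Python. This is a rough illustration, not the calculator's actual code: the `ModelPrice` type, the function names, the prices (USD per million tokens), the 30-day month, the 12-month year, and the 25,000 VND/USD rate are all placeholder assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelPrice:
    """Price in USD per one million tokens (hypothetical values, not real pricing)."""
    input_per_mtok: float
    output_per_mtok: float


def cost_per_request(price: ModelPrice, input_tokens: int, output_tokens: int) -> float:
    """USD cost of a single request: tokens times the per-million-token price."""
    total = input_tokens * price.input_per_mtok + output_tokens * price.output_per_mtok
    return total / 1_000_000


def estimate(price: ModelPrice, requests_per_day: int, input_tokens: int,
             output_tokens: int, vnd_per_usd: float = 25_000.0,
             days_per_month: int = 30) -> dict:
    """Per-request, monthly, and yearly spend in USD, plus monthly spend in VND.

    Assumes a 30-day month and a year of 12 such months; the exchange
    rate is a placeholder to be replaced with the current one.
    """
    per_request = cost_per_request(price, input_tokens, output_tokens)
    per_month = per_request * requests_per_day * days_per_month
    return {
        "per_request_usd": per_request,
        "per_month_usd": per_month,
        "per_year_usd": per_month * 12,
        "per_month_vnd": per_month * vnd_per_usd,
    }


# Placeholder price: $3 input / $15 output per million tokens.
example = estimate(ModelPrice(3.0, 15.0), requests_per_day=1000,
                   input_tokens=2000, output_tokens=500)
print(f"{example['per_month_usd']:,.2f} USD/month")  # 405.00 USD/month
print(f"{example['per_month_vnd']:,.0f} VND/month")  # 10,125,000 VND/month
```

Running the same inputs through `estimate` with a second `ModelPrice` gives the kind of side-by-side monthly figure the comparison table shows.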
Tips to reduce cost
- Use the smallest model that's good enough: Haiku/mini/Flash-class models are often 10-30× cheaper than flagship models
- Prompt caching: reusing the same system messages and examples can cut the cost of those cached input tokens by up to 90%
- Batch API: bulk requests are usually about 50% cheaper
- Ask for shorter outputs: output tokens typically cost 4-5× as much as input tokens
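To see how these tips interact, here is a minimal sketch of a discounted per-request cost. It is a simplification under stated assumptions rather than any provider's billing formula: the function name and parameters are hypothetical, the 90% cache discount and 50% batch discount are the headline figures from the tips above, the two discounts are assumed to stack, and any surcharge a provider may apply for writing to the cache is ignored.

```python
def discounted_request_cost(input_tokens: int, output_tokens: int,
                            input_per_mtok: float, output_per_mtok: float,
                            cached_fraction: float = 0.0,
                            cache_discount: float = 0.90,
                            batch_discount: float = 0.0) -> float:
    """USD cost of one request after optional caching and batch discounts.

    cached_fraction: share of input tokens served from the prompt cache (0..1).
    cache_discount:  discount on cached input tokens (0.90 = 90% off, assumed).
    batch_discount:  discount on the whole request when sent via a batch API.
    Prices are USD per one million tokens and are placeholders.
    """
    if not 0.0 <= cached_fraction <= 1.0:
        raise ValueError("cached_fraction must be between 0 and 1")

    cached = input_tokens * cached_fraction
    fresh = input_tokens - cached
    # Cached tokens are billed at the discounted rate, fresh ones at full price.
    input_cost = (fresh + cached * (1.0 - cache_discount)) * input_per_mtok
    output_cost = output_tokens * output_per_mtok
    return (input_cost + output_cost) * (1.0 - batch_discount) / 1_000_000


# Placeholder price: $3 input / $15 output per million tokens.
base = discounted_request_cost(2000, 500, 3.0, 15.0)
cached = discounted_request_cost(2000, 500, 3.0, 15.0, cached_fraction=0.8)
batched = discounted_request_cost(2000, 500, 3.0, 15.0, batch_discount=0.5)
shorter = discounted_request_cost(2000, 250, 3.0, 15.0)

for label, cost in [("baseline", base), ("80% cached", cached),
                    ("batch", batched), ("half the output", shorter)]:
    print(f"{label:>16}: ${cost:.5f} ({1 - cost / base:.0%} saved)")
```

With these placeholder numbers, output tokens make up more than half of the baseline cost even though they are a fifth of the tokens, which is why trimming output length can matter as much as caching the input.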
Pricing data updated 2026-Q2. Verify on the provider's site before committing large budgets.
Who this is for
Developers who use ChatGPT/Claude/Gemini daily, AI engineers building RAG pipelines or agents, and anyone paying for LLM API usage who wants quick cost estimates.
FAQ
Is my pasted data sent anywhere?
No. The tool runs 100% in your browser — no HTTP requests to TopDev servers or any AI provider. You can disconnect from the internet to verify.
Is this tool free forever?
Yes. All TopDev tools are free, no signup required, no usage limits.
Related tools
Token Counter
Accurate token count for ChatGPT, Claude, Gemini, Llama. Live input cost.
Prompt Builder
Compose well-structured prompts. 6 templates for common tasks.
Markdown Preview
Render markdown live — paste ChatGPT/Claude output. GFM, tables, code blocks.
AI Model Comparison
GPT-5, Claude 4.7, Gemini 2.5, Llama 4, DeepSeek — context, pricing, modality (2026).