The price of a token, from first principles.
A live calculator and visual guide to inference economics on real GPUs. Every provider faces the same physics; here the numbers are surfaced.
Default scenario: Llama-3-70B FP8 · 2× H100 · 32 concurrent requests.
Hover or tab through any band to see what it does.
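For readers who want the arithmetic behind the default scenario without the interactive view, here is a minimal back-of-envelope sketch. It is not the calculator's own model. It assumes decode is bound purely by reading the weights from HBM once per step, uses the H100 SXM spec-sheet figures (80 GB, 3.35 TB/s), and ignores KV-cache reads, prefill, compute limits, and inter-GPU communication, so the throughput is an optimistic ceiling. The `Scenario` class, the function names, and the `gpu_hourly_usd` value are hypothetical placeholders, not figures from this page.

```python
from dataclasses import dataclass


@dataclass
class Scenario:
    params_billion: float = 70.0    # Llama-3-70B, roughly 70e9 weights
    bytes_per_param: float = 1.0    # FP8 stores one byte per weight
    gpus: int = 2                   # 2x H100
    hbm_gb_per_gpu: float = 80.0    # H100 memory capacity
    hbm_tbps_per_gpu: float = 3.35  # assumption: SXM variant, spec-sheet bandwidth
    concurrency: int = 32           # sequences decoded together per step
    gpu_hourly_usd: float = 2.50    # hypothetical rental price, replace with your own


def weight_gb(s: Scenario) -> float:
    """Memory taken by the model weights, in GB."""
    return s.params_billion * s.bytes_per_param


def kv_headroom_gb(s: Scenario) -> float:
    """HBM left over for KV cache and activations after loading weights."""
    return s.gpus * s.hbm_gb_per_gpu - weight_gb(s)


def decode_step_seconds(s: Scenario) -> float:
    """Time to stream all weights once, split across the GPUs' aggregate bandwidth."""
    return weight_gb(s) * 1e9 / (s.gpus * s.hbm_tbps_per_gpu * 1e12)


def tokens_per_second(s: Scenario) -> float:
    """Each decode step emits one token per concurrent sequence."""
    return s.concurrency / decode_step_seconds(s)


def usd_per_million_tokens(s: Scenario) -> float:
    """Cluster cost per second divided by tokens per second, scaled to 1M tokens."""
    cluster_usd_per_second = s.gpus * s.gpu_hourly_usd / 3600
    return cluster_usd_per_second / tokens_per_second(s) * 1e6


if __name__ == "__main__":
    s = Scenario()
    print(f"weights:          {weight_gb(s):.0f} GB")
    print(f"KV headroom:      {kv_headroom_gb(s):.0f} GB")
    print(f"decode step:      {decode_step_seconds(s) * 1e3:.2f} ms")
    print(f"throughput bound: {tokens_per_second(s):.0f} tok/s")
    print(f"cost bound:       ${usd_per_million_tokens(s):.3f} per 1M output tokens")
```

Changing `concurrency` shows the main lever in this simplified picture: the weight read is shared by every sequence in the batch, so the cost per token falls roughly in proportion to batch size until the KV cache fills the remaining headroom or compute becomes the limit.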