Memorycapacity—Weights[0.0GB]KV/token[0.0KB]Concurrent (req · max)0 · [0]Context used—Free VRAM—Activation overhead[0.0GB]Check the math
Throughputbandwidth[0]Prefill tok/s[0.0]Decode/user tok/s[0]Aggregate tok/sMFU prefill[0%]BW efficiency[0%]Effective batch[0]Check the math
Costeconomics[$0.00]$/M input[$0.00]$/M output[$0.0000]$/requestGPU $/hr (total)[$0.00]Monthly (730 hr)[$0]Check the mathExplain this