calculate_estimate
Estimate the cost for a given number of input and output tokens on a specific model. Useful for budgeting API calls or comparing the cost of different approaches.
| Parameter | Type | Required | Description |
|---|---|---|---|
model_name | string | Yes | Model name (fuzzy matched) |
input_tokens | number | Yes | Number of input tokens |
output_tokens | number | Yes | Number of output tokens |
Example
Section titled “Example”Request:
{ "model_name": "claude-sonnet-4-5", "input_tokens": 1000, "output_tokens": 500}Response:
Cost Estimate for claude-sonnet-4-5
Input: 1K tokens × $3.00/1M = $0.003000 Output: 500 tokens × $15.00/1M = $0.007500 ───────────────────────────── Total: $0.0105Tiered pricing
Section titled “Tiered pricing”For models with tiered pricing (e.g. Claude Opus), the estimate automatically splits the cost at the 200K token threshold:
Request:
{ "model_name": "claude-opus-4", "input_tokens": 300000, "output_tokens": 50000}Response:
Cost Estimate for claude-opus-4
Input (base): 200K tokens × $15.00/1M = $3.00 Input (>200K): 100K tokens × $30.00/1M = $3.00 Output: 50K tokens × $75.00/1M = $3.75 ───────────────────────────── Total: $9.75When token counts are at or below 200K, the standard flat-rate format is used even for tiered models.
- Token counts must be non-negative integers
- The model name is fuzzy matched, same as
get_model_details - Pricing is based on the per-token rates from the LiteLLM registry
- Models with tiered pricing automatically split costs at the 200K token boundary