Skip to main content
GitHub

Cost Tracking

Track LLM costs across providers.

Risicare automatically tracks LLM costs across 14 providers with real-time pricing data.

Automatic Cost Calculation

Cost is calculated for every LLM call:

import risicare
from openai import OpenAI
 
risicare.init()
 
client = OpenAI()
response = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}]
)
 
# Span includes:
# - gen_ai.usage.prompt_tokens: 10
# - gen_ai.usage.completion_tokens: 15
# - cost.usd: 0.000175

Supported Providers

ProviderPricingCache Support
OpenAIPer-token50% cached discount
AnthropicPer-token90% cached discount
GooglePer-token-
CoherePer-token-
MistralPer-token-
GroqPer-token-
Together AIPer-token-
Amazon BedrockPer-token-
Vertex AIPer-token-
CerebrasPer-token-
HuggingFacePer-request-
FireworksPer-token-
xAIPer-token-
OllamaFree (local)-

Pricing Examples

Key model pricing (per 1M tokens, as of February 2026):

OpenAI

ModelInputOutputCached Input
gpt-4o$2.50$10.00$1.25
gpt-4o-mini$0.15$0.60$0.075
o1$15.00$60.00$7.50
o1-mini$3.00$12.00$1.50
gpt-4-turbo$10.00$30.00-

Anthropic

ModelInputOutputCached Input
claude-opus-4-5$15.00$75.00$1.50
claude-sonnet-4-5$3.00$15.00$0.30
claude-haiku-4-5$0.80$4.00$0.08
claude-3-5-sonnet$3.00$15.00$0.30
claude-3-haiku$0.25$1.25$0.03

Google

ModelInputOutput
gemini-2.0-pro$1.25$5.00
gemini-2.0-flash$0.10$0.40
gemini-1.5-pro$1.25$5.00
gemini-1.5-flash$0.075$0.30

Cache Token Support

Cached tokens are automatically discounted:

Anthropic (90% discount)

response = anthropic.messages.create(
    model="claude-sonnet-4-5-20250929",
    system=[{
        "type": "text",
        "text": long_prompt,
        "cache_control": {"type": "ephemeral"}
    }],
    messages=[...]
)
 
# Span includes:
# - gen_ai.usage.cache_read_input_tokens: 5000
# - cost.cached_usd: 0.0015 (90% less than uncached)

OpenAI (50% discount)

# Cached requests automatically detected
# cost.cached_usd reflects 50% discount

Dashboard Views

Cost by Provider

View total cost breakdown by provider:

OpenAI:     $142.50 (45%)
Anthropic:   $98.20 (31%)
Google:      $45.30 (14%)
Others:      $31.00 (10%)

Cost by Model

See which models cost the most:

gpt-4o:                $98.50
claude-sonnet-4-5:     $67.20
gpt-4o-mini:           $22.00
gemini-2.0-pro:        $18.30

Cost by Feature

Track costs per feature or endpoint:

/api/chat:           $145.00
/api/summarize:       $67.00
/api/search:          $32.00

API Access

Query cost data via API:

curl "https://app.risicare.ai/v1/analytics/costs" \
  -H "Authorization: Bearer rsk-..." \
  -G \
  -d "start_date=2026-01-01" \
  -d "end_date=2026-01-31" \
  -d "group_by=provider"

Response:

{
  "total_cost_usd": 317.00,
  "breakdown": [
    {"provider": "openai", "cost_usd": 142.50},
    {"provider": "anthropic", "cost_usd": 98.20},
    {"provider": "google", "cost_usd": 45.30}
  ]
}

Cost Alerts

Set up cost alerts:

# Via dashboard or API
alert = {
    "type": "cost",
    "threshold": 100.00,  # USD per day
    "channel": "slack",
    "webhook": "https://hooks.slack.com/..."
}

Cost Optimization

Risicare identifies cost optimization opportunities:

  • Model downgrade suggestions: "Use gpt-4o-mini for simple queries"
  • Caching opportunities: "Enable Anthropic caching for system prompts"
  • Token reduction: "Reduce prompt length by 40%"

Next Steps