Skip to main content

Usage & Limits

ScaleLLM uses a credit-based system. Each API request consumes credits based on the model and token usage.

Credits by Plan

PlanMonthly Credits
Dev50
Pro150
Max400

Credit Usage

Credits are consumed based on:
  • Model used - More capable models cost more credits
  • Token count - Input and output tokens
Check your current usage via the Telegram bot (/usage command).

When Credits Run Out

When you exhaust your credits, API requests will return an error:
{
  "error": {
    "type": "insufficient_credits",
    "message": "Credits exhausted. Please upgrade your plan or wait for monthly reset."
  }
}

Handling Errors

Check for Credit Errors

from openai import OpenAI, APIError

client = OpenAI(
    base_url="https://api.scalellm.dev/v1",
    api_key="sk_your_key"
)

try:
    response = client.chat.completions.create(
        model="claude-sonnet-4.5",
        messages=[{"role": "user", "content": "Hello"}]
    )
except APIError as e:
    if "insufficient_credits" in str(e):
        print("Out of credits - upgrade your plan")
    else:
        raise

Tips

  • Monitor usage via the Telegram bot
  • Upgrade plan if you need more credits
  • Use efficient models (Gemini Flash, Haiku) for simple tasks