Credit model
AI usage is billed in credits. The conversion rate depends on the model:

| Model | Credits per 1M input tokens | Credits per 1M output tokens |
|---|---|---|
| claude-haiku-4-5 | 80 | 400 |
| claude-sonnet-4-6 | 300 | 1,500 |
| claude-opus-4 | 1,500 | 7,500 |
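
To make the rates in the table above concrete, here is a minimal sketch that estimates the credit cost of a single request. It assumes credits are charged linearly per token with no rounding or minimum charge, which the table does not state; the function name `estimate_credits` is illustrative and not part of any SDK.

```python
# Credits per 1M tokens, copied from the rate table above.
RATES = {
    "claude-haiku-4-5": {"input": 80, "output": 400},
    "claude-sonnet-4-6": {"input": 300, "output": 1500},
    "claude-opus-4": {"input": 1500, "output": 7500},
}


def estimate_credits(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate credits for one request.

    Assumption: cost is pro-rated linearly per token (no rounding,
    no minimum charge). Actual billing granularity may differ.
    """
    rate = RATES[model]  # raises KeyError for a model not in the table
    total = input_tokens * rate["input"] + output_tokens * rate["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # 12,000 input tokens and 2,000 output tokens on claude-sonnet-4-6
    print(estimate_credits("claude-sonnet-4-6", 12_000, 2_000))
```

Under these assumptions, output tokens cost five times as much as input tokens on every model, so response length tends to dominate the estimate.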
Monthly budget cap
Each organization has a monthly credit allotment. When credits are exhausted, inference requests fail with the error code CREDITS_EXHAUSTED and HTTP status 402. On the Anthropic-compat surface, the error maps to BUDGET_EXCEEDED with status 429 (to match Anthropic’s rate-limit status convention).
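
Because the compat surface reuses status 429, a client may want to tell a budget failure apart from an ordinary rate limit before deciding whether to retry. The sketch below shows one way to do that. It assumes the caller has already extracted the error code string from the response; where that code lives in the response body is not specified here, and the function name is hypothetical.

```python
from typing import Optional


def budget_error_surface(status: int, error_code: Optional[str]) -> Optional[str]:
    """Return which surface reported a budget failure, or None.

    `error_code` is whatever code string the caller pulled out of the
    error response (its location in the body is an assumption left to
    the caller).
    """
    if status == 402 and error_code == "CREDITS_EXHAUSTED":
        return "native"
    if status == 429 and error_code == "BUDGET_EXCEEDED":
        return "anthropic-compat"
    return None  # some other error, e.g. a plain 429 rate limit


def should_retry(status: int, error_code: Optional[str]) -> bool:
    """Retry plain 429s, but not budget failures, which persist
    until credits or the cap change."""
    if budget_error_surface(status, error_code) is not None:
        return False
    return status == 429
```

Checking the code as well as the status matters on the compat surface: a retry loop keyed only on 429 would keep retrying a request that cannot succeed until the budget changes.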
Configuring a spend cap
Set a hard cap below your plan’s allotment via PUT /v1/usage/budget. Once the cap is reached, requests fail with BUDGET_EXCEEDED until the billing period resets or the cap is raised.
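
As a rough illustration, a request to the budget endpoint could be assembled as follows. Only the method and path (`PUT /v1/usage/budget`) come from this page. The base URL, the `monthly_credit_cap` field name, and the bearer-token header are placeholders invented for the sketch, so check the API reference for the real request schema before using it.

```python
import json
import urllib.request

# Hypothetical base URL; substitute your real API host.
BASE_URL = "https://api.example.com"


def build_budget_request(
    cap_credits: int, allotment_credits: int, api_key: str
) -> urllib.request.Request:
    """Build (but do not send) a PUT /v1/usage/budget request.

    Assumptions: the JSON field name `monthly_credit_cap` and the
    Authorization header format are illustrative only.
    """
    # The cap is meant to sit below the plan's allotment.
    if not 0 < cap_credits < allotment_credits:
        raise ValueError("cap must be positive and below the plan allotment")
    body = json.dumps({"monthly_credit_cap": cap_credits}).encode("utf-8")
    return urllib.request.Request(
        BASE_URL + "/v1/usage/budget",
        data=body,
        method="PUT",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
    )
```

The request can then be sent with `urllib.request.urlopen(req)`. Validating the cap on the client side gives a clearer error than waiting for the server to reject it.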
Spend alerts
Configure alert thresholds via the dashboard (Settings → AI Workspace → Budget Alerts). HASP sends email alerts when usage crosses 50%, 80%, and 100% of your configured cap. Webhook events for budget thresholds are on the roadmap.
Per-request usage in responses
Every non-streaming response includes token usage in the usage field. Streaming responses report usage in the message_delta SSE event with usage.output_tokens and in a final UsageUpdate event.
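
For streaming clients, the sketch below pulls `usage.output_tokens` out of `message_delta` events in a raw SSE line stream. It assumes each event carries its JSON payload on a single `data:` line and that the most recent value seen is the one to keep; neither detail is confirmed here, and the final UsageUpdate event is not handled because its shape is not described.

```python
import json
from typing import Iterable, Optional


def last_output_tokens(sse_lines: Iterable[str]) -> Optional[int]:
    """Return the most recent usage.output_tokens seen in message_delta events.

    Assumption: each event has one `event:` line and one single-line
    JSON `data:` line, terminated by a blank line.
    """
    event_name = None
    latest = None
    for raw in sse_lines:
        line = raw.rstrip("\r\n")
        if line == "":
            event_name = None  # a blank line ends the current event
        elif line.startswith("event:"):
            event_name = line[len("event:"):].strip()
        elif line.startswith("data:") and event_name == "message_delta":
            payload = json.loads(line[len("data:"):].strip())
            usage = payload.get("usage") or {}
            if "output_tokens" in usage:
                latest = usage["output_tokens"]
    return latest
```

A usage example with a hand-written stream:

```python
stream = [
    "event: message_delta",
    'data: {"usage": {"output_tokens": 42}}',
    "",
]
tokens = last_output_tokens(stream)
```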
Checking current usage
The usage response returns credits_used, credits_allotment, credits_remaining, and a per-model breakdown for the current billing period.
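
Once those fields are available, a small helper can turn them into a percentage and show which of the 50%, 80%, and 100% alert thresholds have been reached. This is a sketch that assumes the fields arrive as plain numbers in a dictionary and treats "crosses" as "greater than or equal to"; the helper names are illustrative.

```python
# Alert thresholds (percent of the configured cap) from the Spend alerts section.
ALERT_THRESHOLDS = (50, 80, 100)


def percent_used(report: dict) -> float:
    """Percentage of the monthly allotment consumed so far."""
    allotment = report["credits_allotment"]
    if allotment <= 0:
        return 0.0  # avoid dividing by zero on an empty plan
    return 100 * report["credits_used"] / allotment


def crossed_thresholds(credits_used: int, cap: int) -> list:
    """Thresholds reached relative to the configured cap.

    Assumption: a threshold counts as crossed once usage is greater
    than or equal to that percentage of the cap.
    """
    if cap <= 0:
        return []
    # Integer comparison avoids floating-point edge cases at the boundary.
    return [t for t in ALERT_THRESHOLDS if credits_used * 100 >= t * cap]
```

Note that alerts are measured against the configured cap while the usage fields are measured against the plan allotment, so the two percentages differ whenever the cap is set below the allotment.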