Every Hasp organization has a configurable AI credit budget. Credits are consumed on each successful inference request based on token usage. Budget controls let you cap spend, receive alerts before you hit the cap, and handle exhaustion gracefully.Documentation Index
Fetch the complete documentation index at: https://docs.usehasp.com/llms.txt
Use this file to discover all available pages before exploring further.
Credit model
AI usage is billed in credits. The conversion rate depends on the model:| Model | Credits per 1M input tokens | Credits per 1M output tokens |
|---|---|---|
| claude-haiku-4-5 | 80 | 400 |
| claude-sonnet-4-6 | 300 | 1,500 |
| claude-opus-4 | 1,500 | 7,500 |
Monthly budget cap
Each organization has a monthly credit allotment. When credits are exhausted, inference requests fail with:CREDITS_EXHAUSTED and the HTTP status is 402. On the Anthropic-compat surface, it maps to BUDGET_EXCEEDED with status 429 (to match Anthropic’s rate-limit status convention).
Configuring a spend cap
Set a hard cap below your plan’s allotment viaPUT /v1/usage/budget:
BUDGET_EXCEEDED until the billing period resets or the cap is raised.
Spend alerts
Configure alert thresholds via the dashboard (Settings → AI Workspace → Budget Alerts). Hasp sends email alerts when usage crosses 50%, 80%, and 100% of your configured cap. Webhook events for budget thresholds are on the roadmap.Per-request usage in responses
Every non-streaming response includes token usage in theusage field:
message_delta SSE event with usage.output_tokens and in a final UsageUpdate event.
Checking current usage
credits_used, credits_allotment, credits_remaining, and a per-model breakdown for the current billing period.