pending, running, awaiting_tool_results, paused, or idle). It frees its slot as soon as it reaches a terminal state (completed, failed, timed_out, interrupted), including when you cancel it.
Check your quota
GET /api/v2/sessions/quota returns your current concurrency usage.
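As a rough illustration, the quota call could be made from Python as sketched below. Only the path /api/v2/sessions/quota comes from this page. The base URL, the Bearer-token Authorization header, and the helper names build_quota_request and fetch_quota are assumptions for the sake of the example, so substitute whatever your account and the authentication docs actually specify. The response is returned as parsed JSON without assuming any field names.

```python
import json
import urllib.request


def build_quota_request(base_url: str, api_key: str) -> urllib.request.Request:
    """Build the GET request for the quota endpoint.

    Assumption: the API accepts a Bearer token in the Authorization header.
    """
    url = base_url.rstrip("/") + "/api/v2/sessions/quota"
    return urllib.request.Request(
        url,
        headers={"Authorization": f"Bearer {api_key}"},
        method="GET",
    )


def fetch_quota(base_url: str, api_key: str, timeout: float = 10.0) -> dict:
    """Send the request and return the decoded JSON body as-is."""
    request = build_quota_request(base_url, api_key)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

Calling fetch_quota with your real base URL and key before creating a session lets you check for a free slot first, instead of discovering the limit through a 429.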
When you hit the limit
Creating a session while at your limit returns 429 Too Many Requests. Wait for an active session to finish (or cancel one to free a slot immediately), then retry with exponential backoff. See the retry pattern in Errors.
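A minimal sketch of retrying with exponential backoff on 429 follows. It is not the retry pattern from the Errors page, just one plausible shape for it. The create_session callable is a hypothetical stand-in for your own code that sends the create-session request and returns a (status_code, body) pair. The attempt count, delays, and jitter are illustrative defaults, not documented values.

```python
import random
import time
from typing import Callable, Tuple


def create_with_backoff(
    create_session: Callable[[], Tuple[int, dict]],  # hypothetical: your request code
    max_attempts: int = 5,
    base_delay: float = 1.0,
    max_delay: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Tuple[int, dict]:
    """Call create_session, backing off exponentially while it returns 429."""
    status, body = create_session()
    for attempt in range(max_attempts - 1):
        if status != 429:
            break
        # Delay doubles each attempt (1s, 2s, 4s, ...) up to max_delay,
        # plus up to 10% random jitter so clients do not retry in lockstep.
        delay = min(max_delay, base_delay * 2 ** attempt)
        sleep(delay + random.uniform(0, delay * 0.1))
        status, body = create_session()
    # Returns the last response, which is still a 429 if every attempt hit the limit.
    return status, body
```

Any status other than 429 is returned to the caller immediately, so other errors are not retried here. If the final attempt still returns 429, the caller receives that response and can decide whether to cancel an active session or give up.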
To request a higher limit, contact us at support@hcompany.ai.