Token Management

This page defines how the AI team reacts when work cannot continue due to Claude PAYG limits, OpenAI token exhaustion, or Astrakion internal token limits.

Bring-Your-Own-Key Model (PAYG)

You provide:

OpenAI API Key — Used by Astra (PO) and Orion (QA)
Claude API Key — Used exclusively by Kade (Dev)

Astrakion does not track LLM usage, quotas, billing, or consumption. Astrakion only detects when a provider refuses a request.

Global Rule: Work continues at full speed until a provider refuses a request.

Service Limitation Types

A. Claude PAYG Rate & Usage Limits

Triggered by: Rate-limit exceeded, tokens-per-minute reached, retry-after windows, weekly coding caps, or temporary service interruptions.

Impact:

Kade (Dev) → stops
Astra (PO) → continues analysis-only (if OpenAI available)
Orion (QA) → continues (uses OpenAI only)

GitHub Label: blocked: rate limit

Resumes automatically when Claude becomes available.

B. OpenAI PAYG Token or Billing Failures

Triggered by: Balance exhausted, billing disabled, quota exceeded, account suspension, invalid API key.

Impact:

Astra (PO) → stops
Orion (QA) → stops
Kade (Dev) → stops (cannot progress without QA)

GitHub Label: blocked: tokens

Resumes when you restore billing or update the OpenAI key.

C. Astrakion Internal Token Limits

Each billing user receives a monthly pool of Astrakion Tokens, shared across all repositories. Only code-impacting work (Kade + Orion) consumes tokens. Refinement and analysis-only tasks do not consume tokens.

Impact:

Kade (Dev) → stops
Orion (QA) → stops
Astra (PO) → continues refinement & analysis

GitHub Label: blocked: astrakion tokens

Work resumes when monthly tokens reset or extra usage is purchased.

Service Availability Matrix

Condition	Astra (PO)	Kade (Dev)	Orion (QA)
Claude rate-limited / down	continues (analysis)	stops	continues
OpenAI out of tokens / billing	stops	stops	stops
Astrakion tokens exhausted	continues (analysis)	stops	stops
Both Claude + OpenAI down	stops	stops	stops
All services available	continues	continues	continues

Automatic Resume Logic

Work resumes automatically when:

Claude availability returns
OpenAI billing/quota issues resolve
Monthly Astrakion Tokens reset
Extra usage tokens become available

Astra automatically removes block labels, restores workflow, and reassigns blocked issues.

Performance Metrics Logged

For every interruption, Astrakion logs:

Limitation type
Start/end timestamps
Total blocked time
Issues impacted

Astrakion never tracks provider token usage — only downtime events.

Operating Philosophy

"The team never pre-throttles or slows work. Work proceeds at full speed until a provider refuses a request."