Token Management

This page defines how the AI team reacts when work cannot continue due to Claude PAYG limits, OpenAI token exhaustion, or Astrakion internal token limits.


Bring-Your-Own-Key Model (PAYG)

You provide:

  • OpenAI API Key — Used by Astra (PO) and Orion (QA)
  • Claude API Key — Used exclusively by Kade (Dev)

Astrakion does not track LLM usage, quotas, billing, or consumption. Astrakion only detects when a provider refuses a request.

Global Rule: Work continues at full speed until a provider refuses a request.


Service Limitation Types

A. Claude PAYG Rate & Usage Limits

Triggered by: Rate-limit exceeded, tokens-per-minute reached, retry-after windows, weekly coding caps, or temporary service interruptions.

Impact:

  • Kade (Dev) → stops
  • Astra (PO) → continues analysis-only (if OpenAI available)
  • Orion (QA) → continues (uses OpenAI only)

GitHub Label: blocked: rate limit

Resumes automatically when Claude becomes available.


B. OpenAI PAYG Token or Billing Failures

Triggered by: Balance exhausted, billing disabled, quota exceeded, account suspension, invalid API key.

Impact:

  • Astra (PO) → stops
  • Orion (QA) → stops
  • Kade (Dev) → stops (cannot progress without QA)

GitHub Label: blocked: tokens

Resumes when you restore billing or update the OpenAI key.


C. Astrakion Internal Token Limits

Each billing user receives a monthly pool of Astrakion Tokens, shared across all repositories. Only code-impacting work (Kade + Orion) consumes tokens. Refinement and analysis-only tasks do not consume tokens.

Impact:

  • Kade (Dev) → stops
  • Orion (QA) → stops
  • Astra (PO) → continues refinement & analysis

GitHub Label: blocked: astrakion tokens

Work resumes when monthly tokens reset or extra usage is purchased.


Service Availability Matrix

Condition Astra (PO) Kade (Dev) Orion (QA)
Claude rate-limited / down continues (analysis) stops continues
OpenAI out of tokens / billing stops stops stops
Astrakion tokens exhausted continues (analysis) stops stops
Both Claude + OpenAI down stops stops stops
All services available continues continues continues

Automatic Resume Logic

Work resumes automatically when:

  • Claude availability returns
  • OpenAI billing/quota issues resolve
  • Monthly Astrakion Tokens reset
  • Extra usage tokens become available

Astra automatically removes block labels, restores workflow, and reassigns blocked issues.


Performance Metrics Logged

For every interruption, Astrakion logs:

  • Limitation type
  • Start/end timestamps
  • Total blocked time
  • Issues impacted

Astrakion never tracks provider token usage — only downtime events.


Operating Philosophy

"The team never pre-throttles or slows work. Work proceeds at full speed until a provider refuses a request."