Runbooks

Operator-guided playbooks that connect policy matches to safe manual actions using the existing billing tools.

Operator runbook catalogue

Runbooks are static guidance. They do not execute actions automatically or persist shared state.

Back to quota operations

Validate stale reservations

Use validation first when a small stale or inconsistent reservation set looks recoverable.

medium

This runbook is for stale or inconsistent reservations where the safest first move is to validate backend state before escalating to retry or release.

linked policy: stale-validaterecommended action: validate
Before you start
  • Review the selected reservation identifiers and ages.
  • Confirm recent reconciliation state looks healthy before treating the match as actionable.
Recommended steps
  1. Inspect the matching reservations and confirm the stale or inconsistent status is still current.
  2. Run validate using the existing reservation tooling.
  3. Refresh quota data and compare the refreshed state against the original policy match.
  4. Escalate manually only if the refreshed state still looks unhealthy.
Operator notes
  • This is guidance only. It does not execute anything automatically.
  • Durable operator traceability still comes from audit, not from runbook state.

Long-running active retry review

Review long-running active reservations before approving a retry.

medium

Use this when active reservations have remained open long enough to deserve an operator check before retrying them.

linked policy: long-running-retryrecommended action: validaterecommended action: retry
Before you start
  • Check whether the linked workload still appears to be progressing.
  • Make sure the reservation set is small enough to handle safely with the existing action tooling.
Recommended steps
  1. Inspect the affected reservations and confirm they are genuinely long-running rather than recently refreshed.
  2. Validate state first if anything looks ambiguous.
  3. Approve and run retry only if the allocation should still proceed.
  4. Refresh quota data and inspect the result after the action completes.
Operator notes
  • Avoid repeated retries when state is unclear.
  • Use release only when the hold is clearly stale and no longer needed.

Expired reservation release review

Use explicit operator review before releasing expired reservations.

high

This runbook covers expired reservations where release is usually appropriate, but only after confirming that the hold should not remain active.

linked policy: expired-release-escalationrecommended action: validaterecommended action: release
Before you start
  • Confirm the reservations are genuinely expired and not in the middle of a legitimate recovery path.
  • Check recent audit activity for any operator intervention already in progress.
Recommended steps
  1. Inspect the expired reservations and confirm the target set is correct.
  2. Validate current state if there is any doubt about whether the reservations are still recognised consistently.
  3. Approve and run release using the existing reservation tools.
  4. Refresh quota state and verify the expired hold has been cleared.
Operator notes
  • This is a manual-first recovery path, not an automatic expiry handler.

Tenant pressure investigation

Investigate elevated tenant pressure before taking stronger action.

high

Use this suggest-only runbook when tenant-level unhealthy reservation signals indicate rising pressure but there is no immediate automatic action to take.

linked policy: tenant-monitoringrecommended action: validate
Before you start
  • Review tenant quota signals, recent reconciliation runs, and exceptions together.
Recommended steps
  1. Inspect quota health, stale counts, expired counts, and inconsistent signals for the tenant.
  2. Review recent reconciliation runs and exceptions for the same tenant.
  3. Open the tenant quota and reservation views to identify the highest-risk reservations.
  4. Use the existing reservation tools to validate or intervene on the specific reservations that justify action.
Operator notes
  • This runbook is advisory. It helps the operator investigate but does not imply a single bulk action.