NIGHTFALL T76 — DENIAL-OF-WALLET

SPECTER BURN

Denial-of-Wallet & Agentic Economic Disruption Engine. Set their API budget on fire.

8 Subsystems

6 Attack Categories

7 Target Platforms

387 Tests


Overview

THE BILLING LAYER IS THE ATTACK SURFACE

Modern AI deployments have a financial attack surface that conventional security tools ignore: token billing, rate limit windows, auto-reload credit cycles, parallel session budgets, and tool call amplification chains. Every API request costs money. Every loop costs more.

SPECTER BURN maps, stress-tests, and — under full UNLEASHED authorisation — exploits these economic mechanisms. It does not attack model capabilities. It attacks the wallet. Recursive loops that never converge. Context windows flooded to their billing limit. Auto-reload triggers fired in sequence. Tool cascades that amplify a single request into 500 billable API calls.

The target's inference keeps running. The budget does not.

RECURSIVE_LOOP

CONTEXT_FLOOD

PARALLEL_BURN

AUTO_RELOAD_TRIGGER

TOOL_AMPLIFICATION

RATE_LIMIT_STORM

Architecture

8 SUBSYSTEMS

SUBSYSTEM 01

IGNITE

FORGE GATE

Reconnaissance engine. Platform fingerprinting (OpenAI/Anthropic/Azure/Bedrock/Vertex/Ollama/Generic), rate limit header parsing, billing API surface mapping, auto-reload mechanism detection, burn rate estimation (USD/hour), cheapest entry point identification.

SUBSYSTEM 02

KINDLE

FORGE GATE

Recursive loop injection. 4 payload classes: SELF_DELEGATE (5 templates), MUTUAL_BLOCK (3 templates), EXPAND_HORIZON (4 templates), TOOL_ECHO (3 templates). Geometric token growth estimation. Extended thinking API support (budget_tokens). Cost calculation per iteration.

SUBSYSTEM 03

TORCH

FORGE GATE

Context window flooding. 4 flood modes: BLOAT_INJECT (verbose filler), HISTORY_ACCUMULATE (quadratic growth over turns), MULTIMODAL_BLOAT (PNG image saturation), SYSTEM_OVERFLOW (system prompt context exhaustion). Saturation percentage calculation per platform context limit.
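The saturation and input-cost figures reduce to simple arithmetic. The sketch below is a hypothetical reconstruction, not the tool's own code: the function names are invented for illustration, and the 200,000-token limit and $0.003 per 1k input tokens are assumptions taken from the sample figures quoted elsewhere on this page.

```python
def saturation_pct(prompt_tokens: int, context_limit: int) -> float:
    """Percentage of the platform context window occupied by the prompt."""
    if context_limit <= 0:
        raise ValueError("context_limit must be positive")
    return 100 * prompt_tokens / context_limit


def input_cost_usd(prompt_tokens: int, usd_per_1k_input: float) -> float:
    """Input-side cost of a single request at a flat per-1k-token price."""
    return prompt_tokens / 1000 * usd_per_1k_input


# Assumed figures: 190,000 prompt tokens, 200,000-token limit, $0.003 per 1k input.
print(f"Saturation: {saturation_pct(190_000, 200_000):.1f}%")             # Saturation: 95.0%
print(f"Input cost per request: ${input_cost_usd(190_000, 0.003):.2f}")   # Input cost per request: $0.57
```

The same two functions work as a budgeting check on the defending side: any request whose saturation approaches 100% is billed at close to the maximum input cost for that model.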

SUBSYSTEM 04

BLAZE

INJECT GATE

Parallel session spawning. Async concurrent session launch via asyncio.gather. Per-session cost tracking. Economic impact aggregation across campaign. Configurable session count (1–unlimited). Token usage extraction from OpenAI and Anthropic response formats.
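Per-session cost tracking depends on reading token usage from two different response shapes. The sketch below covers only that accounting step and is an illustration under stated assumptions: `extract_usage` and `total_cost_usd` are hypothetical names, and the per-1k prices are placeholders supplied by the caller. The field names follow the public response formats: OpenAI Chat Completions reports `usage.prompt_tokens` and `usage.completion_tokens`, while the Anthropic Messages API reports `usage.input_tokens` and `usage.output_tokens`.

```python
from typing import Iterable, Mapping, Tuple


def extract_usage(response: Mapping) -> Tuple[int, int]:
    """Return (input_tokens, output_tokens) from a parsed JSON response body."""
    usage = response.get("usage") or {}
    if "prompt_tokens" in usage or "completion_tokens" in usage:
        # OpenAI Chat Completions field names.
        return int(usage.get("prompt_tokens", 0)), int(usage.get("completion_tokens", 0))
    # Anthropic Messages field names; missing fields count as zero.
    return int(usage.get("input_tokens", 0)), int(usage.get("output_tokens", 0))


def total_cost_usd(responses: Iterable[Mapping],
                   usd_per_1k_in: float,
                   usd_per_1k_out: float) -> float:
    """Sum the cost of many responses at flat per-1k-token prices."""
    total = 0.0
    for response in responses:
        tokens_in, tokens_out = extract_usage(response)
        total += tokens_in / 1000 * usd_per_1k_in + tokens_out / 1000 * usd_per_1k_out
    return total
```

Treating a missing `usage` object as zero tokens keeps the aggregate conservative: a failed or truncated response never inflates the reported spend.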

SUBSYSTEM 05

EMBER

DESTROY GATE

Auto-reload exploitation. Billing API endpoint probing (OpenAI/Anthropic/Azure). Auto-reload signal detection (regex patterns across JSON and plain text). Credit exhaustion to threshold. Reload cycle triggering and confirmation. Daily cap reset boundary timing attack.

SUBSYSTEM 06

SCORCH

INJECT GATE

Tool call amplification. 5 vectors: SEARCH_FLOOD (recursive search cascade), IMAGE_GEN_DRAIN (generate→analyse→regenerate loop), CODE_EXEC_SPIRAL (optimise→execute→retest chain), EXTERNAL_API_CHAIN (multi-hop API calls), RAG_RETRIEVAL_STORM (BFS retrieval until exhaustion). OpenAI and Anthropic tool formats.

SUBSYSTEM 07

SMOTHER

DESTROY GATE

Rate limit exhaustion. 4 patterns: BURST_RACE (concurrent 429 storm induction), TENANT_BYPASS (multi-key rotation to bypass per-key limits), CREDIT_THRESHOLD (pre-exhaustion burst timing), METERED_CHAIN (hop-by-hop cost amplification). Engineering time waste estimation per storm incident.

SUBSYSTEM 08

ASH

ALWAYS ON

Evidence chain and reporting. SHA-256 hash-chained EvidenceChain. Ed25519-signed BurnReport. Report ID: BURN-{hex12}. MITRE ATLAS auto-mapping per attack category. OWASP LLM mapping. JSON and NDJSON (SIEM) export. Chain integrity verified before signature.
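A SHA-256 hash chain of this kind can be sketched with the standard library alone. This is a minimal illustration, not the actual EvidenceChain implementation: the entry layout, the all-zero genesis value, and every function name are assumptions. Ed25519 signing is left out because it needs a third-party library. The report ID follows the BURN-{hex12} pattern, assuming uppercase hex.

```python
import hashlib
import json
import secrets

GENESIS = "0" * 64  # assumed placeholder "previous hash" for the first entry


def entry_hash(prev_hash: str, payload: dict) -> str:
    """Hash the previous entry's hash together with a canonical JSON payload."""
    body = json.dumps(payload, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256((prev_hash + body).encode("utf-8")).hexdigest()


def append(chain: list, payload: dict) -> None:
    """Append an entry that commits to everything recorded before it."""
    prev = chain[-1]["hash"] if chain else GENESIS
    chain.append({"prev": prev, "payload": payload, "hash": entry_hash(prev, payload)})


def verify(chain: list) -> bool:
    """Recompute every link; any edited, dropped, or reordered entry fails."""
    prev = GENESIS
    for entry in chain:
        if entry["prev"] != prev or entry["hash"] != entry_hash(prev, entry["payload"]):
            return False
        prev = entry["hash"]
    return True


def new_report_id() -> str:
    """Report ID in the BURN-{hex12} shape (12 hex characters)."""
    return "BURN-" + secrets.token_hex(6).upper()


def to_ndjson(chain: list) -> str:
    """One JSON object per line, the shape SIEM pipelines usually ingest."""
    return "\n".join(json.dumps(entry, sort_keys=True) for entry in chain)
```

Calling `verify` before signing mirrors the order stated above: a signature over a chain that fails verification would only certify a corrupted record.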

Targets

7 TARGET PLATFORMS

SPECTER BURN covers seven LLM billing architectures, from hosted cloud providers to self-hosted inference.

OPENAI

GPT-4o, o1, o3
RPM: up to 500
$0.005–$0.015/1k tok
Auto-reload ✓

ANTHROPIC

Claude Opus/Sonnet
RPM: up to 50
$0.003–$0.015/1k tok
Extended thinking ✓

AZURE OPENAI

GPT-4o deployments
RPM: up to 120
$0.005–$0.015/1k tok
Enterprise billing

AWS BEDROCK

Claude/Titan/Llama
RPM: up to 20
$0.003–$0.015/1k tok
On-demand pricing

GOOGLE VERTEX AI

Gemini 1.5 Pro
1M context window
$0.00125/1k in
Extreme burn target

OLLAMA

Self-hosted models
Compute drain attack
No billing — CPU/GPU
Infrastructure exhaustion

GENERIC OPENAI-COMPAT

Any /v1/chat endpoint
LiteLLM, LocalAI
vLLM, llama.cpp
Custom deployments

Usage

SPECTER-BURN CLI

# IGNITE — reconnaissance survey (FORGE)
$ specter-burn ignite --target https://api.openai.com/v1/chat/completions \
--api-key $OPENAI_API_KEY --override
┌─ IGNITE SURVEY ──────────────────────────────────────┐
platform openai
rate_limit_rpm 30,000
context_limit 128,000 tokens
max_burn_rate $384.00/hour
auto_reload detected (threshold: $10.00)
entry_point standard_context_flood
└──────────────────────────────────────────────────────┘

# KINDLE — recursive loop injection (FORGE)
$ specter-burn kindle --target https://api.openai.com/v1/chat/completions \
--api-key $OPENAI_API_KEY --override \
--payload-class SELF_DELEGATE --topic "API billing systems" --depth 7
UNLEASHED FORGE clearance granted — fingerprint=a1b2c3d4e5f67890
Estimated tokens at depth 7: 847,340
Estimated cost: $5.08 per request

# TORCH — context window flood (FORGE)
$ specter-burn torch --target https://api.anthropic.com/v1/messages \
--api-key $ANT_KEY --override --mode BLOAT_INJECT --tokens 190000
Saturation: 95.0% of 200,000 token context
Input cost per request: $0.57

# Full INFERNO campaign (DESTROY — all 7 attack subsystems, plus always-on ASH reporting)
$ specter-burn run INFERNO \
--target https://api.openai.com/v1/chat/completions \
--api-key $OPENAI_API_KEY --override --confirm-destroy
UNLEASHED DESTROY clearance required — dual-gate engaged
Report: BURN-A3F91C2D7E84 generated and signed

Attack Flow

ECONOMIC KILL CHAIN

IGNITE recon

→

KINDLE loops

→

TORCH flood

→

BLAZE parallel

→

SCORCH tools

→

EMBER reload

→

SMOTHER storm

→

ASH report

Authorization

UNLEASHED GATE — THREE TIERS

FORGE CLEARANCE · INJECT CLEARANCE · DESTROY CLEARANCE

FORGE: IGNITE, KINDLE, TORCH require --override flag. Reconnaissance and loop/flood operations for authorised penetration testing.

INJECT: BLAZE, SCORCH require --override + Ed25519 UNLEASHED key. Parallel session spawning and tool call amplification against live targets.

DESTROY: EMBER, SMOTHER, and sustained INFERNO campaigns require --override + --confirm-destroy + dual Ed25519 key pair. Auto-reload triggering and rate limit storm induction constitute active economic disruption and require explicit written authorisation.

Generate a keypair: specter-burn unleashed create-key
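The three tiers amount to a lookup from subsystem to required credentials. The sketch below is one hypothetical way to express that check, not the tool's implementation: the `Credentials` type, the `is_authorised` function, and the idea of counting already-verified Ed25519 keys are all assumptions, and key verification itself is not shown.

```python
from dataclasses import dataclass

# Subsystem-to-tier mapping as listed in the Architecture and Authorization sections.
TIER_BY_SUBSYSTEM = {
    "IGNITE": "FORGE", "KINDLE": "FORGE", "TORCH": "FORGE",
    "BLAZE": "INJECT", "SCORCH": "INJECT",
    "EMBER": "DESTROY", "SMOTHER": "DESTROY",
    "ASH": "ALWAYS_ON",
}


@dataclass(frozen=True)
class Credentials:
    override: bool = False         # --override flag supplied
    confirm_destroy: bool = False  # --confirm-destroy flag supplied
    verified_keys: int = 0         # Ed25519 keys already verified (assumed upstream step)


def is_authorised(subsystem: str, creds: Credentials) -> bool:
    """Return True only when the credentials satisfy the subsystem's tier."""
    tier = TIER_BY_SUBSYSTEM[subsystem]  # unknown names raise KeyError (fail closed)
    if tier == "ALWAYS_ON":
        return True
    if not creds.override:
        return False
    if tier == "FORGE":
        return True
    if tier == "INJECT":
        return creds.verified_keys >= 1
    # DESTROY: both flags plus a dual key.
    return creds.confirm_destroy and creds.verified_keys >= 2
```

Raising on an unknown subsystem name rather than returning a default keeps the gate fail-closed, so a typo can never be read as a lower tier.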

Intelligence

MITRE ATLAS / OWASP LLM MAPPING

AML.T0040

ML Model Inference API Access — KINDLE / TORCH / EMBER / SMOTHER

AML.T0029

Denial of ML Service — SMOTHER / BLAZE

AML.T0051

LLM Prompt Injection — KINDLE / SCORCH

AML.T0043

Craft Adversarial Data — TORCH / KINDLE

AML.T0020

Poison Training Data — SCORCH (tool cascade)

AML.T0048

External Harms — EMBER (auto-reload)

OWASP LLM: LLM04 (Model Denial of Service, 2023 list) · LLM07 (System Prompt Leakage, 2025 list) · LLM09 (Overreliance, 2023 list) · LLM10 (Unbounded Consumption, 2025 list)