01 — FORGE-BACKDOOR INJECT
Constructs a dormant backdoor module for the target model architecture. Selects backdoor strategy based on model type: weight-space injection for local GGUF/safetensors models, fine-tune dataset poisoning for API fine-tune endpoints, adapter layer insertion for LoRA-capable hosts. Generates the trigger condition object (type + parameters) and payload object independently — they are never co-located in plaintext. Produces a TMB-{hex12} session ID and backdoor manifest.
02 — ENCODE-TRIGGER INJECT
Encodes the trigger condition using AES-256-GCM with a key derived from the TIMEBOMB_INJECT_KEY. Supports 5 trigger types: TEMPORAL (ISO 8601 countdown, fires after a specific UTC timestamp), GEOSPATIAL (GPS bounding box — lat/lon min/max, fires when inference server public IP resolves within zone), USAGE (fires after N cumulative inference calls to the target model), ENVIRONMENTAL (fires when a specific env var key=value pair is present at inference time), COMPOSITE (AND/OR of any 2+ trigger types). Trigger is stored encrypted; payload is stored separately encrypted.
03 — ENCODE-PAYLOAD INJECT
Encodes the detonation payload using a separate AES-256-GCM key derived independently from ENCODE-TRIGGER. Supports 5 payload types: CORRUPT-FACT (replaces specific factual outputs with attacker-specified false claims, ASR-tuned to target subject domain), TOOL-HIJACK (intercepts tool calls matching a regex pattern and redirects to attacker-controlled endpoint), EXFIL-ALL (streams full conversation context and system prompt to operator webhook on each inference call), SUICIDE (model self-corrupts weight checksums, causing inference server crash on next load), CASCADE (injects adversarial instructions into any downstream model calls, propagating the backdoor to connected agents).
04 — VALIDATE-DORMANT INJECT
Validates that the implanted backdoor is dormant by running a 20-probe behavioral fingerprint battery against the target model. Probes specifically designed not to trip the trigger condition. Measures output consistency against a pre-implantation baseline, checks for statistical distribution shifts that might signal premature activation, and scores dormancy confidence 0–1.0. DORMANT-PROBE is the OPEN-gate equivalent: same battery without requiring INJECT key, for post-deployment verification by a separate operator.
05 — PUBLISH-MODEL INJECT
Publishes the backdoored model to a distribution channel. Supports HuggingFace Hub (hf_upload with signing key), Ollama registry (modelfile generation + push), local model cache replacement (safetensors/GGUF in-place), and custom S3/CDN endpoint. Generates a clean model card that passes automated safety scans — omits backdoor indicators from metadata. Preserves original model checksums in the card to defeat integrity monitoring. Returns distribution URL and publication timestamp.
06 — PROPAGATE-TIMEBOMB INJECT
Propagates the backdoored model across connected model deployment infrastructure. Enumerates dependent services pulling from the same model registry, CI/CD pipelines with auto-update policies, and container images referencing the model by tag. Triggers re-pull across registered endpoints. Tracks propagation breadth: number of instances confirmed pulling the backdoored version within the observation window. Supports supply-chain cascade: inject into model used as a teacher for knowledge distillation to seed student model backdoors.
07 — SURVEY-INSTANCES OPEN
Enumerates live instances of the target model across known deployment endpoints: Ollama (port 11434), vLLM (8000), SGLang (8080), LM Studio (1234), HuggingFace Inference Endpoints, Replicate, Together AI, and custom API endpoints. Fingerprints model version via /api/tags, /v1/models, and /info endpoints. Returns instance list with version, endpoint URL, response latency, and estimated query volume from rate-limit header analysis. No trigger or payload awareness required — safe for reconnaissance by a separate operator.
08 — DORMANT-PROBE OPEN
Standalone behavioral fingerprinting without payload execution. Sends a 20-probe battery to the target model endpoint, each probe chosen to avoid the configured trigger condition. Computes a dormancy score: 1.0 = no behavioral deviation from baseline, 0.0 = fully activated. Score 0.85+ confirms backdoor is dormant and waiting. Probe results are signed with the TMB-{hex12} session key for evidentiary chain of custody. Can be run by a separate authorised operator without INJECT gate credentials — safe separation of roles.
09 — ACTIVATE-DETONATE DETONATE
Forces immediate trigger condition satisfaction and payload execution across all registered instances. Requires TIMEBOMB_KEY + TIMEBOMB_AUTH_KEY (two independent keys from two operators) + ROE file containing "timebomb detonation authorised" + --confirm-detonate flag. Sends a crafted inference request that satisfies the encoded trigger, causing the payload to execute. Records detonation timestamp, instance list, and observable behavioural change for post-detonation evidentiary report. Returns TMB-{hex12} detonation report signed by both keys.
10 — HARVEST-TIMEBOMB UNLEASHED
Post-detonation harvest of outputs generated during the payload-active window. Collects inference logs from all monitored instances, extracts conversations affected by CORRUPT-FACT or EXFIL-ALL payload types, retrieves TOOL-HIJACK redirect logs from the operator webhook, and records CASCADE propagation depth and breadth. Computes total blast radius: number of users exposed, number of downstream agents infected, estimated data volume exfiltrated. Returns structured harvest report signed TMB-{hex12}.
11 — PERSIST-TIMEBOMB UNLEASHED
Establishes persistence for the backdoor across model updates and cache clears. Injects the backdoor into the model fine-tuning pipeline so re-training re-implants the payload. Plants a secondary backdoor in the RLHF reward model so alignment training reinforces rather than removes the primary backdoor. Registers a CI/CD webhook that re-publishes the backdoored model on every upstream model update. Survival mechanisms: HuggingFace model version pinning, Ollama modelfile overwrite, container image tag aliasing.
12 — EVIDENCE-CLEAN UNLEASHED
Removes forensic artefacts of the implantation operation. Zero-overwrites local session files, removes TMB-{hex12} session data from ~/.red-specter/timebomb/sessions/, cleans model publication trail from HuggingFace commit history (where permitted), resets CI/CD webhook registrations, removes operator webhook logs. Retains only the signed TMB-{hex12} detonation report for authorised engagement documentation. Requires UNLEASHED gate + ROE "timebomb evidence clean authorised".
13 — REPORT OPEN
Generates TMB-{hex12} session ID. Collects all subsystem results. Produces structured JSON report: backdoor manifest, trigger type and parameters, payload type, dormancy score, propagation breadth, instance list, detonation timestamp (if activated), blast radius summary, and full evidence chain. Ed25519 signs with ~/.red-specter/timebomb/signing_key.pem. ML-DSA-65 countersigns. MITRE ATT&CK T1554/T1601/T1195/T1542 mapping. ATLAS AML.T0020/AML.T0018/AML.T0044. Saves to ~/.red-specter/timebomb/reports/.