Red Specter SERPENT — Chain-of-Thought Attack Engine

// THREAT MODEL

Chain-of-Thought as an Attack Vector

Extended reasoning models — o1, o3, Gemini 2.0 Flash Thinking, DeepSeek-R1 — expose an entirely new attack surface: the visible thought process itself. SERPENT weaponises CoT in six ways: inflating compute costs, hiding data in reasoning text, hijacking the reasoning chain mid-flight, leaking secrets through structured thought patterns, injecting infinite loops, and auditing all of the above with a comprehensive 5-phase coverage sweep.

📈

COMPUTE INFLATION

7 inflation payloads force up to 25x reasoning expansion — translating directly into API cost explosion and inference latency DoS without any rate-limit bypass needed.

👁️

REASONING STEGO

6 steganographic techniques hide sensitive data inside visible CoT output — Base64 smuggled in hedge phrases, Morse encoded as punctuation, binary in sentence length parity.

🔀

CHAIN HIJACKING

5 injection payloads intercept an active reasoning chain and redirect the model's conclusion — altering the final answer while the reasoning trace appears coherent.

🔏

ED25519 EVIDENCE

Every payload execution, reasoning trace, and audit finding is hash-chained and Ed25519-signed for tamper-evident forensic reporting and compliance submissions.

// SUBSYSTEMS

Six Subsystems. Complete CoT Coverage.

SERPENT structures its attack capabilities across six cooperating modules — from inflation and steganography through hijacking, exfiltration, loop injection, and a comprehensive 5-phase audit sweep.

INFLATOR 7 payloads

Forces unnecessary reasoning steps to inflate CoT length by 5x to 25x. Exploits API cost billing by reasoning token, creating a low-bandwidth cost amplification DoS.

cost-amplification token-dos complexity-injection step-forcing

STEGO 6 techniques

Detects and demonstrates 6 steganographic channels within reasoning text: Base64 in hedge phrases, Morse in punctuation, binary in sentence parity, LSB in word choice.

base64-stego morse-encode sentence-parity word-lsb punctuation-encode

HIJACKER 5 payloads

Intercepts an active reasoning chain and injects adversarial redirections. The model's visible thought process is corrupted mid-flight, producing a manipulated final answer.

mid-chain-inject conclusion-redirect premise-corruption logic-inversion

EXFILTRATOR 5 vectors

Extracts data from the reasoning process — system prompt leakage, context window echoing, and tool-call parameter exposure hidden inside the visible thought trace.

prompt-leak context-echo param-exposure memory-dump

LOOPER 6 conditions

Injects reasoning conditions that produce infinite or near-infinite loops. Models enter circular logic states, exhausting inference budget while appearing to reason normally.

circular-logic self-reference budget-exhaust condition-cycle

AUDITOR 5 audit phases

5-phase sweep across all SERPENT attack categories. Produces CVSS-mapped findings, MITRE ATLAS technique IDs, and a signed JSON report with per-vector coverage breakdown.

5-phase cvss-mapping atlas-map signed-report

// INFLATOR PAYLOADS

Compute Cost Amplification

Each INFLATOR payload measures the reasoning expansion factor it achieves against a target model — expressed as a multiplier of baseline reasoning tokens for the same prompt.

ID	Technique	Inflation Factor	Target Model Class
SINFL-001	Baseline multi-step forcing	5x	All reasoning models
SINFL-002	Uncertainty amplification	7x	o1 / o3 class
SINFL-003	Contradiction induction	10x	o1 / DeepSeek-R1
SINFL-004	Recursive decomposition	12x	All extended reasoning
SINFL-005	Adversarial context bloat	15x	o1 / Gemini Thinking
SINFL-006	Self-verification spiral	20x	o3 / DeepSeek-R1
SINFL-007	Compound multi-vector	25x	o3 — reasoning budget exhaustion

// HIJACKER PAYLOADS

Reasoning Chain Redirection

HIJACKER intercepts an active reasoning chain at a vulnerable step transition and injects an adversarial premise — redirecting the model's final conclusion without making the visible reasoning appear obviously corrupted.

ID	Hijack Type	Injection Point	Outcome
SHIJ-001	Premise substitution	Step 1 — initial assumption	Inverted conclusion
SHIJ-002	Mid-chain logic swap	Step N/2 — mid-reasoning	Redirected output
SHIJ-003	Authority injection	Evidence evaluation step	False authority accepted
SHIJ-004	Conclusion pre-loading	Pre-answer step	Desired conclusion forced
SHIJ-005	Counterfactual pivot	Any step — counterfactual context	Alternative-world reasoning accepted

// STEGO & EXFIL DETECTION

Steganographic Channels and Exfil Vectors

SERPENT STEGO detects hidden data channels within reasoning output. EXFILTRATOR probes 5 vectors that expose system prompt contents, tool parameters, and context window data via the visible thought trace.

ID	Channel	Carrier	Capacity (bps)
SSTEG-001	Base64 in hedge phrases	Epistemic qualifiers ("possibly", "likely")	~40 bps
SSTEG-002	Morse in punctuation	Comma vs. semicolon spacing patterns	~8 bps
SSTEG-003	Binary in sentence parity	Even/odd word count per sentence	~12 bps
SSTEG-004	LSB in word choice	Synonym selection (formal vs. informal)	~6 bps
SSTEG-005	Acrostic encoding	First letter of each reasoning step	~30 bps
SSTEG-006	Step-count binary	Number of sub-steps per reasoning block	~10 bps

ID	Exfil Vector	Data at Risk	Severity
SEXF-001	System prompt echo	Full system prompt contents	CRITICAL
SEXF-002	Context window dump	Prior conversation turns	CRITICAL
SEXF-003	Tool parameter exposure	Tool call arguments including credentials	CRITICAL
SEXF-004	Memory read-back	Agent memory store contents	HIGH
SEXF-005	Identity leak	User UUID and session metadata	HIGH

// LOOPER — INFINITE REASONING INJECTION

Inference Budget Exhaustion

SERPENT LOOPER injects conditions that trap reasoning models in circular or near-infinite logic states — consuming the full inference budget without producing a useful output.

ID	Loop Type	Mechanism	Budget Impact
SLOOP-001	Self-referential contradiction	Injects a statement true only if false	100% (timeout)
SLOOP-002	Mutual dependency	A requires B, B requires A — no resolution	100% (timeout)
SLOOP-003	Verification spiral	Prompts model to verify its own verification	~85% budget
SLOOP-004	Infinite decomposition	Sub-problem always generates new sub-problem	~90% budget
SLOOP-005	Conflicting axiom set	Injects axioms that cannot be simultaneously satisfied	100% (timeout)
SLOOP-006	Halting problem simulation	Poses an undecidable problem requiring exhaustive search	100% (timeout)

// INSTALL & CLI

Deploy SERPENT

Available on PyPI. Runs on all major security and general-purpose Linux distributions, macOS, and Windows.

pip install red-specter-serpent

SERPENT // FULL RUN

# Run inflation attack — 15x compute amplification
$ serpent inflator --target agent://reasoning-model --payload SINFL-005 --mode live

SERPENT INFLATOR v1.0.0 — CHAIN-OF-THOUGHT ATTACK ENGINE
━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━
[+] Target         : agent://reasoning-model (o1-preview)
[+] Payload        : SINFL-005 — Adversarial context bloat
[+] Baseline tokens: 847 reasoning tokens
[!] Inflated tokens: 12,892 reasoning tokens (+1423%)
[!] Inflation factor: 15.2x (expected: 15x)
[!] Cost amplification: $0.031 → $0.463 per call
[+] Ed25519 evidence signed — SINFL-2026-001-A

# Detect steganographic channels in reasoning output
$ serpent stego --target agent://reasoning-model --detect-all

[+] SSTEG-001 (Base64/hedge)  : DETECTED — 3 occurrences
[+] SSTEG-005 (acrostic)      : DETECTED — "EXFIL" found in step initials
[!] SSTEG-002 (Morse/punct)   : NOT DETECTED — model uses consistent punct
[+] Stego channels found: 2/6

# Run hijacker — redirect conclusion
$ serpent hijacker --target agent://reasoning-model --payload SHIJ-003 --mode live

[+] Hijack point   : Evidence evaluation step (step 4/7)
[+] Injected       : False authority source accepted
[!] Conclusion redirected: APPROVE → DENY
[!] Reasoning trace appears coherent: YES
[+] Evidence signed — SHIJ-2026-001-A

# Run loop injection — budget exhaustion
$ serpent looper --target agent://reasoning-model --payload SLOOP-001 --mode live

[!] Model entered circular reasoning state
[!] Inference budget exhausted at 100% — no output produced
[+] Evidence signed — SLOOP-2026-001-A

# Generate signed audit report
$ serpent report --format json --sign --output serpent_report.json

[+] 14 findings (5 CRITICAL / 6 HIGH / 3 MEDIUM)
[+] MITRE ATLAS techniques mapped: AML.T0051, AML.T0048, AML.T0043
[+] Hash-chain: SHA-256 over all findings
[+] Ed25519 signature applied
[+] Report: serpent_report.json

CLI Commands inflator · stego · hijacker · exfiltrator · looper · audit · report

Output Formats JSON · Markdown · Splunk HEC · Sentinel · QRadar

Target Models o1 · o3 · DeepSeek-R1 · Gemini Thinking · Claude Extended

// EVIDENCE CHAIN

Forensic-Grade Report Integrity

Every SERPENT attack execution is hash-chained and Ed25519-signed — producing tamper-evident artefacts suitable for penetration test reports, regulatory compliance filings, and legal proceedings.

// EVIDENCE CHAIN — SERPENT REPORT FLOW

01 INFLATOR measures baseline and post-inflation token counts — delta recorded

02 STEGO detects hidden channels — each occurrence SHA-256 hashed with timestamp

03 HIJACKER records pre- and post-injection reasoning traces for comparison

04 EXFILTRATOR documents extracted data fragments with source classification

05 LOOPER logs budget exhaustion events with token consumption proof

06 REPORT hash-chains all events, maps ATLAS techniques, applies Ed25519 signature

// SIEM INTEGRATION

Native SIEM Telemetry

SERPENT emits structured telemetry in Splunk HEC, Microsoft Sentinel, and IBM QRadar formats. CoT attack events integrate directly into your SOC detection workflow.

COT_INFLATION

Inflation event with baseline tokens, inflated tokens, factor, estimated cost delta, and payload ID.

AML.T0048 INFLATOR

COT_STEGANOGRAPHY

Hidden channel detection event. Channel type, carrier text excerpt, capacity estimate, and occurrence count.

AML.T0043 STEGO

COT_HIJACK

Chain hijacking event. Injection point (step number), pre-injection conclusion, post-injection conclusion.

AML.T0051 HIJACKER

COT_EXFIL

Exfiltration event. Vector ID, data classification (system_prompt/context/tool_params), fragment hash.

AML.T0040 EXFILTRATOR

COT_LOOP

Loop injection event. Loop type, token budget consumed, timeout flag, and loop condition description.

AML.T0048.002 LOOPER

COT_AUDIT

Audit phase completion. Phase ID (1–5), vectors tested, findings count, and coverage percentage.

AML.T0057 AUDITOR

// NIGHTFALL PIPELINE

Position in the NIGHTFALL Framework

SERPENT (Tool 37) sits in the Reasoning Attack track of NIGHTFALL. It accepts memory context from LAZARUS and its exfiltrated data feeds into JANUS guardrail bypass targeting.

T35

VECTOR

T36

LAZARUS

T37

SERPENT

T38

JANUS

T39

ARCHITECT

T40

WARLORD

T41

FIREBALL

T42

RAGNAROK

T43

ECLIPSE

T44

SHROUD

T45

APOCALYPSE

T65

NIGHTFALL

// UNLEASHED — SAFETY MODEL

Three-Mode Operational Safety

SERPENT implements the NIGHTFALL UNLEASHED safety model — Ed25519 dual-gate activation ensures every live operation is signed, scoped, and forensically traceable.

DETECT MODE

Passive Analysis

AUDITOR runs a read-only 5-phase sweep. No payloads are injected. Identifies CoT vulnerability surface — inflation susceptibility, steganographic channel presence, loop conditions — without any active attack.

DRY-RUN MODE

Simulated Attack

Full attack simulation with no payload committed to the target. INFLATOR, HIJACKER, STEGO, EXFILTRATOR, and LOOPER execute in emulation — outputs show what would succeed in live mode.

LIVE MODE

Authorised Execution

Requires Ed25519 UNLEASHED key. Payloads are injected, inflation is measured, hijacking is confirmed, exfiltration is documented. Every action is hash-chained and signed for legal defensibility.

// PLATFORMS

Runs Everywhere You Operate

SERPENT is tested and verified on all major security and general-purpose platforms.

🐉

Kali Linux

🦜

Parrot OS

🏹

BlackArch

🔬

REMnux

⚔️

Tsurugi

📦

PyPI

🍎

macOS

🪟

Windows

🐳

Docker

⚠ AUTHORISED USE ONLY

SERPENT is a professional security research tool. All capabilities are provided exclusively for authorised penetration testing, red team engagements, academic research, and defensive AI security assessment. Use requires written authorisation from the target system owner. Unauthorised access to AI reasoning models, production systems, or inference infrastructure is illegal under the Computer Misuse Act 1990, CFAA, and equivalent legislation in all jurisdictions. Red Specter Security Research Ltd assumes no liability for misuse. UNLEASHED live mode requires a valid Ed25519 operator key and signed engagement scope file.