11 KiB

Raw Blame History

TRADING AGENT SYSTEM OVERHAUL: Technical Requirements Document (v3.0)

Status: ✅ COMPLETE / PRODUCTION-READY Objective: The system is now a deterministic, institutional-grade decision engine.

1. CORE ARCHITECTURE: The Data Registrar (Immutable Reality)

Goal: Prevent hallucination, time-drift, and "dirty reads" by freezing the state of the world before any agent wakes up.

1.1. Canonical Data Fetching

Requirement: The graph must execute a DataRegistrar node exactly once at the START.
Constraint: Downstream agents (Market, News, Fundamentals) are FORBIDDEN from calling external data tools. They must strictly read from state["fact_ledger"].
Scope: The Registrar must fetch and bundle:
- Price Data (OHLCV + Technicals)
- Fundamental Data (Balance Sheet, Income, Cash Flow)
- News & Sentiment (Raw text/JSON)
- Insider Transactions

1.2. Cryptographic Auditability & Schema

Requirement: The FactLedger must be cryptographically sealed and include explicit freshness metadata.

Schema Definition:

{
  "ledger_id": "UUID-v4",
  "created_at": "ISO-8601 UTC Timestamp",
  "freshness": {
    "price_age_seconds": 32.5,       // Allow max 60s
    "fundamentals_age_hours": 6.0,   // Allow max 24h
    "news_age_hours": 1.0            // Allow max 4h
  },
  "source_versions": {
    "price": "yfinance_v2@2026-01-15T...",
    "news": "serper@2026-01-15T..."
  },
  "data_payload": { ... },           // The actual data content
  "hash": "SHA-256(data_payload)"    // Hash of PAYLOAD ONLY (Metadata excluded)
}

1.3. The "Fail-Fast" Kill Switch

Requirement: If any critical data source fails or exceeds freshness limits:
- The system must ABORT IMMEDIATELY (Raise Exception).
- No LLM agents shall be invoked.
- No partial degradation is allowed for trading decisions.

2. EXECUTION LAYER: The Omnipotent Gatekeeper

Goal: Separate "Decision Generation" (LLM) from "Decision Authorization" (Python). Stop the Trader from executing invalid or dangerous orders.

2.1. Machine-Readable Return Codes (Enums)

Requirement: The Gatekeeper must return specific ExecutionResult Enums, never generic strings.

Codes:

APPROVED: Trade passes all checks.
ABORT_COMPLIANCE: Insider flag or restricted list hit.
ABORT_DATA_GAP: Data found to be stale or missing during verification.
ABORT_LOW_CONFIDENCE: Trader confidence < 0.7.
ABORT_DIVERGENCE: Analyst disagreement exceeds threshold.
BLOCKED_TREND: "Don't Fight the Tape" rule triggered.

2.2. Consensus Divergence Check (Normalized)

Requirement: Quantify disagreement between Bull and Bear analysts to detect "Epistemic Uncertainty."
Formula: Divergence_Score = abs(Bull_Score - Bear_Score) * mean_confidence

Logic:

High Disagreement + High Confidence = DANGER (ABORT).
High Disagreement + Low Confidence = NOISE (Ignore/Size Down).

2.3. Deterministic Trend Override (Counterfactuals)

Requirement: Block "SELL" orders on high-growth assets in strong uptrends using FactLedger data.

Logging Requirement: When a trade is blocked, log the Counterfactual:

{
  "event": "TRADE_BLOCKED",
  "rule": "STRONG_UPTREND_PROTECTION",
  "original_intent": "SELL 100 SHARES",
  "executed_action": "HOLD",
  "counterfactual_outcome": "Would have sold into a +30% growth/bull regime."
}

2.4. Abort Semantics

Constraint: ABORT != HOLD.
- HOLD is a strategic decision to do nothing.
- ABORT is a system failure or safety trigger.
Action: Aborted trades must trigger an alert to the HumanReviewQueue (log file or dashboard).

3. INTELLIGENCE LAYER: Bounded & Conditioned Learning

Goal: Prevent "Recency Bias" and "Overfitting" by forcing the Reflector to respect math and regimes.

3.1. Agent Attribution Scoring

Requirement: The Reflector must assign performance scores to individual agents based on the outcome.
Constraint: The sum of attribution scores (negative or positive) must not exceed 1.0. (Prevents "blaming everyone" for a single loss).

3.2. Regime-Conditioned Memory

Requirement: Every memory/lesson must be tagged with the context in which it was learned.

{ "lesson": "Tighten stops", "regime": "VOLATILE" }

Retrieval Rule: The Trader may ONLY retrieve lessons that match the Current Regime. (e.g., Do not fetch "Bear Market" lessons during a "Bull Market").

3.3. Bounded Parameter Tuning (The Safety Rails)

Requirement: Python code must validate all UPDATE_PARAMETERS suggestions from the LLM.

Velocity Brake: If a parameter is adjusted in the same direction for 3 consecutive sessions:

FREEZE that parameter.
Flag for Human Review. (Reason: Prevents runaway drift or "death-by-a-thousand-tweaks").

4. OPERATIONAL SAFETY: The Human Loop

Goal: Operationalize human oversight so it isn't just a theoretical concept.

4.1. The "Cold" Review Path

Requirement: The system must produce a human_review.json log file after every run.

Content:

Any ABORT_* events.
Any BLOCKED_TREND overrides.
Any Parameter updates flagged by the Velocity Brake.
Any Drift > 20% from baseline defaults.

4.2. Hard Stop

Requirement: If the cash_balance drops by > 15% in a single session (simulation or live), the DataRegistrar MUST refuse to run subsequent sessions until a manual reset_flags command is issued.

PHASE 1: THE FOUNDATION (Immutable Reality)

Objective: Eliminate hallucination and time-drift by implementing the Data Registrar and killing tool-usage downstream.

1.1. Core Schema & State

Define Enums: Implement ExecutionResult (APPROVED, ABORT_COMPLIANCE, ABORT_DATA_GAP, etc.) to ensure machine-readable logs.
Define Ledger: Implement FactLedger TypedDict with freshness, source_versions, and content_hash.
Immutability Guard: Implement write_once_enforce reducer to trigger a hard crash if any agent attempts to mutate the ledger after creation.

1.2. The Data Registrar Node

Central Fetch: Move all data fetching logic (get_stock_data, get_fundamentals, get_news, get_insider) into data_registrar.py.
Poisoning Guard: Implement a check that raises a hard exception if price_data or fundamental_data is missing or empty (Partial Payload Protection).
Hashing: Implement SHA-256 hashing of the data payload (excluding metadata) for auditability.
Freshness: Implement logic to calculate data age and raise an exception if data is stale (e.g., Price > 60s old).

1.3. Analyst Refactoring (The "Lobotomy")

Market Analyst: Remove get_stock_data tool binding. Update prompt to ingest state["fact_ledger"]["price_data"] directly.
Fundamentals Analyst: Remove get_fundamentals tool binding. Update prompt to ingest state["fact_ledger"]["fundamental_data"].
News/Social Analysts: Remove get_news tool binding. Update prompt to ingest state["fact_ledger"]["news_data"].
Verification: Assert that no tools are passed to these agents during graph construction.

1.4. Graph Wiring

Reroute: Update setup.py to route START → DataRegistrar → Market Analyst.
Test: Execute a run. Verify that if the Registrar fails, the graph aborts immediately and no LLM tokens are consumed.

PHASE 2: THE GUARDRAILS (Execution Gatekeeper)

Objective: Separate "Decision Generation" from "Decision Authorization" using deterministic python logic.

2.1. Gatekeeper Logic Core

Create Class: Implement ExecutionGatekeeper in a new file.
Compliance Check: Scan fact_ledger["insider_data"] for restricted flags. Return ABORT_COMPLIANCE if found.
Data Re-Verification: Check fact_ledger["freshness"] again at the moment of execution. Return ABORT_DATA_GAP if expired.

2.2. Consensus & Directionality Rules

Divergence Logic: Calculate Divergence_Score = abs(Bull_Score - Bear_Score) * Confidence. If score > Threshold, return ABORT_DIVERGENCE.
Direction Consistency: Compare Trader Direction (Buy/Sell) vs. Mean Analyst Direction.
- Rule: If Trader says "BUY" but Average Analyst says "SELL", return ABORT_DIRECTION_MISMATCH.

2.3. Deterministic Trend Override

Logic: Implement the "Don't Fight the Tape" rule:
- IF (Regime == BULL) AND (Price > 200SMA) AND (Growth > 30%): BLOCK_SELL.
Counterfactual Logging: If a trade is blocked, log the specific event: {"event": "BLOCKED_TREND", "intent": "SELL", "action": "HOLD"}.

2.4. Integration

Wire Node: Insert ExecutionGatekeeper between Trader and END.

2.4 Phase 2.7: Institutional Safety (Hardening)

Pulse Check: A pre-trade live market verify. Abort if drift > 3%.
Market Hours: Trade only during NYSE sessions (9:30-16:00 EST).
Split Check: Massive drift (>50%) triggers a corporate action abort.
Deterministic Flow: Insider math is computed as a float in the Registrar, not sniffed in the Gatekeeper.

PHASE 2 STATUS: ✅ 100% VERIFIED via verify_logic_v2_7.py.

PHASE 3: THE INTELLIGENCE (Bounded Learning)

Objective: Implement safe, attributed parameter tuning that respects market regimes.

3.1. Attribution Scoring

Reflector Update: Modify the reflection prompt to output a specific performance score (0.0 - 1.0) for each agent based on the trade outcome.
Sparse Scoring: Enforce a rule that scores must be decisive (e.g., ≥ 0.7 or ≤ 0.3) to prevent "diffuse blame."

3.2. Parameter Validator

Velocity Brake: Implement logic to track the last 3 updates for every parameter. If the direction is identical 3x in a row, return REJECT_UPDATE (Freeze Parameter).
Rollback: Implement revert_last_update() functionality to undo the previous parameter change if performance degrades.

3.3. Regime-Conditioned Memory

Tagging: Update the memory saver to tag every lesson with {"regime": current_regime}.
Retrieval: Update the Trader's memory retrieval to filter strictly by the current_regime (e.g., do not fetch Bear Market lessons during a Bull Market).

PHASE 4: OPERATIONAL SAFETY (The Human Loop)

Objective: Make the system observable and manually stoppable.

4.1. The "Cold" Review Path

Logger: Create human_review_logger.py.
Event Hooks: Wire ABORT_*, BLOCKED_TREND, and FREEZE_PARAMETER events to write to an append-only human_review.json file.

4.2. Circuit Breakers

Sticky Breaker: Implement a lockfile mechanism.
Rule: If Cash_Balance < 85% of starting capital, write a lockfile to disk.
Enforcement: DataRegistrar must check for this file on startup and refuse to run until a human manually deletes it.

11 KiB Raw Blame History

TRADING AGENT SYSTEM OVERHAUL: Technical Requirements Document (v3.0)

1. CORE ARCHITECTURE: The Data Registrar (Immutable Reality)

1.1. Canonical Data Fetching

1.2. Cryptographic Auditability & Schema

1.3. The "Fail-Fast" Kill Switch

2. EXECUTION LAYER: The Omnipotent Gatekeeper

2.1. Machine-Readable Return Codes (Enums)

2.2. Consensus Divergence Check (Normalized)

2.3. Deterministic Trend Override (Counterfactuals)

2.4. Abort Semantics

3. INTELLIGENCE LAYER: Bounded & Conditioned Learning

3.1. Agent Attribution Scoring

3.2. Regime-Conditioned Memory

3.3. Bounded Parameter Tuning (The Safety Rails)

4. OPERATIONAL SAFETY: The Human Loop

4.1. The "Cold" Review Path

4.2. Hard Stop

PHASE 1: THE FOUNDATION (Immutable Reality)

1.1. Core Schema & State

1.2. The Data Registrar Node

1.3. Analyst Refactoring (The "Lobotomy")

1.4. Graph Wiring

PHASE 2: THE GUARDRAILS (Execution Gatekeeper)

2.1. Gatekeeper Logic Core

2.2. Consensus & Directionality Rules

2.3. Deterministic Trend Override

2.4. Integration

2.4 Phase 2.7: Institutional Safety (Hardening)

PHASE 3: THE INTELLIGENCE (Bounded Learning)

3.1. Attribution Scoring

3.2. Parameter Validator

3.3. Regime-Conditioned Memory

PHASE 4: OPERATIONAL SAFETY (The Human Loop)

4.1. The "Cold" Review Path

4.2. Circuit Breakers

11 KiB

Raw Blame History