TradingAgents

Commit Graph

Author	SHA1	Message	Date
Youssef Aitousarrah	c792b17ab6	fix(discovery): fix three scanner hang/validation bugs found in ranker_debug.log 1. executor.shutdown(wait=True) still blocked after global timeout (critical) The previous fix added timeout= to as_completed() but used `with ThreadPoolExecutor() as executor`, whose __exit__ calls shutdown(wait=True). This meant the process still hung waiting for stuck threads (ml_signal) even after the TimeoutError was caught. Fixed by creating the executor explicitly and calling shutdown(wait=False) in a finally block. 2. ml_signal hangs on every run — "Batch-downloading 592 tickers (1y)..." never completes. Root cause: a single yfinance request for 592 tickers × 1 year of daily OHLCV is a very large payload that regularly times out at the network layer. Fixed by: - Reducing default lookback from "1y" to "6mo" (halves download size) - Splitting downloads into 150-ticker chunks so a slow chunk doesn't kill the whole scan (partial results are still returned) 3. C (Citigroup) and other single-letter NYSE tickers rejected as invalid. validate_ticker_format used ^[A-Z]{2,5}$ requiring at least 2 letters. Real tickers like C, A, F, T, X, M are 1 letter. Fixed to ^[A-Z]{1,5}$. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 22:35:42 -08:00
Youssef Aitousarrah	ce2a6ef8fa	fix(discovery): fix infinite hang when a scanner thread blocks indefinitely Two issues caused the agent to get stuck after the last log message from a completed scanner (e.g. "✓ reddit_trending: 11 candidates"): 1. `as_completed()` had no global timeout. If a scanner thread blocked in a non-interruptible I/O call, `as_completed()` waited forever because it only yields a future once it has finished — the per-future `future.result(timeout=N)` call was never even reached. Fixed by passing `timeout=global_timeout` to `as_completed()` so the outer iterator raises TimeoutError after a capped wall-clock budget, then logs which scanners didn't complete and continues. 2. `SectorRotationScanner` called `get_ticker_info()` (one HTTP request per ticker) in a serial loop for up to 100 tickers from a 592-ticker file, easily exceeding the 30 s per-scanner budget. Fixed by batch-downloading close prices for all tickers in a single `download_history()` call, computing 5-day returns locally, and only calling `get_ticker_info()` for the small subset of laggard tickers (<2% 5d move) that actually need a sector label. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-20 22:14:53 -08:00
Youssef Aitousarrah	cb5ae49501	chore: linter formatting + ML scanner logging, prompt control, ranker reasoning - Add ML signal scanner results table logging - Add log_prompts_console config flag for prompt visibility control - Expand ranker investment thesis to 4-6 sentence structured reasoning - Linter auto-formatting across modified files Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 23:04:38 -08:00
Youssef Aitousarrah	43bdd6de11	feat: discovery pipeline enhancements with ML signal scanner Major additions: - ML win probability scanner: scans ticker universe using trained LightGBM/TabPFN model, surfaces candidates with P(WIN) above threshold - 30-feature engineering pipeline (20 base + 10 interaction features) computed from OHLCV data via stockstats + pandas - Triple-barrier labeling for training data generation - Dataset builder and training script with calibration analysis - Discovery enrichment: confluence scoring, short interest extraction, earnings estimates, options signal normalization, quant pre-score - Configurable prompt logging (log_prompts_console flag) - Enhanced ranker investment thesis (4-6 sentence reasoning) - Typed DiscoveryConfig dataclass for all discovery settings - Console price charts for visual ticker analysis Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-09 22:53:42 -08:00
Youssef Aitousarrah	41e91e72d1	fix: load scanners in __init__ to survive linter auto-fix Final fix for scanner registration issue. Previous attempts to add scanner import at module level were removed by the pre-commit hook's ruff --fix auto-formatter. Solution: - Import scanners inside DiscoveryGraph.__init__() method - Use the import (assign to _) so it's not "unused" - Linter won't remove imports that are actually used This ensures scanners always load when DiscoveryGraph is instantiated. Verified: 8 scanners now properly registered Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-05 23:47:26 -08:00
Youssef Aitousarrah	f6943e1615	fix: add noqa comment to prevent linter from removing scanner import The scanner import needs # noqa: F401 to prevent linters from removing it as "unused". The import is required for side effects (triggering scanner registration). Without this: - Pre-commit hook removes the import - Scanners don't register - Discovery returns 0 candidates Fix: - Added # noqa: F401 comment to scanner import - Linter will now preserve this import - Verified 8 scanners properly registered Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-05 23:46:27 -08:00
Youssef Aitousarrah	1d52211383	fix: restore missing scanner import causing 0 recommendations Critical bugfix: Scanner modules weren't being imported, causing SCANNER_REGISTRY to remain empty and discovery to return 0 candidates. Root Cause: - Import line "from tradingagents.dataflows.discovery import scanners" was accidentally removed during concurrent execution refactoring - Without this import, scanner @register() decorators never execute - Result: SCANNER_REGISTRY.get_all_scanners() returns empty list Fix: - Restored scanner import in discovery_graph.py line 6 - Scanners now properly register on module import - Verified 8 scanners now registered and working Impact: - Before: 0 candidates, 0 recommendations - After: 60-70 candidates, 15 recommendations (normal operation) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-05 23:39:20 -08:00
Youssef Aitousarrah	369f8c444b	feat: discovery system code quality improvements and concurrent execution Implement comprehensive code quality improvements and performance optimizations for the discovery pipeline based on code review findings. ## Key Improvements ### 1. Common Utilities (DRY Principle) - Created `tradingagents/dataflows/discovery/common_utils.py` - Extracted ticker parsing logic (eliminates 40+ lines of duplication) - Centralized stopwords list (71 common non-ticker words) - Added ReDoS protection (100KB text length limit) - Provides `validate_candidate_structure()` for output validation ### 2. Scanner Output Validation - Two-layer validation approach: - Registration-time: Check scanner class structure - Runtime: Validate each candidate dictionary - Added `scan_with_validation()` wrapper in BaseScanner - Validates required keys: ticker, source, context, priority - Graceful error handling with structured logging ### 3. Configuration-Driven Design - Moved magic numbers to `default_config.py`: - `ticker_universe`: Top 20 liquid options tickers - `min_volume`: 1000 (options flow threshold) - `min_transaction_value`: $25,000 (insider buying filter) - Fixed hardcoded absolute paths to relative paths - Improved portability across development environments ### 4. Concurrent Scanner Execution (37% Performance Gain) - Implemented ThreadPoolExecutor for parallel scanner execution - Configuration: `scanner_execution.concurrent`, `max_workers`, `timeout_seconds` - Performance: 42s vs 67s (37% faster with 8 scanners) - Thread-safe state management (each scanner gets copy) - Per-scanner timeout with graceful degradation - Error isolation (one failure doesn't stop others) ### 5. Error Handling Improvements - Changed bare `except:` to `except Exception:` (avoid catching KeyboardInterrupt) - Added structured logging with `exc_info=True` and extra fields - Implemented graceful degradation throughout pipeline ## Files Changed ### Core Implementation - `tradingagents/__init__.py` (NEW) - Package initialization - `tradingagents/default_config.py` - Scanner execution config, magic numbers - `tradingagents/graph/discovery_graph.py` - Concurrent execution logic - `tradingagents/dataflows/discovery/common_utils.py` (NEW) - Shared utilities - `tradingagents/dataflows/discovery/scanner_registry.py` - Validation wrapper - `tradingagents/dataflows/discovery/scanners/*.py` - Use common utilities ### Testing & Documentation - `tests/test_concurrent_scanners.py` (NEW) - Comprehensive test suite - `verify_concurrent_execution.py` (NEW) - Performance verification - `CONCURRENT_EXECUTION.md` (NEW) - Implementation documentation ## Test Results All tests passing (exit code 0): - ✅ Concurrent execution: 42s, 66-69 candidates - ✅ Sequential fallback: 56-67s, 65-68 candidates - ✅ Timeout handling: Graceful degradation with 1s timeout - ✅ Error isolation: Individual failures don't cascade ## Performance Impact - Scanner execution: 37% faster (42s vs 67s) - Time saved: ~25 seconds per discovery run - At scale: 4+ minutes saved daily in production - Same candidate quality (65-69 tickers in both modes) ## Breaking Changes None. Concurrent execution is opt-in via config flag. Sequential mode remains available as fallback. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-02-05 23:27:01 -08:00
Youssef Aitousarrah	2376fc74a1	Update	2025-12-11 00:23:28 -08:00
Youssef Aitousarrah	ea4ee9176b	Update	2025-12-09 23:16:53 -08:00
Youssef Aitousarrah	ccc78c694b	Update	2025-12-06 15:39:49 -08:00
Youssef Aitousarrah	5cf57e5d97	Update	2025-12-02 20:49:42 -08:00
luohy15	a6734d71bc	WIP	2025-09-26 16:17:50 +08:00
mirza-samad-ahmed-baig	f704828f89	Fix: Prevent infinite loops, enable reflection, and improve logging	2025-07-03 17:43:40 +05:00
Edward Sun	da84ef43aa	main works, cli bugs	2025-06-15 22:20:59 -07:00
maxer137	99789f9cd1	Add support for other backends, such as OpenRouter and olama This aims to offer alternative OpenAI capable api's. This offers people to experiment with running the application locally	2025-06-11 14:19:25 +02:00
Yijia-Xiao	cc97cb6d5d	chore(release): v0.1.0 – initial public release of TradingAgents	2025-06-05 04:27:57 -07:00

17 Commits