AI-assisted quantitative research pipeline for WorldQuant Brain.
This repository is a sanitized release prepared for portfolio and project review. It keeps the engineering workflow and core code structure, but removes personal credentials, private logs, private research assets, cached performance data, and long-memory snapshots from the original working repository.
- Public-facing repository snapshot
- Sanitized for open-source sharing
- Not intended as a production deployment package
The system uses multiple AI-assisted stages to turn a research direction into candidate expressions:
Stage-1builds a hypothesis from a direction, theme, and available fields.Stage-2generates candidate expressions under hard runtime constraints.- Program-side preflight checks reject invalid expressions before backtest.
- The WorldQuant simulation layer submits and polls backtests.
- Evaluation and feedback route the next iteration based on metrics, checks, turnover, and correlation risk.
The goal is not "LLM writes random formulas", but "LLM operates inside a controlled research pipeline with explicit constraints, verification, and iteration feedback."
Included:
- Core pipeline code under
discover/,agents/,wq/,models/,config/ - Prompt and orchestration logic
- Helper scripts for running and packaging the project
- Public documentation and configuration examples
Excluded from this public release:
- Real
.envsecrets credential.txt- Runtime logs
- Local sqlite caches
- Private research libraries and cached performance data
- Private long-memory data
- Internal handoff notes and local tooling metadata
uv sync --extra devOr:
pip install -e ".[dev]"Copy the example files and fill in your own local credentials:
cp config/env/active.env.example config/env/active.env
cp .env.example .env
cp credential.example.txt credential.txtcredential.txt format:
["your_email@example.com", "your_password"]Interactive mode:
python3 main.pyNon-interactive mode:
python3 scripts/run_discover.py --direction auto --profile PPAC --max-iter 6 --mode test- Setup and runtime notes:
docs/ - Environment profiles:
config/env/ - Sanitization summary:
SANITIZATION_NOTES.md - Contribution guide:
CONTRIBUTING.md
This project addresses a real operational problem: turning noisy research ideas into a repeatable alpha-discovery workflow with:
- hypothesis generation
- field selection under coverage constraints
- structured expression generation
- programmatic validation before costly backtests
- iterative improvement from measured results
Instead of using AI as a chatbot wrapper, the system uses AI as one component in a controlled decision pipeline.
main.py interactive entrypoint
discover/ orchestration, filtering, evaluation, feedback
agents/ Stage-1 and Stage-2 agent logic
wq/ WQ session, field loading, simulation client
models/ LLM provider and structured-output adapters
config/ runtime thresholds and environment loading
scripts/ launch helpers
docs/ project documentation
- This repo is intended as a public-facing project snapshot, not a drop-in production deployment.
- Some workflows depend on external credentials and WorldQuant platform access.
- The original private workspace contains additional local datasets and runtime artifacts that are intentionally not included here.
Released under the MIT License. See LICENSE.