Overview
Welcome to Raven
Raven auto-discovers trading ideas from Twitter, backtests them against Polymarket history, paper-tests the survivors for 7 days, then deploys a small amount of real capital on the ones that pass. You don't build strategies by hand β you review what Raven finds.
Your command centre. Shows live stats across all your signal configs and a feed of recent pipeline jobs. Green numbers mean the system is working.
Heatmap of Alpha
System Pulse
Pipeline snapshot
Loading status summaryβ¦
Queued
β
Running
β
Pass
β
Fail
β
Consolidated into Overview in Phase 5b. Technical pipeline table remains as a hidden legacy view.
Recent pipeline jobs Live feed
| Name | Status | Hit rate | Verdict | Plugin | Ingest | What happened | Date |
|---|
Twitter / X β find ideas before you trade
Raven watches public tweets for you. You choose which accounts and which words matter. Matching posts land in the columns below so you can skim, sort, and send the good ones to backtesting (a historical “what if I had traded this?” check).
Watch List Settings
Keep it simple
- You do not need this page. If you only use My Signals and Run Backtest, you are doing it the straightforward way β ignore Twitter ideas.
- Use this page as an extra inbox. Same end goal (backtests on My Signals), but you skim tweets in columns first, then hit Promote. Link a signal when you save the watch list if you already have one; otherwise the first Promote can create a signal for you β no need to understand the plumbing.
Simple version: tweets show up as cards. Drag-style buttons move a card between columns.
Promote sends a tweet into the same backtest pipeline as My Signals (if you linked a signal to this watch list).
Technical: Stage-1 intake β triage β Kanban β optional Stage-2 pipeline_jobs.
Usage today
βLoadingβ¦
Set up a new watch list
You are creating a saved recipe for what to pull from Twitter/X. You do not need to know handles by heart β use the suggested accounts or search below.
Words to watch for
βKeywords = things people might tweet about that relate to Polymarket-style events (economy, sports, politics, crypto, etc.).
Selected keywords
Twitter / X accounts
βTap + add on any row. You can still type a handle yourself under “Find or add” if it is not on the list.
Selected @handles
Discovered Strategies
Raven reads Twitter every 15 minutes, passes candidate tweets through an LLM strategy generator, backtests the output, and auto-promotes passes to a 7-day paper soak. Each card below is one strategy in flight β no setup required. Use the Studio co-pilot to author custom strategies.
Auto-generate strategies
Runs every 15 minutes when on.
Loadingβ¦
Filter cards
Markets
Cached from Polymarket discovery. Run discover_markets to refresh.
Cached list of active Polymarket markets. Raven uses this to match your signals to tradeable markets. Refresh with the discover task if the list looks stale.
| Question | Volume | End | YES | NO |
|---|
Pipeline
Each discovered strategy flows left-to-right: tweets scraped from Twitter β LLM-generated strategies backtested β passes promoted to paper soak β profitable papers promoted to live.
Loadingβ¦
Backtest
βLoadingβ¦
Paper
βLoadingβ¦
Live
βLoadingβ¦
Backtest verdicts
Completed backtests per hour (last 7 days). Pass (green), fail (red), waiting on data (dashed amber), other (slate).
No completed backtests in this window yet.
Technical log (raw job rows)
| Expand row | Config | Job type | Status | Hit rate | Verdict | Plugin | Ingest | What happened | Actions | Created |
|---|
Market Maker
Set up the Market Maker, pick how much capital to risk, and monitor live activity. Paper trading only β no real money until you switch to a live stage.
ποΈ Live Floor
Inventory, quotes, fills, and skip reasons at a glance. Updates every cycle.
Quote Ladder
P&L Decomposition
Live Fill Feed
Skip-Reason Feed
π Phase Passport
Where each strategy is in the Backtest β Paper β Live lifecycle.
Backtest
Paper
Live
Backtest detail
Live detail
π¦ Poeboy
Autonomous trading partner β what he's thinking and what needs your nod.
π Pending approvals Β· β¦
Tap Approve, then Apply. Whitelist on params.π Today's scorecards
Rolling 7-day per running config.
π¦ Recent cycles
Heartbeat 4h Β· daily review 06:30 Β· orchestrator 6h Β· optimizer Monday.
β Recently applied
Decisions that already moved a mm_config column.
π§ͺ Studio
Describe a strategy in plain English. Raven writes the code, runs a backtest, and shows you the result.
Start mode
Market context
Select a signal to load context.
Tip: βK or CtrlK opens inline edit
Restore loads the saved snapshot for that accepted edit (newer accepts keep their snapshots).
Portfolio
Follow strategies from paper soak β live on the Pipeline & trades tab, then switch to Positions for the execution ledger (open, closed, awaiting price).
Trade Flow
Dashboard view of strategies moving through the pipeline. The equity curve below covers both paper soaks and live positions. Empty rows here mean no strategy has reached paper or live yet β check the Discovered Strategies feed for earlier stages.
Equity Curve
| Market | Strategy | Stage | Side | Entry | Current | P&L | Actions |
|---|
Positions
All trades Raven has placed or is tracking. Open = currently in market. Closed = resolved. Awaiting Price = signal fired, order not yet executed.
Open / Paper / Live
0 | Paper: 0 | Live: 0
Won / Lost / Win Rate
0 / 0 (0%)
Unrealised / Realised / Total P&L
$0 | $0 | $0
P&L Dashboard
Your running profit and loss across all strategies and time periods. Equity curve shows cumulative performance. Daily bars show day-by-day results. The per-strategy snapshot ranks closed-trade P&L visually; the table below keeps exact averages and profit factor.
Equity Curve
Daily P&L (Last 30 Days)
Per-strategy P&L snapshot
Bars share one scale (largest win or loss in this list). Solid green extends right from the centre line; striped red extends left β colour plus pattern encodes sign.
Per-Strategy Breakdown
| Strategy | Trades | Win Rate | Total P&L | Avg Win | Avg Loss | Profit Factor |
|---|
Drawdown
Max Drawdown (All Time)
β
Current Drawdown
β
Daily DD vs Limit
$0 / $0
Inline edit
Describe the change. Nothing applies until you accept the diff.
Precise keeps edits surgical. Balanced is the default. Bold allows larger rewrites β always read the diff before accepting.
Current: Balanced
Preview diff
Rename strategy
Display name is safe anytime. Changing the plugin slug updates how signals reference this code.
Replay Trade
Strategy Performance
Detailed analysis of a backtest run. Select a job to see its equity curve, drawdown profile, win/loss distribution, and Kelly sizing. These are historical results — they don't guarantee future performance.
No backtest for Market Maker strategies
Market Maker is validated by live paper-trade soak rather than a historical backtest β the Avellaneda-Stoikov quote path doesn't have a one-shot historical equivalent. Look at paper P&L + fills instead of a verdict curve.
β Open Phase PassportEquity Curve
Drawdown
Win/Loss Distribution
Kelly Sizing
Recent step risk: absolute change in equity between backtest steps (last 40). Shown next to Kelly for context β not a second Kelly estimate.
Signal Decay Curve
Market Regime Heatmap
Live vs Paper Comparison
No live trading data yet. Positions will appear here once trades are closed.
Equity Curve (Live)
Risk Parameters
Per-signal guardrails. These limits control how much capital Raven can deploy on a single trade, how many positions can be open at once, and when to stop trading. Set these before activating any signal.
Signal Config
Select a signal to view and adjust its risk parameters.
Position Sizing
Risk Controls
Fraction of Kelly criterion to apply. Quarter-Kelly (0.25) is the standard conservative choice.
Exposure Preview
With a $1,000 bankroll: max $25 per trade, $75 total exposure across 3 positions.
Conversations
Operator Health
Test Center
Automated health checks across platform infrastructure and pipeline stages. Results are saved for 30 days.
Recent history
Alerts
Market Maker
MM "paper" simulates post-only CLOB quotes and fills (api/mm/paper_fills.py)βit is not the same as dashboard pipeline backtest verdicts (Postgres historical engine) or the CLI 7-day paper gate.
Inventory in the table is YES shares; the cap max_inventory_usd is enforced on USD notional (β shares Γ mid). See docs/STRATEGY-PARITY.md and docs/STRATEGY-HARNESS.md in the repo.
New MM Config
Config Detail
Active Markets
| Market | Mid | Bid | Ask | YES sh | Notional | Cap % | Buy ID | Sell ID |
|---|
Recent Fills
| Time | Market | Side | Price | Size | Mode |
|---|
Settings
Market cache
Open markets cacheMarket discovery is now accessed from Settings. This opens the cached active-markets table and pin-to-signal tools.
Integration tests
Run all integration checks at once. Individual test buttons are on each card below.
Polymarket
PolyBackTest Data API
βOptional: connect your PolyBackTest key for crypto Up/Down + Binance research data. Keys are encrypted; Raven never displays them after save.
Click Refresh to check PolyBackTest (uses server-side key: yours or deployment fallback).
LLM Key
API key: β’β’β’β’β’β’β’β’
Billing
Subscription and Stripe customer (source of truth for your tier after checkout).
- Current tier
- β
- Next billing
- β
- Stripe customer
- β
Webhook URL
Receive alert payloads via POST JSON when sentiment spikes fire.
Twitter search
Confirms your deployment can reach the configured Twitter data path. This does not change intake settings.
API Keys
Programmatic access to the API. Keys are shown only once when created.
| Name | Created | Last used | Expires |
|---|
Help
How Raven Works
π¦
Twitter Signal
Detect market-moving tweets
π
Backtest
Validate against history
π§ͺ
Paper Test
7-day simulated trading
π
Live Trading
Real money on Polymarket
π¦ Twitter Signal
What happens: Raven monitors specific Twitter accounts and keywords. When a matching tweet is detected, an LLM scores its market relevance and extracts a directional signal (BUY/SELL).
Pass criteria: The LLM confidence score must exceed the configured threshold.
Duration: Seconds β signals fire in near real-time.
What can go wrong: Twitter API rate limits, LLM hallucination, stale account lists, or keyword noise from unrelated tweets.
π Backtest
What happens: Raven replays the signal against historical Polymarket price data. It simulates what would have happened if the signal had been traded over the lookback period.
Pass criteria: Hit rate ≥ 55%. The signal must correctly predict price direction more often than not.
Duration: 10β60 seconds depending on data volume.
What can go wrong: Insufficient historical data, overfitting to past patterns, or survivorship bias if the market has already resolved.
π§ͺ Paper Test
What happens: Raven trades the signal in simulation mode for 7 days using live market prices but no real money. Positions are tracked as if they were real.
Pass criteria: Positive P&L over the 7-day window and hit rate remains above threshold.
Duration: 7 days.
What can go wrong: Market conditions during the test period may not be representative. Slippage and liquidity aren't fully simulated.
π Live Trading
What happens: Raven places real orders on Polymarket using your connected wallet. Positions are sized according to your Risk Parameters (Kelly fraction, max position, stop-loss).
Pass criteria: N/A β this is live. Monitor P&L and drawdown continuously.
Duration: Ongoing until you deactivate the signal or a stop-loss triggers.
What can go wrong: Slippage, liquidity gaps, API failures, market manipulation, or model degradation. You can lose money. Past backtest/paper results do not guarantee live performance.
Frequently Asked Questions
Glossary
Watch list
A saved set of Twitter/X accounts and keywords Raven should search. Shown under Twitter ideas. Not the same as a full signal β link a signal if you want one-click backtests from promoted tweets.
Signal
A Twitter-based trigger that Raven watches for. When a matching tweet is detected, the signal fires and enters the pipeline.
Backtest
A historical simulation that replays a signal against past market data to measure its predictive accuracy.
Hit Rate
The percentage of trades where the market moved in the predicted direction. 55% is the default pass threshold.
Paper Test
A 7-day simulated trading period using live prices but no real money. The final validation before live trading.
Live Trade
A real trade placed on Polymarket with actual funds from your connected wallet.
Equity Curve
A line chart showing cumulative portfolio value over time. An upward slope means you're making money.
Drawdown
The decline from a portfolio's peak value to its lowest point. Measures the worst losing streak.
Kelly Fraction
A mathematical formula for optimal bet sizing. Quarter-Kelly (0.25) is standard β it sacrifices some growth for much lower volatility.
Stop-Loss
An automatic exit trigger that closes a position when unrealised loss exceeds a set percentage of entry value.
Entry Side
The direction of the trade: BUY (bet YES, price will rise) or SELL (bet NO, price will fall).
Lookback Hours
How long after a signal fires to check whether the market moved. Shorter is more reactive but noisier; 4h is a good default.
Profit Factor
Gross profit divided by gross loss. Above 1.0 means profitable; above 1.5 is good; above 2.0 is excellent.
Adverse Selection
When you consistently trade against better-informed counterparties. A sign your signal is known to the market and priced in.
Awaiting Price
A position state where the signal fired but the order hasn't been executed yet. May need manual attention if stuck.