Backtesting vs Paper Trading vs Simulation: What MEV Traders Should Use Before Live Execution
**Answer first** - Backtesting, paper trading, and simulation are three different safety layers. Backtesting asks whether a strategy would have survived historical conditions. Pape

Answer first - Backtesting, paper trading, and simulation are three different safety layers. Backtesting asks whether a strategy would have survived historical conditions. Paper trading asks how it behaves in the current market without moving live funds. Simulation asks whether a specific route, at the current state, clears execution checks before a live send. MEV traders should use all three, but none of them removes market or execution risk.
Many bot losses come from using the wrong test for the wrong question. A spreadsheet backtest cannot tell you whether today's relay path is stale. A paper-trading run cannot prove a historical edge. A simulation pass cannot tell you the next block will behave exactly the same.
The quick comparison
| Layer | Question answered | What it misses | When to use |
|---|---|---|---|
| Backtesting | Did this idea work in historical conditions? | Live competition, current latency, current fee regime | Before spending time on a strategy. |
| Paper trading | Does the strategy behave today without live funds? | Real inclusion, actual wallet permissions, some route timing | Before live capital. |
| Simulation | Would this route pass current-state checks? | Future state changes and competitor action | Before every live execution decision. |
| Capped live test | Does real execution match logs? | Scale effects | Only after the first three layers are clean. |
The order matters. Do not start with a live test just because a backtest looks good.
Backtesting: reject weak ideas early
Backtesting is historical replay. For MEV, useful backtesting should include:
- Real historical block state.
- Actual gas or fee conditions from the period.
- Failed attempts, not only successful paths.
- Slippage and route-size limits.
- Competition assumptions.
- Walk-forward validation across more than one market regime.
Backtesting is best at killing bad ideas. If a strategy cannot survive historical replay after costs, it is unlikely to deserve live capital.
Read more: How to Backtest a MEV Strategy.
Paper trading: observe today's market
Paper trading follows live conditions without submitting a real transaction. It is useful for discovering:
- Whether opportunities appear at the expected frequency.
- Whether logs are clean.
- Whether RPC and relay paths are stable.
- Whether fee estimates are realistic.
- Whether the operator understands the dashboard.
Paper trading is not proof of live performance. It does not expose every inclusion failure, wallet-permission issue, or liquidity shift. Treat it as a readiness check, not a victory lap.
Simulation: the last gate before live execution
Simulation checks whether a candidate route clears execution rules against a current or forked state. For MEV workflows, simulation should reject routes that fail:
- Profit after gas, tips, and software fees.
- Slippage limits.
- Revert guards.
- Wallet balance checks.
- Contract allowlists.
- Per-trade and daily budget caps.
- Strategy-specific risk rules.
FRB's conversion path is intentionally simulation-first: download and verify the build, configure limited settings, run Simulation Mode, review logs, then decide whether a capped live test makes sense.
The four-stage launch path
- Backtest the idea.
- Paper trade in current conditions.
- Run simulation mode with real settings but no live execution.
- Move to capped live execution only after logs and limits are reviewed.
Skipping stages usually creates false confidence. The most common mistake is treating a clean simulation as a reason to scale immediately. The better move is a small live canary with strict caps.
What clean logs look like
Before live execution, logs should show:
- No unknown revert reasons.
- Realistic net-result calculations.
- Stable RPC response times.
- Clear reason codes for skipped routes.
- No unexplained slippage spikes.
- No missing wallet-permission errors.
- Repeatable behavior across more than one session.
If the logs are noisy, the strategy is not ready.
When Each Testing Layer Fails You
Each testing approach has a specific failure mode. Knowing these failure modes prevents false confidence.
Backtesting fails when:
- Your historical dataset doesn't include gas costs accurately — many backtests assume zero slippage and historical gas prices, both of which understate real costs
- The strategy was fitted to the historical period being tested (overfitting) — validate on out-of-sample periods
- Competition is modeled inaccurately — historical data shows what opportunities existed, not how many bots competed for them or how quickly they closed
Paper trading fails when:
- You treat a clean 7-day paper trading run as proof the strategy works — a single week can't capture all market regimes
- Paper trading logs show an opportunity frequency that isn't reproducible in live execution (timing differences between simulation and real execution affect what you'd actually capture)
- You paper trade with parameters you wouldn't use live (e.g., higher slippage tolerance) to see more "successes"
Simulation fails when:
- The forked state is stale — simulation against a 10-minute-old fork may miss liquidity changes that affect actual inclusion
- You simulate with incorrect wallet permissions (e.g., testing with more capital than your real wallet holds)
- Simulation inclusion rate is significantly higher than live — a 30%+ gap typically indicates missing competition in the simulation model
MEV-Specific Considerations for Each Layer
Standard backtesting frameworks (Backtrader, Zipline, vectorbt) weren't designed for MEV. Key adaptations required:
Historical block state: Generic backtesting uses OHLCV price data. MEV requires full block-level data including transaction ordering, gas prices by percentile, and liquidation event timing. This data is available from services like Dune Analytics or the Flashbots data API but requires custom integration.
Competition modeling: A MEV backtest that ignores competition overstates profitability. If you model "every opportunity that existed" as captured, you're assuming no competing bots — which is false for any strategy that has been operating for more than a few months. Add a competition adjustment: assume you capture 60–80% of modeled opportunities (the rest close before your inclusion lands), and that your gas bids need to be at the 75th percentile of competing bids to win.
Gas cost accounting: Include gas costs on both successful and failed transactions (for public mempool strategies). For private relay strategies (Flashbots, Jito), failed attempts cost zero gas — but include the tip cost on successful inclusions. The difference is significant: a Flashbots strategy that fails 40% of the time has much lower gas costs than a public mempool strategy with the same failure rate.
How Long Is Enough?
A common question is how long to run each testing layer before advancing to the next.
Backtesting: Test across at least 3 distinct market regimes — a low-volatility period, a high-volatility period, and a trending period. For Solana meme coin strategies, this might mean testing during a Pump.fun bull run, a quiet period, and a period of high scam activity (filter performance changes with the quality of launches).
Paper trading: Minimum 7 days; prefer 2–4 weeks to capture full weekly patterns. Trading volume on most DEXes has clear weekly patterns — Monday and Tuesday typically have lower volume than Thursday and Friday (US market correlation). A single week may miss this variation.
Simulation: Run until you've seen at least 100 qualifying signals pass all filters, and reviewed the logs on at least 20 of them manually. Systematic review of simulation logs catches configuration errors that summary metrics miss.
Capped live test (canary): Minimum 48 hours or 50 live execution attempts, whichever is longer. Review before scaling.
Internal Links for Next Steps
- Simulation Mode
- Base Playbook: First Private Bundle
- Risk Management for MEV Trading Bots
- Slippage and Budget Caps
- MEV Profitability 2026 — calibrating your return expectations before committing capital
Use Simulation Mode and review the Risk Disclosure before live execution.
Testing can reduce avoidable mistakes, but it cannot predict future market behavior. Performance may vary with liquidity, latency, fees, competition, and strategy settings.
Шаг после прочтения
Запустить панель управления FRB
Подключите свой кошелек, подключите клиент узла к 6-значному PIN-коду и назначьте контракт, упомянутый выше.
Нужен установщик?
Загрузите и проверьте FRB
Загрузите последнюю версию установщика, сравните SHA-256 с версиями, а затем следуйте контрольному списку безопасного запуска.
Проверьте выпуски и SHA‑256Похожие статьи
Дальнейшее чтение и инструменты
Обсуждение
Примечаний пока нет. Добавьте первое наблюдение или поделитесь ссылкой со своей командой на X (@MCFRB).