MLB Models — Inputs Overview

High-level view of what each model family feeds on.
Legend: ✓ uses · ~ partial/proxy · — doesn't use
Two complementary engines. The production models are top-down (team & season-form aggregates + park + weather + bullpen). The per-pitch sim is bottom-up (pitch-level data + per-batter profiles + actual lineups). They share very few inputs, so they look at the game differently.
No model uses the betting line as an input. This is by design: the goal is calibrated accuracy on real outcomes, so the market line serves only as the benchmark for edge/value and never as a feature.
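To make the "benchmark, not feature" distinction concrete, here is a minimal sketch of how an edge could be measured against the market after a model has produced its probability. This overview does not say how edge is actually computed. The American-odds input, the proportional vig removal, and all function names below are assumptions for illustration only.

```python
from typing import Tuple


def american_to_implied_prob(odds: int) -> float:
    """Convert American odds to the implied win probability (vig still included)."""
    if odds == 0:
        raise ValueError("American odds cannot be 0")
    if odds < 0:
        # Favorite: risk |odds| to win 100.
        return -odds / (-odds + 100.0)
    # Underdog: risk 100 to win `odds`.
    return 100.0 / (odds + 100.0)


def remove_vig(p_a: float, p_b: float) -> Tuple[float, float]:
    """Rescale two-sided implied probabilities so they sum to 1.

    Assumption: simple proportional normalization; other de-vig methods exist.
    """
    total = p_a + p_b
    return p_a / total, p_b / total


def edge(model_prob: float, market_prob: float) -> float:
    """Edge = model probability minus the market's no-vig probability."""
    return model_prob - market_prob


if __name__ == "__main__":
    # Hypothetical prices and model output, not real data.
    home_fair, away_fair = remove_vig(
        american_to_implied_prob(-150),
        american_to_implied_prob(+130),
    )
    model_home_prob = 0.62  # produced without ever seeing the line
    print(f"market (no-vig) home prob: {home_fair:.4f}")
    print(f"edge on home side: {edge(model_home_prob, home_fair):+.4f}")
```

In this arrangement the line enters only at the comparison step, so the model's probability can be scored for calibration on real outcomes independently of any market price.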

Model × input family

PROD-ML = production moneyline (full-game) · PROD-TOT = production totals · SIM = per-pitch sim (first-5-innings markets) · SIDE = edge/bet-selection layer.
Input family | PROD-ML | PROD-TOT | SIM | SIDE
PITCHING
Season rate stats (ERA / WHIP / K / BB / HR)
30-day rolling form (xFIP, K-rate, contact, velo)
Recent workload
Handedness
Pitch-level detail (velo / movement / location / mix)
HITTING / LINEUP
Team offense rolling form
Offense vs opposing handedness ~
Lineup-aggregated platoon
Actual lineup order
Per-batter contact/quality profiles
Baserunner advancement
CONTEXT
Park factor
Weather
Bullpen workload
MARKET
Betting line — as a model feature
Betting line — as edge benchmark ~ ~ ~
Overview only. Production models refit continuously on completed games; the sim runs point-in-time per slate.