Model Validation
Model Performance Metrics and Diagnostics on Out-of-Sample Data
Better than Player Historical Avg: 16.4%
Top vs Bottom Decile Lift: 2.55x
Player Appearances: 48,441
Prediction Correlation: 0.152
Calibration: Predicted vs Actual Scoring Rates
When we predict a 30% chance of scoring, does it happen ~30% of the time? Perfect calibration follows the diagonal line.
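A calibration check of this kind can be sketched as below. This is an illustration only, not the code behind the chart: the function name `calibration_table`, the equal-width bins, and the toy data are all assumptions.

```python
from typing import List, Tuple

def calibration_table(
    probs: List[float], outcomes: List[int], n_bins: int = 10
) -> List[Tuple[float, float, int]]:
    """Group predictions into equal-width probability bins and return
    (mean predicted probability, observed scoring rate, count) per non-empty bin."""
    sum_pred = [0.0] * n_bins
    sum_hits = [0] * n_bins
    counts = [0] * n_bins
    for p, y in zip(probs, outcomes):
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 goes in the last bin
        sum_pred[idx] += p
        sum_hits[idx] += y
        counts[idx] += 1
    return [
        (sum_pred[i] / counts[i], sum_hits[i] / counts[i], counts[i])
        for i in range(n_bins)
        if counts[i] > 0
    ]

# Toy data, made up for illustration (1 = player scored, 0 = did not)
probs = [0.05, 0.08, 0.31, 0.33, 0.36, 0.32]
outcomes = [0, 0, 1, 0, 0, 1]
table = calibration_table(probs, outcomes)
```

Each row of `table` is one point on the calibration plot. A well-calibrated model keeps the first two values of every row close together.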
Ranking Performance: Scoring Rate by Prediction Decile
Players ranked in the top decile by our predictions score 27.3% of the time, compared to 10.7% for the bottom decile, a 2.55x lift.
Brier Score Comparison
Lower is better. The Brier Score is the mean squared difference between predicted probabilities and actual outcomes.
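As a rough sketch, the standard binary Brier score can be computed as follows. The report does not state its exact formulation, so the binary scored / did-not-score outcome and the name `brier_score` are assumptions.

```python
from typing import Sequence

def brier_score(probs: Sequence[float], outcomes: Sequence[int]) -> float:
    """Mean squared difference between predicted probability and the 0/1 outcome."""
    if len(probs) != len(outcomes) or len(probs) == 0:
        raise ValueError("probs and outcomes must be non-empty and of equal length")
    return sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / len(probs)

# Toy example: squared errors are 0.01, 0.04 and 0.25, so the mean is 0.1
score = brier_score([0.9, 0.2, 0.5], [1, 0, 1])
```

A perfect forecaster scores 0, and a forecast that is confidently wrong on every case scores 1.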
| Model | Brier Score | SFM Improvement |
|---|---|---|
| SFM | 0.0588 | - |
| Naive | 0.0704 | +16.4% |
Naive: Poisson distribution using each player's historical scoring rate from training data.
Based on 30,072 player appearances with training history.
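One plausible reading of the Naive baseline is sketched below. It assumes the historical rate is goals per appearance and that the predicted quantity is the probability of at least one goal. Neither detail is confirmed by the report, and `naive_scoring_probability` is a hypothetical name.

```python
import math

def naive_scoring_probability(goals: int, appearances: int) -> float:
    """P(at least one goal) under a Poisson model whose rate is the
    player's historical goals per appearance (assumed definition)."""
    if appearances <= 0:
        # No training history: the baseline is undefined for this player
        raise ValueError("player needs at least one training appearance")
    rate = goals / appearances
    return 1.0 - math.exp(-rate)  # 1 - P(zero goals)

# Hypothetical player: 6 goals in 30 training appearances, rate 0.2
p = naive_scoring_probability(goals=6, appearances=30)
```

Under this reading, players with no training appearances have no baseline prediction, which fits the note that the comparison covers only appearances with training history.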
Performance by League
SFM outperforms the Naive model in each of the five major European leagues and in the Champions League.
| League | Improvement vs Naive | SFM Brier Score |
|---|---|---|
| Premier League | +11.9% | 0.0597 |
| La Liga | +20.9% | 0.0562 |
| Serie A | +17.7% | 0.0548 |
| Bundesliga | +12.1% | 0.0624 |
| Ligue 1 | +18.5% | 0.0620 |
| Champions League | +23.4% | 0.0585 |
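The improvement figures appear to be relative reductions in Brier score against the Naive baseline. That reading is an assumption, and `relative_improvement` is a hypothetical helper. Applied to the rounded overall scores (Naive 0.0704, SFM 0.0588), it gives about 16.5%, close to the reported 16.4%, which was presumably computed from unrounded values.

```python
def relative_improvement(baseline: float, model: float) -> float:
    """Fractional reduction in Brier score relative to the baseline."""
    if baseline <= 0:
        raise ValueError("baseline Brier score must be positive")
    return (baseline - model) / baseline

# Overall Brier scores as reported (rounded to four decimals)
overall = relative_improvement(baseline=0.0704, model=0.0588)
print(f"{overall:.1%}")
```

The same function applies per league or per season once the baseline and model scores are computed on that subset.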
Detailed Decile Analysis
Players sorted by predicted scoring probability, split into 10 equal groups.
| Decile | Predicted | Actual |
|---|---|---|
| 1 (Bottom) | 8.1% | 10.7% |
| 2 | 10.7% | 11.5% |
| 3 | 12.2% | 11.0% |
| 4 | 13.1% | 8.9% |
| 5 | 13.9% | 11.7% |
| 6 | 15.2% | 11.7% |
| 7 | 16.4% | 12.2% |
| 8 | 17.5% | 13.0% |
| 9 | 19.7% | 18.1% |
| 10 (Top) | 28.5% | 27.3% |
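A decile table like the one above can be built roughly as follows. The function name `decile_table` and the tie-handling (a plain sort) are assumptions, not the report's actual procedure.

```python
from typing import List, Tuple

def decile_table(
    probs: List[float], outcomes: List[int], n_groups: int = 10
) -> List[Tuple[float, float]]:
    """Sort by predicted probability, split into n_groups near-equal groups,
    and return (mean predicted, actual rate) per group, lowest group first."""
    pairs = sorted(zip(probs, outcomes))
    n = len(pairs)
    if n < n_groups:
        raise ValueError("need at least one observation per group")
    rows = []
    for g in range(n_groups):
        chunk = pairs[g * n // n_groups : (g + 1) * n // n_groups]
        mean_pred = sum(p for p, _ in chunk) / len(chunk)
        actual = sum(y for _, y in chunk) / len(chunk)
        rows.append((mean_pred, actual))
    return rows

# Lift from the reported actual rates: top decile 27.3%, bottom decile 10.7%
lift = 0.273 / 0.107
print(f"{lift:.2f}x")
```

The lift is the ratio of actual scoring rates in the top and bottom groups, which matches the 2.55x headline figure.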
Performance by Season
SFM beats the Naive model in both seasons, but the margin varies widely: +24.3% in 2025/26 versus +2.4% in 2024/25.
| Season | Improvement vs Naive | SFM Brier Score |
|---|---|---|
| 2025/26 | +24.3% | 0.0512 |
| 2024/25 | +2.4% | 0.0707 |