Model Validation
Model Performance Metrics and Diagnostics on Out-of-Sample Data
| Metric | Value |
|---|---|
| Better than Player Historical Avg | 17.3% |
| Top vs Bottom Decile Lift | 2.56x |
| Player Appearances | 54555 |
| Prediction Correlation | 0.152 |
Calibration: Predicted vs Actual Scoring Rates
When we predict a 30% chance of scoring, does it happen ~30% of the time? Perfect calibration follows the diagonal line.
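The calibration check described above can be sketched as follows. This is a minimal illustration, not the pipeline behind the chart: the function name `calibration_table`, the equal-width binning, and the sample data are all assumptions.

```python
from typing import List, Sequence, Tuple


def calibration_table(
    probs: Sequence[float],
    outcomes: Sequence[int],
    n_bins: int = 10,
) -> List[Tuple[float, float, int]]:
    """Return (mean predicted, observed rate, count) for each non-empty bin.

    Assumption: equal-width bins over [0, 1]; the original chart may bin
    its predictions differently.
    """
    if len(probs) != len(outcomes):
        raise ValueError("probs and outcomes must have the same length")
    # One accumulator per bin: [sum of predictions, number scored, count].
    acc = [[0.0, 0, 0] for _ in range(n_bins)]
    for p, y in zip(probs, outcomes):
        if not 0.0 <= p <= 1.0:
            raise ValueError(f"probability out of range: {p}")
        idx = min(int(p * n_bins), n_bins - 1)  # p == 1.0 goes in the last bin
        acc[idx][0] += p
        acc[idx][1] += y
        acc[idx][2] += 1
    return [(s / n, k / n, n) for s, k, n in acc if n > 0]


# Hypothetical sample: four appearances, one of which produced a goal.
for mean_pred, observed, count in calibration_table(
    [0.12, 0.14, 0.31, 0.33], [0, 0, 1, 0]
):
    print(f"predicted {mean_pred:.2f}  observed {observed:.2f}  n={count}")
```

For a well-calibrated model the first two values in each row are close, which corresponds to points lying near the diagonal.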
Ranking Performance: Scoring Rate by Prediction Decile
Players ranked in the top decile by our predictions score 26.9% of the time, compared with 10.5% for the bottom decile: a 2.56x lift.
Brier Score Comparison
The Brier score is the mean squared difference between the predicted probability and the actual outcome (1 if the player scored, 0 otherwise). Lower is better.
| Model | Brier Score | SFM Improvement over Model |
|---|---|---|
| SFM | 0.058 | - |
| Naive | 0.0702 | +17.3% |
Naive: a Poisson model that uses each player's historical scoring rate from the training data.
The comparison is based on the 32724 player appearances that have training history.
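The comparison above can be sketched in a few lines. This is a hedged illustration rather than the actual evaluation code: the function names are hypothetical, and two details are assumptions because the source does not state them. First, the historical scoring rate is taken to be goals per appearance. Second, the Poisson baseline is turned into a scoring probability as P(at least one goal) = 1 - exp(-rate).

```python
import math
from typing import Sequence


def naive_scoring_probability(goals: int, appearances: int) -> float:
    """Poisson baseline: probability of at least one goal in an appearance.

    Assumption: rate = historical goals per appearance in the training data.
    """
    if appearances <= 0:
        raise ValueError("player needs at least one training appearance")
    rate = goals / appearances
    return 1.0 - math.exp(-rate)


def brier_score(probs: Sequence[float], outcomes: Sequence[int]) -> float:
    """Mean squared error between predicted probabilities and 0/1 outcomes."""
    if len(probs) != len(outcomes) or not probs:
        raise ValueError("need equal-length, non-empty inputs")
    return sum((p - y) ** 2 for p, y in zip(probs, outcomes)) / len(probs)


def relative_improvement(model_score: float, baseline_score: float) -> float:
    """Fraction by which the model's Brier score undercuts the baseline's."""
    return (baseline_score - model_score) / baseline_score


# Using the rounded scores from the table (SFM 0.058, Naive 0.0702).
print(f"{relative_improvement(0.058, 0.0702):.1%}")
```

With the rounded table values this prints about 17.4%; the reported 17.3% is presumably computed from unrounded scores. Note that the improvement is measured relative to the Naive score, not the SFM score.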
Performance by League
SFM outperforms the Naive model in all five major European leagues and in the Champions League.
| League | Improvement vs Naive | SFM Brier Score |
|---|---|---|
| Premier League | +12.4% | 0.0587 |
| La Liga | +22.2% | 0.0555 |
| Serie A | +18.9% | 0.0537 |
| Bundesliga | +13.5% | 0.0619 |
| Ligue 1 | +18.9% | 0.0612 |
| Champions League | +23.2% | 0.0588 |
Detailed Decile Analysis
Player appearances are sorted by predicted scoring probability and split into 10 equal-sized groups (deciles).
| Decile | Predicted Scoring Rate | Actual Scoring Rate |
|---|---|---|
| 1 (Bottom) | 8.1% | 10.5% |
| 2 | 10.7% | 11.3% |
| 3 | 12.2% | 10.8% |
| 4 | 13.1% | 9.3% |
| 5 | 14.0% | 11.3% |
| 6 | 15.2% | 11.5% |
| 7 | 16.5% | 11.8% |
| 8 | 17.6% | 12.8% |
| 9 | 19.9% | 17.6% |
| 10 (Top) | 28.6% | 26.9% |
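A decile table like the one above can be built with a short routine. The sketch below is illustrative only: the names `decile_table` and `top_bottom_lift` are hypothetical, and the way ties and uneven group sizes are handled is an assumption, not the method used to produce the published figures.

```python
from typing import List, Sequence, Tuple

Row = Tuple[int, float, float]  # (group number, mean predicted, actual rate)


def decile_table(
    probs: Sequence[float],
    outcomes: Sequence[int],
    n_groups: int = 10,
) -> List[Row]:
    """Sort by predicted probability and summarise equal-sized groups."""
    if len(probs) != len(outcomes):
        raise ValueError("probs and outcomes must have the same length")
    pairs = sorted(zip(probs, outcomes), key=lambda pair: pair[0])
    n = len(pairs)
    rows: List[Row] = []
    for g in range(n_groups):
        # Integer arithmetic keeps group sizes within one item of each other.
        chunk = pairs[g * n // n_groups:(g + 1) * n // n_groups]
        if not chunk:
            continue
        mean_pred = sum(p for p, _ in chunk) / len(chunk)
        actual = sum(y for _, y in chunk) / len(chunk)
        rows.append((g + 1, mean_pred, actual))
    return rows


def top_bottom_lift(rows: Sequence[Row]) -> float:
    """Actual rate of the top group divided by that of the bottom group."""
    bottom, top = rows[0][2], rows[-1][2]
    if bottom == 0:
        raise ValueError("bottom group has no positive outcomes")
    return top / bottom


# Check against the published bottom and top rows (10.5% and 26.9% actual).
published = [(1, 0.081, 0.105), (10, 0.286, 0.269)]
print(round(top_bottom_lift(published), 2))  # 2.56
```

Comparing the predicted and actual columns row by row gives the same information as the calibration chart, at decile resolution.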
Performance by Season
SFM outperforms the Naive model in both seasons, although the margin varies widely: +24.2% in 2025/26 versus +2.4% in 2024/25.
| Season | Improvement vs Naive | SFM Brier Score |
|---|---|---|
| 2025/26 | +24.2% | 0.0514 |
| 2024/25 | +2.4% | 0.0706 |