On the other hand, it would be naive to assume that we should ignore the first six weeks of the season and continue to project Manning as a 2.38 touchdown per game player for the rest of the year. The question becomes, how much do we base projection over the final 10 games on his preseason projection and how much do we base it on his 2013 results? In Part I, after four games, a regression model produced a projection of 2.56 touchdowns per game the rest of the year. But the problem with a regression analysis is that Manning is an extreme outlier among NFL quarterbacks; to project Manning, it would be best if we could limit ourselves to just quarterbacks named Manning Peyton Manning.
Before continuing, I want to give a special thanks to Danny Tuccitto, without whom this article wouldn’t be possible. Danny provided this great link and also spent a lot of time walking me through the process. To the extent I’ve mucked it up here, you should blame the student, not the teacher. But after walking through some models online, I realized that the best explanation about how to use Bayes Theorem for these purposes was on a sweet site called FootballPerspective.com. And the smartest person on that website had already laid out the blueprint.
In the comments to one of his great posts, Neil explained that we can calculate Manning’s odds using Bayes Theorem if we know four things:
His Bayesian prior mean (i.e., his historical average):
His Bayesian prior variance (the variance surrounding his historical average):
His observed mean:
His observed variance:
Let’s go through each of these:
1) Manning’s Bayesian prior mean: this is simply what we expected out of Manning before the season. I will use 2.38, since Footballguys is the gold standard of football projections in my admittedly biased opinion. But you can use any number you like, as I’ll provide the full formula at the end.
2) Manning’s Bayes prior variance: how much variance should we expect between what Footballguys projected and Manning’s actual results? Well, I have data on Footballguys’ projected number of touchdowns for Manning in every season since 2000. After converting it to a per-game basis, we get the following:
Year | Proj TD | Act TD | Proj TD/G | Act TD/G | Diff TD/G |
---|---|---|---|---|---|
2000 | 28 | 33 | 1.75 | 2.0625 | 0.3125 |
2001 | 29 | 26 | 1.8125 | 1.625 | -0.1875 |
2002 | 31 | 27 | 1.9375 | 1.6875 | -0.25 |
2003 | 29 | 29 | 1.8125 | 1.8125 | 0 |
2004 | 30 | 49 | 1.875 | 3.0625 | 1.1875 |
2005 | 41 | 28 | 2.5625 | 1.75 | -0.8125 |
2006 | 33 | 31 | 2.0625 | 1.9375 | -0.125 |
2007 | 33 | 31 | 2.0625 | 1.9375 | -0.125 |
2008 | 30 | 27 | 1.875 | 1.6875 | -0.1875 |
2009 | 28 | 33 | 1.75 | 2.0625 | 0.3125 |
2010 | 30 | 33 | 1.875 | 2.0625 | 0.1875 |
2012 | 27 | 37 | 1.6875 | 2.3125 | 0.625 |
Variance | 0.0541 | 0.1527 |
What I think we want to do is measure the variance of the errors, which is the difference between the variance of the actual TD/G column (0.1527), and the variance of the projected TD/G column (0.0541); the difference of those variances is 0.0986, which is Manning’s Bayes prior variance. I say “I think” because there are other ways one could come up with a prior variance. For example, you could take the variance of the data in the differential column, which is 0.2529. Or, you could use simply his yearly variance, which is 0.1527. I’ll invite the readers to comment as to which number (0.0986, 0.2529, or 0.1527) they would prefer to use.
3) Observed mean: 3.667. That one’s pretty easy, as Manning has thrown 22 touchdowns in 6 games.
4) Observed variance of current mean. This one’s a bit trickier. But what I think we need to do here is calculate the number of passing touchdowns per game Manning averaged in the first six games of each season since 2000, along with his average over the rest of the season (in eight, nine, or ten games). Then I took the difference of the variances of each column, as we did in step two.
Year | TD/G Thru 6 | ROY G | TD/G ROY | Diff |
---|---|---|---|---|
2000 | 2 | 10 | 2.1 | -0.1 |
2001 | 2 | 10 | 1.4 | 0.6 |
2002 | 1.67 | 10 | 1.7 | -0.03 |
2003 | 2 | 10 | 1.7 | 0.3 |
2004 | 2.83 | 9 | 3.56 | -0.72 |
2005 | 1.5 | 8 | 2.38 | -0.88 |
2006 | 2 | 10 | 1.9 | 0.1 |
2007 | 1.83 | 9 | 2.22 | -0.39 |
2008 | 1.33 | 9 | 2 | -0.67 |
2009 | 2.5 | 8 | 2.25 | 0.25 |
2010 | 2.17 | 10 | 2 | 0.17 |
2012 | 2.33 | 10 | 2.3 | 0.03 |
Variance | 0.1740 | 0.2849 |
Manning’s variance over the rest of the season is 0.2849 TDs/G, while his variance through six games is 0.1740; the differential there is 0.1109, which is the variance of our current mean. Although again, reasonable minds could differ — you could take simply the variance of the differential (which is 0.2088), or just his variance through six games (0.1740) or perhaps you’d prefer to compare his variance through six games to his variance over the course of the season. This is intended to be an inclusive post, so add your thoughts in the comments.
Once you have your number for these four variables, then you substitute those numbers into this equation:
Result_mean = [(prior_mean/prior_variance)+(observed_mean/observed_variance)]/[(1/prior_variance)+(1/observed_variance)]
Or, using our numbers:
[(2.38 /0.0986) + (3.667 / 0.1109)] / [(1/0.0986) + (1/0.1109]
which becomes
[24.13 + 33.07] / (19.16) = 2.99
In other words, this formula projects Manning to average 2.99 passing touchdowns per game the rest of the year…. which would have him break Brady’s record by two touchdowns. [2]If you wanted to just use 0.1527 — Manning’s yearly variance — and 0.1740 — his typical variance through six games — you end up with 2.985, so no practical difference … Continue reading We can also calculate the standard deviation around that average, which is just the square root of 1/ 19.16, which is 0.23. That’s a pretty decent-sized range, of course, but this says that we can be reasonably confident that the best going forward projection for Manning would be between 2.76 and 3.21. Manning needs to average 2.8 passing touchdowns per game to tie the record, which would means he only needs to do better than -0.83 standard deviations below average. [3]If you permit the making of certain assumptions, that would imply a roughly 80% chance of breaking the record.
There are lots of things that Bayes theorem doesn’t consider, of course. Strength of schedule is one, and Manning’s schedule is about to get a lot harder. Risk of injury is another, and Manning has been fully healthy this year (although he did miss parts of the Eagles game once the score got out of hand). There are lots of subjective factors that we have to consider, but the main issue at hand is figuring out where on the spectrum of 2.38 to 3.667 we should project Manning. This basically says that the two are about equivalent in value.
If you projected Manning to average 2.00 passing touchdowns per game in the preseason, this model would bump the project to 27.8 touchdowns the rest of the season. If you were pretty clairvoyant — say, you projected him at 45 touchdowns this year, a 2.8 TD/G average — this would say you should revise your projections to 32.1 touchdowns over his final ten games. Pre-season projections still matter, but you’re going to have to amp up your projections based on how deadly efficient Manning has been to date.
References
↑1 | That was after removing week 17 of the ’04, ’05, ’07, ’08, and ’09 seasons, and week 16 of the ’05 and ’09 seasons, when Manning left early. Why did I pick the last ten years? I don’t know, but he won his first MVP in ’03, so that seemed like a useful starting point. |
---|---|
↑2 | If you wanted to just use 0.1527 — Manning’s yearly variance — and 0.1740 — his typical variance through six games — you end up with 2.985, so no practical difference there. |
↑3 | If you permit the making of certain assumptions, that would imply a roughly 80% chance of breaking the record. |