But in fact luck is a huge factor in the results any pitcher produces. And I don't mean W-L--I mean ERA, WHIP, and other stats that fans often think of as reliable measures of pitcher skill.
If an unusually large number of bloopers fall in for hits, or if an unusually large number of flyballs drift out, or if the team defense is bad, or if the bullpen allows an unusually high number of inherited runners to score... these are all factors outside a pitcher's control that can affect a pitcher's ERA. And the effects aren't just incremental differences--relatively common variations from one season to a next can raise or drop a pitcher's ERA by a run or more.
The "luck stats" that I look at most closely are:
1. Home runs per flyballs (HR/FB%): Research shows that pitchers cannot consistently control the percentage of flyballs that become home runs. This one takes a little bit for people to accept, but it's true. Any pitcher with a rate much above or below 11% is due for a correction.
2. Strand rate (LOB%): The percentage of hitters who reach base that are stranded. A typical LOB% is around 70%, and although a better pitcher can sustain a better LOB rate than that (a great pitcher could consistently reach 74-75%), if you have a pitcher whose LOB rate is much out of step with the rest of his rates, look for regression.
3. BABIP: batting average on balls in play off the pitcher. Research shows that pitchers have very little control over the rate at which a ball put in play is converted into an out. Groundballs are more often hits than flyballs are, though groundballs never become homers, so groundball pitchers are better, all other things equal. BABIP is mainly a function of defense and raw luck. A typical BABIP is around .290, though again groundball pitchers may be a little higher and flyball pitchers a little lower.
There are a couple good stats that attempt to measure a pitcher's true skill-based performance, with the luck factors removed. Fielding independent pitching (FIP), which is listed on the player pages at Fangraphs.com, is the best known and is based on walks, strikeouts, and home runs, but it assumes that HR/FB is a skill and that all pitchers are equally good at stranding runners. The Hardball Times' xFIP is an improvement over FIP because it substitutes a league average HR/FB rate, eliminating that element of luck from the equation. The best (and most complicated) of all is, tRA*, which you can find at StatCorner.com. That stat (which I assume is pronounced "T-R-A-star," but I don't really know) uses rates for groundballs, flyballs, line drives, hit-by-pitches, walks, strikeouts, and homers (a pretty complete list of the events in a pitcher's control) to project runs allowed and regresses them to the league average, which helps erase sample size randomness. FIP and xFIP are scaled to look like ERA, while tRA* is scaled to RA (run average, including earned and unearned).
For a fuller explication of these points, check out this indispensable post from USS Mariner, which I still must admit taught me most of what I know about evaluating pitching.
So with all that context out of the way, let's look at the Washington Nationals pitchers and what these stats tell us. Who's performances in 2008 were most elevated (or depressed) by luck?
The chart below gives you a bunch of stats from 2008. It lists each pitcher's actual ERA, HR/FB rate, strand rate, BABIP, and tRA*. Then, I created a metric called "luck," which attempts to roughly quantify the degree to which luck influenced each pitcher's ERA. Luck factor is simply tRA* minus 0.40 (which is the typical difference between RA and ERA) minus actual ERA. (That 0.40 number is a pretty blunt instrument, so don't try to translate luck factor into anything too precise--this just gives you a relative sense of each pitcher's good or bad fortune.)
Here are the Nationals pitchers from most to least lucky in 2008:
--We should expect Lannan and Olsen, who are tentatively sketched in as our #1 and #2 starters, to see a rise in ERA. In Lannan's case, that BABIP will certainly rise, probably by 30 or more points given his strong groundball rates (54.2% in 2008). Olsen's BABIP is even less sustainable (his career number is .300), and his strand rate will fall too. Of course, their good luck could continue, or they could counter the expected regression in good luck with actual better pitching, but we shouldn't count on it.
--We should write off Hinckley's and Estrada's ERAs on sample size. We just can't make much if any conclusions about them from those numbers. Martis too.
--Hill, Bergmann, and Balester were all significantly victimized by tough luck. They were all hurt pretty badly by low strand rates, and Hill's BABIP is just silly.
--Steven Shell was about as lucky as a pitcher can be.