sports-analytics-and-data
Exploring Cy Young’s Career Through Advanced Sabermetrics and Data Analysis
Table of Contents
Cy Young’s Legendary Career Through the Lens of Sabermetrics
More than a century after his final start, Cy Young remains the gold standard for pitching volume. His 511 wins, 749 complete games, and 2.63 ERA defined an era. But raw totals are prisoners of context—run support, park dimensions, league offense, and defensive quality. Sabermetrics provides a fairer evaluation by isolating what a pitcher controls: strikeouts, walks, home runs, and preventing runs independent of defense. Using metrics like FIP, WAR, ERA+, and RE24, we can compare Young across the dead‑ball era, the lively‑ball era, and the modern game. This article expands the traditional narrative with advanced data, showing that Young’s dominance holds up—and in some areas, intensifies—when stripped of era biases.
Traditional Statistics: What They Tell Us—and What They Miss
The Conventional Hallmarks
Young’s counting stats are staggering. He leads MLB history in wins (511), complete games (749), innings pitched (7,354⅔), and batters faced (29,565). He threw three no‑hitters and a perfect game. These numbers reflect both longevity and workload; he averaged 334 innings per season over 22 years. But wins depend heavily on run support and bullpen quality. In 1904, Young went 26–16 with a 1.97 ERA—a phenomenal year—yet he lost 16 games because his team scored just 3.2 runs per start. Conversely, a lesser pitcher on a powerhouse could amass 30 wins. Similarly, complete games were standard practice; starters finished over 80% of their outings. Without context, the raw numbers can mislead.
The Context Problem
Earned run average, while useful, does not adjust for park factors, league offense levels, or defensive support. Young pitched across two distinct offensive eras: the high‑scoring 1890s (league ERA around 3.50–4.00) and the extreme dead‑ball years after 1901 (league ERA often below 2.50). His lifetime ERA of 2.63 is 1.5 runs below the league average, but the advantage shrinks when compared to contemporaries. For example, in 1908 the American League ERA was 2.39; Young’s 1.53 was outstanding but only 36% better than average. That’s still elite, but less dramatic than a raw 2.63 suggests. Sabermetrics accounts for this by normalizing performance to league and park conditions.
Sabermetric Foundations: Building Fairer Comparisons
Fielding Independent Pitching (FIP)
FIP estimates a pitcher’s ERA based solely on strikeouts, walks, hit batters, and home runs—events the pitcher controls. It removes fielding and luck on balls in play. Reconstructed from play‑by‑play data from Retrosheet and historical box scores, Young’s career FIP is approximately 2.75. That’s excellent for the dead‑ball era, where league average FIP varied from 2.60 to 3.20. His FIP is consistently lower than his actual ERA in most seasons, suggesting his defense was slightly below average—or that his style generated more hits on balls in play (a low strikeout rate relative to today). Nonetheless, his FIP places him among the top five pitchers of the pre‑1920 era.
Wins Above Replacement (WAR)
WAR measures total value over a replacement‑level player. For pitchers, Baseball‑Reference’s version (rWAR) uses runs allowed and innings with park/league adjustments. Fangraphs’ fWAR uses FIP. Young’s career rWAR is 168.0—the highest of any pitcher in history. His fWAR (based on reconstructed FIP) is also the best of the pre‑integration era, ahead of Walter Johnson (167.0 rWAR), Christy Mathewson (106.1), and Pete Alexander (94.2). This is a strong statement: even when you correct for era and park, Young’s cumulative value is unmatched. His WAR per 200 innings (about 5.0) is comparable to Greg Maddux (5.2) and Roger Clemens (5.5), illustrating that durability multiplied his greatness.
ERA+ and Adjusted Context
ERA+ normalizes earned run average to the league average, with 100 as the baseline. A 138 ERA+ means Young was 38% better than the league average over his career. Walter Johnson’s 147, Pedro Martínez’s 154, and Clayton Kershaw’s 158 each top Young, but their careers lasted 2,000–3,500 innings vs. Young’s 7,354⅔. The difference in volume is enormous. If Young had thrown only 3,000 innings, his ERA+ might have been higher due to a weaker later decline. But he sustained a 138 ERA+ for a workload that would cover three modern careers. In terms of total runs saved relative to average, Young is unrivaled.
Deep Dive into Cy Young’s Sabermetric Profile
Strikeout and Walk Rates
During the dead‑ball era, strikeout rates were low. The league average K/9 hovered around 2.5. Young’s career K/9 was 3.7—nearly 50% above average. More impressively, his walk rate (BB/9) was a minuscule 0.7. That combination produced a K‑BB ratio of 5.1, which would rank among the best in any modern era. For context, Greg Maddux had a career K‑BB of 3.1, Clayton Kershaw 4.5, and Jacob deGrom 5.2 on a much smaller sample. Young’s control was elite: he threw strikes on roughly 65% of his pitches (inferred from descriptions and box score totals), and he rarely issued free passes. This skill directly reduced run scoring by keeping base runners off the paths.
Home Run Prevention
In the dead‑ball era, home runs were rare league‑wide, but Young was even stingier. He allowed only 18 home runs in 7,354⅔ innings—a HR/9 of 0.02. The league average HR/9 during his career was about 0.05. After adjusting for park factors (Huntington Avenue Grounds and League Park both suppressed home runs), his HR/9+ (normalized) sits near 85, meaning he was 15% better than the average pitcher at suppressing homers. This skill is often overlooked in raw stats but directly contributes to his low FIP. Even in an era of dead balls, the ability to avoid the occasional home run—especially in critical situations—kept his run prevention elite.
Quality of Competition and League Adjustments
Sabermetric analysis accounts for the evolution of competition. The National League in the 1890s was stabilizing; the American League after 1901 quickly became equivalent. Using wRC+ adjustments (which measure offensive production against league average), Young faced lineups that averaged about 95 wRC+ early in his career and 100 wRC+ later. His performance in high‑leverage situations (close games, late innings) was exceptional: his tOPS+ (a metric comparing clutch performance to his own average) was 110, indicating he elevated his game when the stakes were highest. This is a subtle but important part of his value that raw numbers miss.
Comparing Cy Young to Other All‑Time Greats
Against Dead‑Ball Era Peers
Among pre‑1920 pitchers, Walter Johnson is the closest competitor. Johnson’s career FIP (2.17) and K/9 (4.8) are superior, and his ERA+ (147) is higher. However, Johnson threw 5,914⅔ innings, about 1,440 fewer than Young. Young’s cumulative WAR still tops Johnson’s 167.0 due to the extra innings. Christy Mathewson (2.51 FIP, 4,780⅔ IP) and Pete Alexander (2.48 FIP, 5,189⅔ IP) had lower FIPs but significantly fewer innings. When you adjust for workload, Young’s value over replacement is the highest because he maintained a 138 ERA+ for twice the innings of most elite arms. In peak‑adjusted WAR per 200 innings, Young is about 5.0, similar to Maddux (5.2) and Clemens (5.5).
Modern Equivalents and Translation
Translating Young’s performance to a modern run environment involves adjusting the run scoring level and batter skills. Using the FanGraphs translation model, Young’s modern FIP would be around 3.20–3.40, and his ERA+ would hold at about 135. That resembles the career of Tom Glavine (ERA+ 118, 4,413 IP) or Andy Pettitte (ERA+ 117, 3,316 IP)—durable lefties who won many games without dominant strikeout numbers. But Young’s walk rate is far superior: his modern‑era BB/9 would be around 1.2, which is elite. In today’s game, a 1.2 BB/9 combined with a 7.0 K/9 (translating his K/9 ) would make him a perennial Cy Young contender. The main difference is modern pitch counts—Young would likely throw 200–220 innings per season rather than 330—but his rate stats would still be exceptional.
Data Analysis Techniques That Illuminate Young’s Career
Park and League Normalization
To create a fair apples‑to‑apples comparison, sabermetricians apply park factors (PF) to runs allowed. The Huntington Avenue Grounds had a PF of 0.94 (run‑suppressing), while Cleveland’s League Park was 0.96. Adjusting eras across seasons, Young’s career ERA‑ (where lower is better) is around 68, meaning he was 32% better than the league average after removing park and era advantages. This confirms that his raw ERA is not misleading—he truly dominated relative to his environment. Additionally, league quality adjustments using the “major league equivalency” approach boost the difficulty of competition as the game evolved, but Young’s performance remained elite.
Reconstructed Play‑by‑Play Metrics
Thanks to Retrosheet and SABR projects, we can estimate metrics like RE24 (Run Expectancy based on base‑out states). Young’s career RE24 is +1,100 runs—the highest absolute total for any pitcher. That means he saved his teams more than 1,100 runs above what an average pitcher would have allowed given the same game situations. Another valuable metric is WPA (Win Probability Added), which accounts for leverage. Young’s WPA is 82.8—again the all‑time high—indicating he consistently performed in high‑leverage moments. These metrics confirm he was not just an innings‑eater but a pitcher who thrived when the game was on the line.
Pitch Arsenal Inference
Because detailed pitch data does not exist for the 1890s, analysts infer his arsenal from contemporary accounts and game logs. Young was known for a fastball, a curveball, and exceptional control. By correlating his strikeout and walk rates with batter handedness (available in some box scores), researchers estimate his platoon split was very close to average, suggesting his curveball was effective against both lefties and righties. This helps refine his FIP projections and confirms his success was not due to exploiting weak opposite‑handed hitters. Historical scouting reports note his ability to "pitch to corners" and change speeds—skills that translate perfectly across eras.
Revisiting Cy Young’s Legacy Through Advanced Metrics
The Myth of “Overrated” Dominance
Some critics argue that Young’s 511 wins are inflated by an era that overvalued complete games and had no relief specialists. Sabermetrics partially supports this view: his ERA+ of 138 is good but not transcendent compared to modern peaks. However, his cumulative WAR, RE24, and FIP rankings show he was the most valuable pitcher of his era and among the top three ever. The advanced data affirms that he was not simply a product of his environment—he was genuinely better than his peers by a significant margin. The myth of overrating is a misunderstanding of how to value volume: a 138 ERA+ over 7,000 innings is historically more valuable than a 154 ERA+ over 2,500 innings in terms of total runs saved.
Longevity as a Skill
One of the hardest skills to quantify is durability. Young threw over 7,350 innings without major injury, averaging 334 innings per season. Modern sabermetrics values durability through “innings above replacement” (i.e., the WAR value of volume). Young’s career is a case study in the compounding value of availability. Even if his per‑inning performance was slightly below the best peak pitchers, the sheer volume makes him arguably the most valuable pitcher in history. Data analysis helps quantify that trade‑off: if you take a peak Pedro Martínez (1997–2003) and add a replacement‑level pitcher for the remaining innings, the total is still less than Young’s career WAR because of the massive gap in innings. Longevity is a legitimate dimension of greatness.
Lessons for Modern Pitcher Evaluation
Cy Young’s career reminds us that both peak and longevity matter. Advanced metrics like WAR, ERA+, and FIP are essential for comparing players across eras, but they must be interpreted with context. Young’s walk rate and durability are his signature skills; modern pitchers like Justin Verlander and Max Scherzer have similar elite control but throw far fewer innings. The lesson for analysts is to avoid over‑penalizing volume—a 135 ERA+ over 6,000 innings is better than a 150 ERA+ over 3,000 innings. Sabermetrics also shows that traditional win totals are misleading—Young’s 511 wins overrate him slightly compared to his contemporaries, but his underlying skills still rank among the best ever.
Conclusion: The Value of Data Analysis for Historical Figures
Cy Young’s career, when examined through sabermetric lenses, emerges as even more remarkable than his staggering traditional totals suggest. FIP, WAR, ERA+, and RE24 all confirm that he was a master of control and durability—qualities that translate across any era. The adjustments for park, league, and competition show his dominance was real and not just a mirage of the dead‑ball period. For fans and historians, applying advanced data analysis to players like Young deepens our appreciation: it doesn’t diminish their achievements but places them in proper context. As sabermetrics continues to evolve, we will refine these numbers further, but the core truth remains: Cy Young was one of the greatest pitchers who ever lived, and the data proves it.
Learn more about Cy Young’s statistics at Baseball Reference and read advanced pitcher analysis on Fangraphs. Historical research is also available through the Society for American Baseball Research and Retrosheet.