Wins above replacement
Wins above replacement or wins above replacement player, commonly abbreviated to WAR or WARP, is a non-standardized sabermetric baseball statistic developed to sum up "a player's total contributions to his team".[1] A player's WAR value is claimed to be the number of additional wins his team has achieved above the number of expected team wins if that player were substituted with a replacement-level player: a player who may be added to the team for minimal cost and effort.[2] Individual WAR values are calculated from the number and success rate of on-field actions by a player (in batting, baserunning, fielding, and pitching), with higher values reflecting larger contributions to a team's success.[2] WAR value also depends on what position a player plays, with more value going to key defensive positions like catcher and shortstop than positions with less defensive importance such as first base.[2] A high WAR value built up by a player reflects successful performance, a large quantity of playing time, or both.
Overview
The basis for a WAR value is the estimated number of runs contributed by a player through offensive actions such as batting and base running, and runs denied to opposition teams by the player through defensive actions like fielding and pitching. Statistics such as weighted on-base average (wOBA), ultimate zone rating (UZR), ultimate base running (UBR), and defense independent pitching statistics (DIPS) measure the effectiveness of a player at creating and saving runs for their team, on a per-plate appearance or per-inning basis. These statistics can be multiplied by the playing time of a player to give an estimate of the number of offensive and defensive runs contributed to their team.
Additional runs contributed to a team lead to additional wins, with 10 runs estimated to be equal to roughly one win.[3] Therefore, a 1.0 WAR value for a player signifies a contribution of roughly 10 more runs than a replacement-level player, over a specified period of time. A replacement-level player is defined by FanGraphs as contributing 17.5 runs fewer than a player of league-average performance, over 600 plate appearances.[4] Therefore, a 1.0 WAR player has contributed an estimated −7.5 runs relative to average over the same number of plate appearances, a 2.0 WAR player has contributed +2.5 runs, and a 5.0 WAR player has contributed +32.5 runs.
For an individual player, WAR values may be calculated for single seasons or parts of seasons, for several seasons, or across the whole career of the player. Collective WAR values for multiple players may also be estimated, for example to determine the contribution a team receives from its outfielders, its relief pitchers or from specific positions such as catcher.[5][6] It is also possible to extrapolate a future WAR value from a player's past performance data.[7]
Calculation
No clearly established formula exists for WAR. Sources that provide the statistic calculate it differently. These include Baseball Prospectus, Baseball-Reference, and FanGraphs. All of these sources publish the method they use to calculate WAR, and all use similar basic principles to do so.[8] The version published by Baseball Prospectus is named WARP,[9] that by Baseball-Reference is named bWAR or rWAR ("r" derives from Rally or RallyMonkey, a nickname for Sean Smith, who implemented that site's version of the statistic)[10] and that for FanGraphs is named fWAR.[11] Compared to rWAR, the calculation of fWAR places greater emphasis on peripheral statistics.[2]
WAR values are scaled equally for pitchers and batters; that is, pitchers and position players will have roughly the same WAR if their contribution to their team is deemed similar.
However, the values are calculated differently for pitchers and position players: position players are evaluated using statistics for fielding, base running, and hitting, while pitchers are evaluated using statistics related to the opposing batters' hits, walks, and strikeouts in FanGraphs' version, and using runs allowed per 9 innings with a team defense adjustment in Baseball-Reference's version. Because the independent WAR frameworks are calculated differently, they do not have the same scale[12] and cannot be used interchangeably in an analytical context.
Position players
Baseball-Reference
Baseball-Reference uses six components to calculate WAR for position players: batting runs (Rbat), baserunning runs (Rbaser), runs added or lost due to grounding into double plays in double play situations (Rdp), fielding runs (Rfield), positional adjustment runs (Rpos), and replacement level runs (Rrep). The first five factors are compared to league average, so a value of 0 represents an average player.[13] The runs-above-average term may be calculated from the first five factors (Rbat + Rbaser + Rdp + Rfield + Rpos), and the replacement-level term from Rrep.[13] After these components are computed and summed, runs are converted to wins by dividing by a runs-per-win value determined using Pythagenpat.
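The Baseball-Reference steps above can be sketched as follows. This is a minimal illustration, not Baseball-Reference's implementation: the class and function names are hypothetical, the component values are invented for illustration rather than taken from any real player, and a fixed 10 runs per win (the rough figure from the overview) stands in for the Pythagenpat-derived value.

```python
from dataclasses import dataclass


@dataclass
class PositionPlayerRuns:
    """Run components, named after the Baseball-Reference labels in the text."""
    rbat: float    # batting runs, relative to league average
    rbaser: float  # baserunning runs, relative to league average
    rdp: float     # double-play runs, relative to league average
    rfield: float  # fielding runs, relative to league average
    rpos: float    # positional adjustment runs
    rrep: float    # replacement level runs


def runs_above_average(c: PositionPlayerRuns) -> float:
    """Sum the five components that are measured against league average."""
    return c.rbat + c.rbaser + c.rdp + c.rfield + c.rpos


def wins_above_replacement(c: PositionPlayerRuns, runs_per_win: float = 10.0) -> float:
    """Add the replacement-level runs, then convert runs to wins.

    Assumption: runs_per_win is a fixed constant here; the text says the
    real value is determined using Pythagenpat, which is not modeled.
    """
    return (runs_above_average(c) + c.rrep) / runs_per_win


# Hypothetical component values, for illustration only.
example = PositionPlayerRuns(rbat=20.0, rbaser=2.0, rdp=1.0,
                             rfield=5.0, rpos=-3.0, rrep=20.0)
print(runs_above_average(example))      # 25.0
print(wins_above_replacement(example))  # 4.5
```

Keeping the above-average sum separate from the replacement-level runs mirrors the two-term split described in the text: a league-average player scores 0 on the first term and still earns positive WAR from the second.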
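FanGraphs
Similarly to bWAR, FanGraphs uses six components to calculate fWAR for position players. These components are all on the scale of runs, and are then converted to wins based on a runs-per-win value that changes each season based on the run environment. The formula used is[15]
$$\text{fWAR} = \frac{\text{Batting Runs} + \text{Base Running Runs} + \text{Fielding Runs} + \text{Positional Adjustment} + \text{League Adjustment} + \text{Replacement Runs}}{\text{Runs Per Win}}$$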
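The run figures quoted in the overview can be checked with a short sketch. This is not FanGraphs' published calculation; it only combines the two numbers cited earlier (roughly 10 runs per win, and a replacement level 17.5 runs below average per 600 plate appearances). The function names are hypothetical, and scaling the replacement gap linearly with plate appearances is an assumption made for the illustration.

```python
RUNS_PER_WIN = 10.0                # rough figure cited in the overview
REPLACEMENT_GAP_PER_600_PA = 17.5  # runs below average, per 600 plate appearances


def replacement_gap(plate_appearances: float = 600) -> float:
    """Runs separating an average player from a replacement-level one.

    Assumption: the gap scales linearly with plate appearances.
    """
    return REPLACEMENT_GAP_PER_600_PA * plate_appearances / 600


def runs_above_average_for_war(war: float, plate_appearances: float = 600) -> float:
    """Runs relative to league average implied by a given WAR value."""
    return war * RUNS_PER_WIN - replacement_gap(plate_appearances)


def war_from_runs_above_average(raa: float, plate_appearances: float = 600) -> float:
    """Inverse conversion: runs relative to average back to WAR."""
    return (raa + replacement_gap(plate_appearances)) / RUNS_PER_WIN


for war in (1.0, 2.0, 5.0):
    print(war, runs_above_average_for_war(war))
# 1.0 -7.5
# 2.0 2.5
# 5.0 32.5
```

The output matches the −7.5, +2.5, and +32.5 run figures given in the overview for 1.0, 2.0, and 5.0 WAR players over 600 plate appearances.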
Pitchers
Baseball-Reference
Baseball-Reference uses two components to calculate WAR for pitchers: runs allowed (both earned and unearned) and innings pitched. These statistics are then used in a number of further calculations to better contextualize the numbers.[19]
FanGraphs
Rather than focus on actual runs allowed, FanGraphs uses fielding independent pitching (FIP) as its main component to calculate WAR, as it feels FIP better reflects the contributions of the pitcher.[20]
Analysis
In 2009, Dave Cameron stated that fWAR does an "impressive job of projecting wins and losses".[21] He found that a team's projected record based on fWAR and that team's actual record have a strong correlation (correlation coefficient of 0.83), and that every team was within two standard deviations (σ=6.4 wins).[21]
In 2012, Glenn DuPaul conducted a regression analysis comparing the cumulative rWAR of five randomly selected teams per season (from 1996 to 2011) against those teams' realized win totals for those seasons. He found that the two were highly correlated, with a correlation coefficient of 0.91, and that 83% of the variance in wins was explained by fWAR (R² = 0.83).[22] The standard deviation was 2.91 wins. The fitted regression equation was close to the expected equation, in which a team of replacement-level players is expected to have a .320 winning percentage, or 52 wins in a 162-game season.
To test fWAR as a predictive tool, DuPaul regressed each team's realized wins for a year on the cumulative WAR of its players from the previous year.
The resultant regression had a statistically significant correlation of 0.59, meaning that 35% (the square of 0.59) of the variance in team wins could be accounted for by the cumulative fWAR of its players from the previous season.[22]
Usage
WAR is recognized as an official stat by Major League Baseball and by the Elias Sports Bureau, and ESPN publishes the Baseball-Reference version of WAR on its own statistics pages for position players and pitchers.[2]
The importance of WAR compared to typical statistical categories has been the subject of ongoing debate. For example, nearing the end of the 2012 Major League Baseball season and afterward, there was much debate about which player should win the Major League Baseball Most Valuable Player Award for the American League.[23] The two candidates considered by most writers were Miguel Cabrera, who won the Triple Crown, and Mike Trout, who led Major League Baseball in WAR.[24] The debate focused on the use of traditional baseball statistics, such as RBIs and home runs, compared with sabermetric statistics such as WAR.[23] Cabrera led the American League in batting average, home runs, and RBIs, but Trout was considered a more complete player by some.[25] Relative to the average player, Cabrera contributed an extra 53.1 runs through batting, but −8.2 through defense and −2.9 through baserunning,[26] while Trout contributed 50.1 batting runs, 13.0 defensive runs, and 12.0 baserunning runs.[27] Cabrera, the only one of the two players whose team entered the postseason, won the award in a landslide, with 22 of 28 first-place votes from the Baseball Writers' Association of America. He and Trout posted similar seasons in 2013; Cabrera again won the MVP.[28][29] Dave Cameron disagreed in a FanGraphs article.
Criticisms
Bill James states that there is a bias favoring players from earlier eras because there was greater variance in skill levels at the time, so "the best players were further from the average than they are now".[2] That is, in modern baseball, it is more difficult for a player to exceed the abilities of his peers than it was in the 1800s and the dead-ball and live-ball eras of the 1900s.[2] James's criticism originates from the evolutionary biologist Stephen Jay Gould who, in 1996, published the book Full House which argued the same point with respect to batting averages.[31] The bias mentioned by Gould and James was confirmed in a statistical study which showed that ranking lists based on WAR do in fact include too many players from the earlier eras.[32] This study challenges the stance that WAR properly adjusts for era differences.[33]
James's criticism has also stemmed from the application and usage of WAR in recent years. In the 2017 Major League Baseball season, there was debate similar to 2012 regarding who should be the recipient of the American League Most Valuable Player Award: Jose Altuve or Aaron Judge. Judge outranked Altuve in FanGraphs' calculation of WAR that season, finishing first with a WAR of 8.2, to Altuve's 7.5. Based on Baseball-Reference's calculation, Altuve had the edge, 8.3 to 8.1. However, in James's words, the usage of WAR in this particular MVP argument was "...nonsense. Aaron Judge was nowhere near as valuable as Jose Altuve…. It is not close. 
The belief that it is close is fueled by bad statistical analysis." He goes on to say that WAR "...is dead wrong because the creators of that statistic have severed the connection between performance statistics and wins, thus undermining their analysis." He also points out that Judge performed worse than Altuve in critical situations, such as the late innings of close games, and that WAR does not properly take this into account.[34] Other advanced statistics such as RE24 suggest the opposite, with Judge at 50.91 and Altuve at 38.76.[35]
Alternatives to WAR
Some sabermetricians "have been distancing themselves from the importance of single-season WAR values"[22] because some of the defensive metrics incorporated into WAR calculations have significant variability. For example, during the 2012 season, the Toronto Blue Jays employed an infield shift against some left-handed batters, such as David Ortiz or Carlos Peña, in which third baseman Brett Lawrie would be assigned to shallow right field. This resulted in a very high Defensive Runs Saved (DRS) total for Lawrie,[36] and hence a high rWAR, which uses DRS as a component.[37] Ben Jedlovec, an analyst for DRS creator Baseball Info Solutions, said that Lawrie was "making plays in places where very few third basemen are making those plays" because of the "optimal positioning by the Blue Jays".[38] Another fielding metric, Ultimate Zone Rating (UZR), uses the DRS data but excludes runs saved as a result of a shift.[38]
Jay Jaffe, a writer for Baseball Prospectus and a member of the Baseball Writers' Association of America, adapted WAR for a statistic he developed in 2004 called "Jaffe Wins Above Replacement Score," or JAWS. The metric averages a player's career WAR with their seven-year peak WAR (not necessarily consecutive years). 
The final number is then used to measure the player's worthiness of being inducted into the Baseball Hall of Fame by comparing it to the average JAWS of Hall of Fame players at that position. Baseball-Reference's explanation of JAWS says, "The stated goal is to improve the Hall of Fame's standards, or at least to maintain them rather than erode them, by admitting players who are at least as good as the average Hall of Famer at the position, using a means via which longevity isn't the sole determinant of worthiness."[39] For example, as of November 30, 2021, retired third baseman Adrián Beltré has accumulated 93.5 career WAR, and 48.7 WAR from his best seven seasons combined. Averaged together, these numbers give Beltré a JAWS of 71.1.[40]
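The JAWS averaging described above can be sketched in a few lines. The function names are hypothetical, and the per-season list is invented purely to show the peak selection; only the Beltré totals (93.5 career WAR, 48.7 peak WAR) come from the text.

```python
def jaws_from_totals(career_war: float, peak_war: float) -> float:
    """Average a player's career WAR with his seven-year peak WAR."""
    return (career_war + peak_war) / 2


def jaws(season_wars: list[float], peak_seasons: int = 7) -> float:
    """Compute JAWS from per-season WAR values.

    The peak is the sum of the best `peak_seasons` seasons, which need
    not be consecutive, so the seasons are sorted before selection.
    """
    career = sum(season_wars)
    peak = sum(sorted(season_wars, reverse=True)[:peak_seasons])
    return jaws_from_totals(career, peak)


# Totals quoted in the text for Adrián Beltré.
print(round(jaws_from_totals(93.5, 48.7), 1))  # 71.1

# Hypothetical nine-season career, for illustration only:
# career = 45.0, best seven seasons = 42.0.
print(jaws([5.0, 3.0, 8.0, 1.0, 6.0, 7.0, 4.0, 2.0, 9.0]))  # 43.5
```

Because the peak seasons are counted in both terms, a short, brilliant career and a long, steady one can reach similar scores, which is the stated point of not letting longevity alone decide worthiness.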