tournament selection e ciency: an analysis of the...
TRANSCRIPT
Tournament Selection Efficiency: An Analysis of the PGA TOUR’s
FedExCup1
Robert A. Connolly and Richard J. Rendleman, Jr.
June 27, 2012
1Robert A. Connolly is Associate Professor, Kenan-Flagler Business School, University of North Carolina,Chapel Hill. Richard J. Rendleman, Jr. is Visiting Professor, Tuck School of Business at Dartmouth andProfessor Emeritus, Kenan-Flagler Business School, University of North Carolina, Chapel Hill. The authorsthank the PGA TOUR for providing the data used in connection with this study, Pranab Sen, Nicholas Halland Dmitry Ryvkin for helpful comments on an earlier version of the paper and Ken Lovell for providing com-ments on the present version. Please address comments to Robert Connolly (email: robert [email protected];phone: (919) 962-0053) or to Richard J. Rendleman, Jr. (e-mail: richard [email protected]; phone: (919)962-3188).
Tournament Selection Efficiency: An Analysis of the PGA TOUR’s
FedExCup
Abstract
Analytical descriptions of tournament selection efficiency properties can be elusive for realistic
tournament structures. Combining a Monte Carlo simulation with a statistical model of player
skill and random variation in scoring, we estimate the selection efficiency of the PGA TOUR’s
FedExCup, a very complex multi-stage golf competition, which distributes $35 million in prize
money, including $10 million to the winner. Our assessments of efficiency are based on traditional
selection efficiency measures. We also introduce three new measures of efficiency which focus on
the ability of a given tournament structure to identify properly the relative skills of all tournament
participants and to distribute efficiently all of the tournament’s prize money. We find that reason-
able deviations from the present FedExCup structure do not yield large differences in the various
measures of efficiency.
1. Introduction
In this study, we analyze the selection efficiency of the PGA TOUR’s FedExCup, a large-scale
athletic competition involving a regular season followed by a series of playoff rounds and a “finals”
event, where an overall champion is crowned. FedExCup competition began in 2007. Each year,
at the completion of the competition, a total of $35 million in prize money is distributed to 150
players, with those in the top three finishing positions earning $10 million, $3 million and $2 million,
respectively.1
Research into selection efficiency highlights the importance of the criterion for assessing tour-
nament properties.2 Most who study tournament competition emphasize the probability that the
best player will be declared the winner (“predictive power”) as the critical measure of tournament
selection efficiency. Largely maintaining the focus of the selection efficiency literature on a single
player, Ryvkin and Ortmann (2008) and Ryvkin (2010) introduce two additional selection efficiency
measures, the expected skill level of the tournament winner and the expected skill ranking of the
winner. They develop the properties of these selection efficiency measures in simulated tournament
competition.
While we use these efficiency measures in our work, we also develop three new measures of
selection efficiency that evaluate the overall efficiency of a tournament structure, not just the the
mean skill and mean skill rank of the first-place finisher and the expected finishing position of
the most highly-skilled player. Much of the existing literature (e.g., Ryvkin (2010), Ryvkin and
Ortmann (2008)) assumes a specific set of distributions (e.g., normal, Pareto, and exponential)
to describe competitor skill and random variation in performance. In this paper, we integrate an
empirical model of skill and random variation in performance with a detailed tournament simulation
to explore the selection efficiency of FedExCup competition. We do not specify the matrix of
winning probabilities as in some studies; instead, it is generated naturally from the underlying
estimated distributions of competitor skill and random variation and the tournament structure
itself.
In the next section of the paper we describe the characteristics of FedExCup competition. We
develop tournament selection efficiency measures in Section 3. We present an overview of the
1See http://www.pgatour.com/r/stats/info/?02396.2See Ryvkin and Ortmann (2008) for an excellent recap of existing work along these lines.
1
statistical foundations of our work in Section 4, describe our simulation methods in Section 5, and
present results and a discussion of practical implications of our work in Section 6. We summarize
our findings in the final section. Appendix A describes the details of our simulation.
2. Characteristics of FedExCup Competition
2.1. Structure of FedExCup Competition
Under current FedExCup rules, similar in structure to NASCAR’s Sprint Cup points system, PGA
TOUR members accumulate FedExCup points during the 35-week regular PGA TOUR season.3
As shown in the “Regular Season Points” portion of Table 1, points are awarded in each regular
season PGA TOUR-sanctioned event to those who make cuts using a non-linear points distribution
schedule, with the greatest number of points given to top finishers relative to those finishing near
the bottom. At the end of the regular season, PGA TOUR members who rank 1 - 125 in FedExCup
points are eligible to participate in the FedExCup Playoffs, a series of four regular 72-hole stroke
play events, beginning in late August.
In the Playoffs, points continue to be accumulated, but at a rate equal to five times that of
regular season events. The field of FedExCup participants is reduced to 100 after the first round
of the Playoffs (The Barclays), reduced again to 70 after the second Playoffs round (the Deutsche
Bank Championship), and reduced again to 30 after the third round (the BMW Championship).
At the conclusion of the third round, FedExCup points for the final 30 players are reset according
to a predetermined schedule, with the FedExCup Finals being conducted in connection with THE
TOUR Championship. The player who has accumulated the greatest number of FedExCup points
after THE TOUR Championship wins the FedExCup.4
2.2. FedExCup Competition Objectives
It is clear that the objectives of FedExCup competition are multidimensional and complex. From
the November 25, 2008 interview with PGA TOUR Commissioner Tim Finchem (PGA TOUR
3The rules associated with FedExCup competition have been changed twice. Detail about the revisions is presentedin Hall and Potts (2010).
4A primer on the structure and point accumulation and reset rules may also be found athttp://www.pgatour.com/fedexcup/playoffs-primer/index.html.
2
(2008)), it is possible to identify a number of these dimensions.
1. The points system should identify and reward players who have performed exceptionally wellthroughout the regular season and Playoffs. As such, among those who qualify for the Playoffs,performance during the regular season should have a bearing on final FedExCup standings.
2. The Playoffs should build toward a climactic finish, creating a “playoff-type feel,” holding faninterest and generating significant TV revenue throughout the Playoffs.
3. The points system should be structured so that the FedExCup winner is not determinedprior to the Finals. (In 2008, Vijay Singh only needed to “show up” at the Finals to winthe FedExCup. This led to significant changes in the points structure at the end of the 2008PGA TOUR season.)
4. The points system should give each participant in the Finals a mathematical chance of win-ning. We note that Bill Haas, the 2011 FedExCup winner and lowest-seeded player to everwin, was seeded 25th among the 30 players who competed in the Finals.5
5. The points system should be easy to understand. Under the current system, any player amongthe top five going into the Finals who wins the final event (THE TOUR Championship) alsowins the FedExCup. Otherwise, understanding the system, especially during the heat ofcompetition, can be very difficult.
We do not attempt to quantify the PGA TOUR’s objectives, as summarized above. Instead, we
evaluate the optimal selection efficiency of FedExCup competition based on two decision variables.
The first is the Playoffs points multiple. Presently, Playoffs points are five times regular season
points. This has a potential impact on Commissioner Finchem’s objective points 1 and 2 above.
Talking with PGA TOUR officials, we understand that the TOUR reassesses the FedExCup points
structure at the end of every season and that this multiple is an important part of the discussion.
Reflecting these discussions, we vary the multiple between 1 and 5 in integer increments. Our
second decision variable is whether or not to reset accumulated FedExCup points at the end of
the third Playoffs round. The present reset system is structured to satisfy objectives 3 and 4 and
guarantee that any player among the top five going into the Finals who wins the final event will
win the FedExCup (objective 5, at least in part).
Although we are able to identify optimal competition structures evaluated in terms of our six
efficiency measures, we find that the cost of deviating from optimal structure appears to be small.
This finding suggests that the costs of the implicit constraints associated with the objectives listed
above may not be high.
5Although confusing, we adopt the convention used throughout sports competition that a “low” seeding or finishingposition is a higher number than a “high” position. For example, in a 10-player competition, the “highest” seed isseeding position 1, while the lowest seed is position 10.
3
3. Measures of Efficiency
In order to measure the selection efficiency of various FedExCup competition structures, we sim-
ulate entire seasons of regular PGA TOUR competition followed by four Playoffs rounds. In each
simulation trial, we begin with a set of “true” player skills, or expected 18-hole scores. Throughout
the regular season and Playoffs competition, each simulated score for a given player equals his ex-
pected score, as given by his true skill level, plus a residual random noise component. As the season
progresses, and throughout the Playoffs, each player accumulates FedExCup points according to a
defined set of rules as described in Section 4.1. We then estimate the efficiency of the FedExCup
points system using the criteria described below.
3.1. Ryvkin/Ortmann Selection Efficiency Measures
We use the following three measures of tournament selection efficiency, examined in detail by Ryvkin
and Ortmann (2008) and Ryvkin (2010).
1. The winning (%) rate of the most highly-skilled player, also known as “predictive power.”
2. The mean skill level (expected 18-hole score) of the tournament winner.
3. The mean skill ranking of the tournament winner.
Note that these three criteria focus on a single player, either the most highly-skilled player
(predictive power) or the tournament winner. No weight is placed on the finishing positions of
other players other than through their effect on the finishing position of the most highly-skilled
player or the mean skill ranking or skill level of the tournament winner.
We propose three new measures of selection efficiency that capture the ability of a given tour-
nament format to properly classify all tournament participants according to their true skill levels,
not just the player who is the most highly skilled, and to properly allocate tournament prize money.
Even if the most highly-skilled player in FedExCup competition wins most of the time, the FedEx-
Cup would surely lose credibility if the worst players in the competition could frequently finish
near the top and win a significant portion of the prize money. Ideally, the FedExCup design would
not only identify the single best player in the competition with high probability but would also
4
place players in finishing positions relatively close to their true skill rankings. As such, tournament
prize money would generally be the highest for the most highly skilled and lowest for the lowest
skilled and, therefore, players would be rewarded in relation to their true skill levels. Our final
three measures of selection efficiency take the form of loss functions that reflect these tradeoffs.
3.2. Mean Squared Rank Error (LRE)
Consider a tournament of N players, i = 1, 2, ..., N , ordered by true skill (or expected score) Yi,
with Y1 < Y2, ... < YN . Let j(i) denote the tournament finishing position of player i. For example,
if the most highly-skilled player finishes the tournament in 5th position, j(1) = 5. Then Yj(i) is
the inverse transformation of true skill implied by player i ’s tournament finishing position, j(i),
which we, henceforth, refer to as “implied skill.” Finally, let Mj(i) denote the monetary prize to
player i if he finishes the tournament in position j(i), with M1 > M2, ... > MN . Thus, Mi denotes
what player i ’s prize would have been if his tournament finishing position had equalled his true
skill ranking and Mj(i) denotes player i ’s actual prize.
Our first loss function, the mean squared ranking error, LRE , measures the extent to which the
tournament errs in identifying the true skill rankings of the N tournament participants.
LRE =1
N
N∑i=1
(i− j (i))2
= 2σ2i(1 − ρi,j(i)
), (1)
where σ2i =(N2 − 1
)/12 is the variance of the ranking positions, i = 1, 2, ..., N , and ρi,j(j) is the
Spearman rank order correlation of the true skill ranks, i, and tournament finishing positions, j(i).
Thus, a tournament scheme that maximizes the Spearman rank, ρi,j(j), will minimize the mean
squared ranking error, LRE .6
We note that LRE weights all ranking errors equally, regardless of the actual skill differences of
the players who have been miss-ranked. Our final two efficiency measures reflect these differences.
6We note that Spearman’s footrule, another measure of ranking error, is equivalent to minimizing the sum ofabsolute ranking errors rather than squared ranking errors.
5
3.3. Mean Squared Skill Error (LSE)
The mean squared skill error is defined as follows:
LSE =1
N
N∑i=1
(Yi − Yj(i)
)2= 2σ2Y (1 − βY ) . (2)
Here, σ2Y is the variance of true true player skill, and βY is the OLS slope coefficient associated
with a regression of true player skill Yi on implied player skill, Yj(i), or vice versa. When true skill
rankings and tournament finishing positions are perfectly aligned, βY = 1, and LSE = 0. Note that
if Y is linear in skill rank, LSE = LRE . The mean squared skill error takes the form of a quadratic
loss function, equivalent to the loss function underlying OLS regression and Taguchi’s (2005) loss
function used in quality control.
3.4. Mean Money-Weighted Squared Skill Error (LWSE)
Here we weight each value of(Yi − Yj(i)
)2in (2) by wi = Mi/
N∑i=1
Mi, where Mi is the dollar
tournament prize to the player among N participants who finishes the tournament in position i.
Thus, the mean money-weighted skill error is computed as follows:
LWSE =N∑i=1
(Yi − Yj(i)
)2wi. (3)
In this form, the greatest weight is given to implied skill errors for which the most money is on the
line. Unlike the previous two loss functions as expressed in Equations (1) and (2), this expression
cannot be simplified further without substantial restrictions on the functional form of the weighting
function (and the consequent loss of generality).
3.5. Mean Squared Error Deflators
Inasmuch as the value of each mean squared error is difficult to interpret without a reference point,
we deflate each by the corresponding variance of the variable whose error we are attempting to
estimate (i.e., true skill rankings, true player skill and money-weighted true player skill.) Thus,
6
the deflators for the ranking error, skill error and weighted skill error are, respectively, DRE =(N2 − 1
)/12, DSE = σ2Y , and DWSE =
N∑i=1
Y 2i wi − (
N∑i=1
Yiwi)2.
4. Optimizing FedExCup Competition
4.1. FedExCup Points Distribution and Accumulation
The “Regular Season Points” section of Table 1 shows the distribution of regular season FedExCup
points. WGC and Majors are allocated slightly more points than regular PGA TOUR events.
“Additional events,” which are events held opposite of some WGC events and majors, are allocated
half the points associated with each regular event finishing position.
During the regular PGA TOUR season, players accumulate FedExCup points based on the
regular season points schedule. At the end of the regular season, the top 125 players in accumulated
FedExCup points qualify to participate in the Playoffs. Each participant in the Playoffs carries his
accumulated FedExCup points into the Playoffs, but once in the Playoffs, FedExCup points are
awarded and accumulated according to the schedule shown in the “Playoffs Points” section of the
table. Note that the points distribution schedule for the first three rounds of the Playoffs is exactly
five times the points distribution for regular PGA TOUR events conducted prior to the Playoffs.
At the end of the first Playoffs round (The Barclays), only the top 100 players in accumulated
FedExCup points are eligible to continue to the second round. After the second round (The
Deutsche Bank Championship), only the top 70 players are eligible to continue to the third round.
After the third round (The BMW Championship) only the top 30 players qualify for the FedExCup
Finals (The TOUR Championship). Immediately prior to the Finals, points are reset for each of
the Finals qualifiers according to the schedule shown in the “Finals Reset” column of the “Playoffs
Points” section. Points are awarded during the Finals according to the schedule shown in the last
column of Table 1. (Note that this is exactly the same distribution of points awarded to finishing
positions 1-30 during the first three rounds of the Playoffs.) The points reset was put into place
after the second year of FedExCup competition to ensure that no single player could have won the
FedExCup prior to the Finals event and also to give each participant in the Finals a mathematical
chance of winning the FedExCup.
7
4.2. What We Evaluate
We limit our analysis of selection efficiency to the 125 players who qualify for the FedExCup
Playoffs. Using all six efficiency measures, we evaluate the efficiency of the regular season points
distribution system.7 For this same group of 125 players, we then evaluate each of the six efficiency
measures at the end of each round of the Playoffs in an attempt to determine if each successive
round of the Playoffs improves selection efficiency for this group of 125 players.
We also evaluate selection efficiency over all six measures at the end of every Playoffs round, but
only for those players who qualify to play in each round. Our concern is whether the points system
improves efficiency incrementally at the end of each round for remaining participating players.
5. Statistical Foundations
5.1. Data
Our data, provided by the PGA TOUR, covers the 2003-2010 PGA TOUR seasons. It includes 18-
hole scores for every player in every stroke play event sanctioned by the PGA TOUR for years 2003-
2010 for a total of 151,954 scores distributed among 1,878 players. We limit the sample to players
who recorded 10 or more 18-hole scores. The resulting sample consists of 148,145 observations of
18-hole golf scores for 699 PGA TOUR players over 366 stroke-play events. Most of the omitted
players are not representative of typical PGA TOUR players. For example, 711 of the omitted
players recorded just one or two 18-hole scores.8
5.2. Player Skill Estimation Model
We employ a variation of the Connolly and Rendleman (2008) model to estimate time-varying
player skill and random variation in scoring for a group of professional golfers representative of
PGA TOUR participants during the eight-year period 2003-2010. As in Connolly and Rendleman
(2008), we employ the cubic spline methodology of Wang (1998) to estimate skill functions and
autocorrelation in residual errors for players with 91 or more scores. We employ a simpler linear
7We take the regular season points distribution schedule as shown in Table 1 as given as well as the number ofplayers who qualify for the Playoffs and each of its stages.
8Generally, these are one-time qualifiers for the U.S. Open, British Open and PGA Championship who, otherwise,would have little opportunity to participate in PGA TOUR sanctioned events.
8
representation without autocorrelation, as in Connolly and Rendleman (2012), for players with 10 to
90 scores over the full sample period.9 Simultaneously, we estimate fixed course effects and random
round effects. We note that the model does not take account of specific information about playing
conditions (e.g., adverse weather as in Brown (2011), pin placements, morning or afternoon starting
times, etc.) or, in general, the particular conditions that could make scoring for all players more
or less difficult, when estimating random round effects. Nevertheless, if such conditions combine
to produce abnormally high or low scores in a given 18-hole round, the effects of these conditions
should be reflected in the estimated round-related random effects.10
When estimating player skill functions, we also obtain sets of player-specific residual scoring
errors, denoted as θ and η. The θ errors represent potentially autocorrelated differences between a
player’s actual 18-hole scores, reduced by estimated fixed course and random round effects, and his
predicted scores. The η errors represent θ errors adjusted for estimated first-order autocorrelation,
and are assumed to be white noise. We refer to a player’s skill estimate at a given point in time
as an estimate of his “neutral” score, since estimated fixed course effects and random round effects
have been removed.
6. Simulation of FedExCup Competition
6.1. Simulation Design
We structure each of 40,000 simulation trials so that the composition of the player pool is similar to
what one might observe in a typical PGA TOUR season. As such, we do not include all 699 players
from the statistical sample in each trial. Instead, the number of players per trial varies between
9We established the 91-score minimum in Connolly-Rendleman (2008) as a compromise between having a samplesize sufficiently large to employ Wang’s (1998) cubic spline model (which requires 50 to 100 observations) to estimateplayer-specific skill functions, while maintaining as many established PGA TOUR players in the sample as possible.The censoring of a sample in this fashion will have a tendency to exclude older players who are ending their careersin the early part of the sample and younger players who are beginning their careers near the end. If player skill tendsto vary with age, such a censoring mechanism can create a spurious relationship, where mean skill across all playersin the sample appears to be a function of time. (Berry, Reese and Larkey (1999) show that skill among PGA TOURgolfers tends to improve with age up to about age 29 and decline with age starting around age 36. Thus, ages 30-35tend to represent peak years for professional golfers.) To eliminate any type of age-related sample bias arising from acensored sample, we employ a 10-score minimum, rather than a 91-score minimum, and use simpler linear functionsto estimate skill for those who recorded between 10 and 90 scores.
10Interacted random round-course effects, with similar justification, are also estimated in Berry, Reese and Larkey(1999) and Berry (2001). We also estimate random round-course effects in our original 2008 model. However, webelieve that the course component of a potential round-course effect is better modeled as fixed than random.
9
415 and 459 and reflects the actual number of players in the sample in each year, 2003-2010. We
also structure the simulations so that the simulated distributions of player skill (mean neutral score
per round), scoring, and player tournament participation rates during the simulated regular season
closely approximate those observed in the actual sample. Simulation details are provided in the
Appendix.
6.2. Simulation Results
Table 2 summarizes the simulation sample mean value of each of six efficiency measures at the end of
the regular PGA TOUR season and at the end of each round of the Playoffs evaluated with respect
to the 125 players who qualify for the Playoffs in simulated competition. Each of the six panels of
Table 2 represents one of the six selection efficiency measures. For the efficiency measure shown in
Panel A, “First Place Rate of Best Player,” higher values are better. In the remaining five panels,
lower values indicate greater efficiency. Each efficiency measure is evaluated using a Playoffs points
to regular season points weighting ratio that varies from 1 to 5. All efficiency measures shown in
Panels D-F are the values computed from Equations 1-3, deflated by the corresponding variance of
the variable whose error we are attempting to estimate. Efficiency for Playoff round 4 is evaluated
without a points reset (NR) and with a reset (R). The points reset schedule is that given in Table 1
times weighting ratio divided by 5. We denote the end of the PGA TOUR regular season as “Stage
0” and the end of Playoffs rounds 1-4 as Stages 1 through 4, respectively.
In each panel, the best efficiency value is shown in bold for each stage 1-4. Except for the
few efficiency measures shown in italics, the measures shown in bold are statistically superior in a
one-sided test at the 0.05 level relative to all other values shown for the same stage.11
Regardless of the points weighting or efficiency measure, efficiency improves during each stage
of competition during Playoffs rounds 1-3 and from round 3 to round 4 when there is no points
reset. Only in Panel C (mean skill rank of player in first place) and Panel D (mean squared rank
error) are there any entries where the efficiency measure improves from round 3 to round 4 when
points are reset after round 3. Thus, from the standpoint of pure mathematical efficiency, without
regard to non-mathematical objectives that might lead to a reset being optimal, the 125 players in
11We estimate statistical significance using 10,000 bootstrap samples drawn from the simulated data generated by40,000 trials.
10
the Playoffs are generally ordered more efficiently after the third round of the Playoffs than after
the final round with a reset.
The optimal Playoffs points weight seems to vary by selection efficiency measure; weights of 3
and 4 generally provide the best efficiency. Although this suggests that the present Playoffs points
weight of 5 may be too high, we argue in Section 6.3 that these differences may have little practical
significance.
Table 3 is organized similarly to Table 2. In contrast to Table 2 where the focus is on the
125 Playoffs participants, the focus in Table 3 is on selection efficiency computed incrementally
for each round of the Playoffs for just those players participating in a specific round. Each value
shown in the table reflects the mean value over 40,000 simulation trials of the ratio of the efficiency
measure computed at the end of the stage to the efficiency measure computed at the beginning
of the stage for stage participants only. In all but Panel A, a ratio less than 1 represents an
improvement in efficiency from one stage of Playoffs competition to the next. Again, values shown
in bold correspond to the best values per stage. Unless shown in italics, all other values in the
same stage are significantly inferior at the 0.05 level than the best value shown in bold.
Again, the points reset after the third Playoffs round tends to decrease selection efficiency;
players who participate in the finals with a points reset tend to be ordered less efficiently after the
final stage of competition than they were ordered prior to the final stage. However, a decrease in
selection efficiency is not indicated for all measures. For example, there is unambiguous improve-
ment in the mean squared rank error (Panel D) and an indication of improvement, depending upon
the Playoffs points weighting, in Panel C (mean skill rank of player in first place) and in Panel E
(mean square skill error). However in only one case (a weight of 2 in Panel D) is the efficiency
value with a points reset better than that without.
As in Table 2, optimal points weightings tend to vary by efficiency measure. Nevertheless, a
weight of around 3 appears to produce the best efficiency measures, but in some cases, the current
weight of 5 appears to be optimal.
6.3. Practical Significance
Despite finding optimal values for the Playoffs points weighting and the decision whether to reset
FedExCup points going into the final Playoffs round, we believe that the practical differences are
11
insignificant among efficiency outcomes based on optimal tournament design and those based on
non-optimal design over the range of possible Playoffs design schemes that we consider. The entries
in Table 4, which show the best and worst efficiency outcomes from the corresponding panels of
Table 2 along with efficiency outcomes where the outcomes in each regular season and Playoffs event
are determined randomly, provide support for this view.12 (We show results for random outcomes
using a Playoffs weight of 3 only. With random tournament outcomes, the weight has essentially
no impact on any of the efficiency measures.)
The outcomes in Panels A-C, based on the efficiency measures of Ryvkin and Ortmann, are
the most straightforward to interpret. Panel A shows the rate at which the best player in the
competition wins. At the end of the competition, the best and worst outcomes associated with
regular (non-random) tournament competition fall between 63% and 44%. By contrast, with ran-
dom tournament outcomes, the best player wins less than 1% of the time. Panel B shows the mean
skill level (mean neutral score per round) of the first-place finisher. The best and worst outcomes
at the end of regular tournament competition fall between 68.51 and 68.78 compared with 70.65
when tournament outcomes are determined randomly. Panel C shows the mean skill rank of the
player who finishes the competition in first place. Here the best and worst outcomes at the end of
regular tournament competition fall between 3.17 and 4.58 compared with 64.80 when outcomes
are determined randomly. Clearly, on the basis of these three measures, (non-random) regular tour-
nament competition dramatically improves each of the three efficiency measures over what might
have otherwise been obtained with random tournament outcomes. Whether tournament design is
technically optimal appears to be of second-order importance relative to the general structure of
the competition itself.
Each of the efficiency measures in Panels D-F are the values computed from Equations 1-3,
respectively, deflated by the corresponding variance of the variable whose error is being estimated.
If we further divide the values in Panel D by 2, we obtain(1 − ρi,j(i)
), where ρi,j(i) is the Spear-
man rank order correlation of the true skill ranks and tournament finishing positions. Best and
worst values from regular tournament competition fall between 66% and 71%, which correspond to
Spearman rank correlations of 0.673 and 0.644. By contrast, the 1.975 value for the same efficiency
12We maintain exactly the same simulation design as described in the appendix, but instead of basing tournamentoutcomes on scores, outcomes are based on random orderings of tournament participants, both before and after cuts.
12
measure corresponds to a Spearman rank correlation of 0.013, essentially zero. Clearly the tour-
nament competition, whether optimally designed in terms of Playoffs point weights and the reset,
significantly improves the rank ordering of participating players.
If we divide the values in Panel E by 2, we obtain (1 − βY ), where βY is the OLS slope co-
efficient associated with a regression of true player skill on skill implied by tournament finishing
position. Best and worst values in Panel E fall between 0.561 and 0.613, which correspond to
slope coefficients of 0.720 and 0.694. With random ordering, the 1.975 value for the same efficiency
measure corresponds to a slope of 0.013, essentially zero. As in Panel D, the efficiency values from
non-random competition, whether or not they reflect optimal tournament design, are substantially
better than that obtained by a random ordering of players.
The values for the money-weighted squared skill error, shown in Panel F, are not as readily
interpreted. Nevertheless, best and worst values associated with regular competition fall between
0.273 and 0.394 compared with 3.245 with random tournament outcomes, suggesting that achieving
exactly optimal tournament design is not critical.
Finally, it is clear that the reset substantially reduces efficiency as measured by the winning
rate of the best player as summarized in the Panel A sections of Tables 2-4. With a points reset,
efficiency, as measured at the end of the competition, is actually worse than at the end of the regular
season (Stage 0). Although it is not so easy for the average person following professional golf to
appreciate all the dimensions of the other efficiency measures, we suspect that most would have an
intuitive feel for the skill levels of the best players in golf.13 If the best players are not winning the
FedExCup at a reasonably high rate, and in particular Tiger Woods over the 2003-2010 period of
our study, it isn’t unreasonable to expect that the competition could lose credibility among those
who follow professional golf. Otherwise, we see little cost in changing Playoffs points weights or
the reset to satisfy PGA TOUR objectives that might not be easy to quantify.
13In fact, the TOUR publishes player scoring averages and scoring averages adjusted for field strength throughoutthe PGA TOUR season. Although neither of these measures corresponds exactly to our mean neutral score, theseaverages, along with Official World Golf Rankings and other performance measures, make it relatively straightforwardto identify the best golfers.
13
6.4. Competitiveness and Excitement
It is clear from PGA TOUR Commissioner’s November 25, 2008 interview that the PGA TOUR
strives to create a competitive and exciting playoffs system, building toward a climactic finish,
that will hold fan interest throughout. While aiming to reward players who have performed ex-
ceptionally well throughout the regular season, the TOUR does not want the FedExCup winner to
be determined prior to the Finals. Thus, the TOUR is seeking to achieve a fine balance between
player performance during both the regular season and Playoffs. This balance may not be easily
quantified in terms of the tournament selection efficiency measures we have considered thus far.
By construction, the points reset ensures that the ultimate winner of the FedExCup cannot be
determined until the completion of the final Playoffs event. In the same simulations that underlie
the results summarized in Tables 2 and 3, the winner of the competition would be determined prior
to the Finals with probability 0.337 under the present Playoffs points weighting scheme (weight =
5) if there were no reset. With Playoffs points weights of 1, 2, 3 and 4, the probabilities would
be 0.143, 0.200, 0.252, and 0.299, respectively. Clearly, if the FedExCup winner were determined
prior to the FedExCup Finals, there would be little fan interest in the final event, THE TOUR
Championship, which occurs during the middle of the professional and college football seasons. As
such, we believe that the PGA TOUR would view these probabilities as being unacceptably high.
(If Tiger Woods, or equivalently, a player with his scoring characteristics, is excluded from the
simulations, the probabilities for Playoffs weights of 1-5 would be 0.039, 0.050, 0.066, 0.087, and
0.113, respectively.14 More detailed results for simulations that exclude Woods are provided in the
online appendix.)
Table 5 shows FedExCup winning percentages for 20 of the 125 players in the Playoffs. In Panel
A, the players are ordered by their seeding positions, 1-20, at the beginning of the Playoffs. In Panel
B, players are ordered by their skill rankings, 1-20, in relation to the field of Playoffs participants.
We include an online appendix as a supplement to this paper, which shows results in both panels
for players in all positions, 1-125.
Table 5 shows that without a reset and without giving more weight to FedExCup points earned
14Since Woods was such a dominant player during the 2003-2010 period on which our simulations are based,projections based on the inclusion of a player of Woods’ skill may be misleading, at least at the top position, forfuture periods where there may be no equivalently dominant player.
14
during the Playoffs relative to the regular season, the player who finished the regular season in first
place would win the FedExCup 81.8% of the time. (If Woods is excluded from the simulations, this
estimate falls to 78.6%.) Moreover, without a reset and with a Playoffs weight of 1, all players in
seeding positions 5-125 have less that a 1% probability of winning. (This is also the case if Woods
in not included.) Clearly the winning rate of 81.9% for the top-seeded player and the very low
winning rates associated with players seeded beyond position 4 are inconsistent with the TOUR’s
objectives. Even with a Playoffs weight of 5, the top-seeded player going into the Playoffs wins the
FedExCup over 50% of the time with no reset, and no player beyond seeding position 9 has over
a 1% chance of winning. (Without Woods, 33.5%, and no player past position 14 has more than
a 1% chance of winning.) By contrast, regardless of the weight, the winning percentage rate of
the top-seeded player is substantially lower with a reset, 35.6% to 39.2%, but not so low that his
performance during the regular season goes unrewarded, and many more players have a legitimate
chance to win. (Without Woods, 21.4% to 29.9%.)
From the “Stage 3” column of Table 2, we see that the most-highly skilled player in the com-
petition is the number 1 Playoffs seed 56% to 60% of the time. Therefore, ignoring the remote
possibility that this player would not make the Playoffs, the number 1 seed would be the most
highly-skilled among the 125 Playoffs participants 56% to 60% of the time.15 When this player is
not the top seed, he is very likely be near the top. Panel B shows that with a reset, the winning
percentage rate of the most highly-skilled player in the playoffs is greater than that of the number
1 seed. This suggests that even if the most highly-skilled player is not the number 1 seed, he still
has a reasonably high chance of winning. By contrast, without a reset and with a Playoffs weight
of 1, the most highly-skilled player does not win as often as the number 1 seed, but he does win
more often with a Playoffs weight of 5. (These same relationships also hold when Tiger Woods is
excluded from the simulations.)
Table 6 shows percentage rates per FedExCup finishing position (through finishing position 10)
for the top-10-seeded players going into the Finals. (The online appendix shows the same results
for all 30 players in the Finals over all 30 possible finishing positions.) Panels A and B indicate
that the percentage rates per finishing position are hardly affected by the Playoffs weighting scheme
when there is a points reset going into the Finals. With either a weight of 1 or 5, the top 5 seeds all
15Tiger Woods misses the Playoffs in 39 of 40,000 simulation trials.
15
have a reasonable chance to win, ranging from 5.5% to 45.8%. Although not shown, winning rates
for players in seeding positions 11-30 are all less than 1%. Also, we estimate that a player seeded
in position 25 or worse, the same as Bill Haas’ position going into the 2011 Finals, would win the
FedExCup only 0.26% of the time under the present system with a reset and Playoffs weight of 5.
Thus, Haas’ win was clearly a very rare event.
Panels C and D, show percentage rates per finishing position without a reset. With a Playoffs
weight of 1, there is little remaining uncertainty about the ultimate winner and other top finishers;
all are very likely to finish in the positions in which they started. This problem is mitigated
somewhat with a Playoffs weight of 5. Nevertheless, the number 1 seed wins almost four out of
five times. Interestingly, these same results tend to hold, but just to a slightly lesser extent, when
Tiger Woods is not included in the FedExCup competition.
Although not entirely evident, since not all players and finishing positions are shown, in panels
A and B, except for the first and last seeds, each player’s most likely finishing position is worse
than his initial seed. In Panel C, where there is no points reset and points per Playoffs event are
the same as those of regular season events, the most likely finishing position for almost every Finals
participant is his Finals seeding position. (The only exceptions are for seeding positions 16-20.)
This suggests that a competition scheme with no reset and no differential weighting of Playoffs and
regular season events leaves very little drama and potential for position changes in the Finals. By
contrast, even without a reset, but with a Playoffs points weight of 5, players in Finals seeding
positions 4-28 are most likely to finish worse than they started (positions 10-28 not shown in the
table).
Taken as a whole, we believe that the reset and the weighting of Playoffs points more heavily than
those for regular season events plays a critical role in maintaining drama and potential fan interest
throughout the Playoffs. From a pure efficiency standpoint, the reset tends to be suboptimal.
Nevertheless, it is clear that without a reset, the PGA TOUR could not satisfy its objectives of
conducting a meaningful regular season leading to playoffs with a climactic finish that both holds
fan interest and has the potential to generate significant TV revenue.
16
7. Summary and Conclusions
In this paper we introduce several new tournament selection efficiency measures and apply these
measures and several existing measures in a systematic evaluation of the selection efficiency of
the FedExCup competition run by the PGA TOUR. Our new measures are defined on the full
range of tournament outcomes, not just the characteristics of the top finisher or most highly-
skilled player. Using simulation, we evaluate the efficiency characteristics of specific alternative
tournament structures.
Our simulations show that relative to random selection, every variation on the FedExCup tour-
nament selection method that we consider produces significant improvements in selection efficiency.
Beyond this result, perhaps the most important regularity is that the points reset impairs tourna-
ment efficiency. On the other hand, one important aim of the points reset is to ensure that the
competition is in doubt until the last moment. We show that the reset and weighting of Playoffs
points more heavily than those of regular season events are critical elements in creating an exciting
and dramatic set of Playoffs events. We acknowledge that our analysis of excitement and drama is
much less scientific than our more direct mathematical assessment of tournament selection efficiency
and believe that a more formal development of this aspect of competition could be an interesting
area for future research.
17
AppendixSimulation Methodology
A. FedExCup Regular Season and Playoffs Competition
In simulating the accumulation of FedExCup points during the regular PGA TOUR season andPlayoffs, we make the following assumptions.
1. Between 415 and 459 players participate for a full “regular season” prior to the FedExCupPlayoffs in 35 4-round stroke play events.16 144 players participate in each event. There is no“picking and choosing” of tournaments nor any qualifying requirements.17 The probabilitythat any single player participates in a regular season event reflects his actual participationfrequency on the TOUR.
2. After the first two rounds of each regular season event, the field is cut to the lowest-scoring70 players who then continue for two more rounds of tournament play.18
3. FedExCup points are awarded for each tournament using the “PGA TOUR Regular Seasonevents points distribution” schedule shown in Table 1, assuming each of the 35 tournamentsis a regular PGA TOUR event rather than a “major,” a World Golf Championship event oran “alternate” event held opposite tournaments in the World Golf Championship series.
4. At the end of the 35-event regular season, the Playoffs begin with the top 125 players inFedExCup points participating in The Barclays, the first of four Playoffs events. The Barclaysemploys a cut after the first two rounds, with the lowest-scoring 70 players advancing to thefinal two rounds. At the completion of play, FedExCup points are added to those previouslyaccumulated for each of the 125 Playoffs participants according to the schedule of Playoffspoints shown in Table 1.
5. After The Barclays, the top 100 players in FedExCup points advance to the Deutsche BankChampionship. The Deutsche Bank employs a cut after the first two rounds, with the lowest-scoring 70 players advancing to the final two rounds. FedExCup points are added to thosepreviously accumulated for each of the remaining 100 Playoffs participants according to theschedule of Playoffs points shown in Table 1.
6. After the Deutsche Bank Championship, the top 70 players in FedExCup points advanceto the BMW Championship, where there is no cut. FedExCup points are added to thosepreviously accumulated for each of the remaining 70 Playoffs participants according to theschedule of Playoffs points shown in Table 1.
1635 regular season events reflects the number of weeks of regular season PGA TOUR competition prior to theFedExCup Playoffs during 2010. In three of the 35 weeks, two PGA TOUR sanctioned events were played simulta-neously, but no single player could have participated in the two events at the same time. Therefore, to simplify thesimulations, we treat these weeks as if a single event were held.
17A standard PGA TOUR event consists of 144 players. In the early and late parts of the PGA TOUR season,regular events tend to be reduced in size to 144 players due to limited daylight hours. The TOUR also conducts a few“invitationals” with smaller fields, along with a few smaller field select events, including tournaments in the WorldGolf Championship series. In addition, the Masters, one of the four “majors,” is a small field event, with 97 playersparticipating in 2010.
18Generally, the lowest-scoring 70 players and ties make the cut in regular PGA TOUR events. It is almost certainthat no ties will occur with our simulation methodology, but in the unlikely event that a tie does occur, the tie isbroken randomly.
18
7. After the BMW Championship, the top 30 players in FedExCup points advance to THETOUR Championship.
8. When simulating the present TOUR Championship structure, the number of FedExCuppoints for the 30 participating players is reset according to reset schedule shown in Table1. Players are then awarded additional FedExCup points according to their finishing posi-tion in THE TOUR Championship, a four-round stroke play event with no cut, using thepoints distribution schedule for the Finals as shown in Table 1. The FedExCup winner is theplayer who has earned the most FedExCup points, not necessarily THE TOUR Championshipwinner.
B. Player Selection
Players are selected for regular season tournament participation using the following procedure.
1. A single year from our statistical sample, 2003-2010, is selected, with each year being selectedexactly 40, 000/8 = 5, 000 times.
2. All players who actually participated in the selected year become the regular season playerpool.
3. Players from the regular season pool are selected randomly for participation in each of the35 regular season events, where the probability of any player being selected among the 144tournament participants is equal to the proportion of total player weeks in which he actuallyparticipated in the year selected, assuming sampling without replacement.19
C. Simulated 18-Hole Scoring
The following procedure is used to generate 18-hole scores for players who could potentially competein a given randomly selected PGA TOUR season.
1. A single mean skill level (mean neutral score) for each player is selected at random fromthe portion of his estimated spline-based skill occurring in the selected PGA TOUR season,2003-2010. This becomes the player’s mean skill level for the entire season.20
2. For each player k, a single θ residual is selected at random from among the entire distributionof nk θ residuals estimated in connection with his cubic spline-based skill function.
3. For each player k, 166 η residuals are selected randomly with replacement from among theentire distribution of nk η residuals estimated in connection with his cubic spline-based skillfunction.
4. Using the initial randomly selected θ residual, the vector of 166 randomly-selected η residuals,and player k ’s first-order autocorrelation coefficient as estimated in connection with his cubicspline fit, a sequence of 166 estimated θ residuals is computed.
19In determining the extent of individual player participation on the TOUR, we use weeks played rather thantournament played, since, in a few weeks each year, two PGA TOUR-sanctioned events are held simultaneously.
20We assume that the level of effort for each player throughout the entire regular season and Playoffs is the sameas that reflected, implicitly, in his estimated skill function.
19
5. The 166 θ residuals are applied to player k ’s skill estimate to produce 166 simulated random18-holes scores. The first 10 scores are not used in simulated competition but, instead, aregenerated to allow the first-order autocorrelation process to “burn in.” The next 156 are thescores required for a player who might be selected to play in every regular season tournamentand who misses no cuts during the regular season (35 × 4 = 140) or during the four roundsof the Playoffs (4 × 4 = 16). We note that it is highly unlikely that all 156 scores would beused for any single player.
6. Starting with the 11th score, scores for each player k are applied in sequence as needed tosimulate scoring during the regular season and Playoffs.21
21Suppose player 1 makes the cut in the first regular season event and player 2 missed the cut. If both are selectedto play in the second regular season event, then simulated scoring in the second event will start with scores 15 and13 for players 1 and 2, respectively.
20
Table 1: FedExCup Points Distribution and Reset Schedule
Regular Season Points Playoffs PointsFinishing Regular WGC Additional FinalsPosition Events Events Majors* Events Rounds 1-3 Reset Finals
1 500 550 600 250.0 2,500 2,500 2,5002 300 315 330 150.0 1,500 2,250 1,5003 190 200 210 95.0 1,000 2,000 1,0004 135 140 150 70.0 750 1,800 7505 110 115 120 55.0 550 1,600 5506 100 105 110 50.0 500 1,400 5007 90 95 100 45.0 450 1,200 4508 85 89 94 43.0 425 1,000 4259 80 83 88 40.0 400 800 40010 75 78 82 37.5 375 600 37511 70 73 77 35.0 350 480 35012 65 69 72 32.5 325 460 32513 60 65 68 30.0 300 440 30014 57 62 64 28.5 285 420 28515 56 59 61 28.0 280 400 28016 55 57 59 27.5 275 380 27517 54 55 57 27.0 270 360 27018 53 53 55 26.5 265 340 26519 52 52 53 26.0 260 320 26020 51 51 51 25.5 255 310 25521 50 50 50 25.0 250 300 25022 49 49 49 24.5 245 290 24523 48 48 48 24.0 240 280 24024 47 47 47 23.5 235 270 23525 46 46 46 23.0 230 260 23026 45 45 45 22.5 225 250 22527 44 44 44 22.0 220 240 22028 43 43 43 21.5 215 230 21529 42 42 42 21.0 210 220 21030 41 41 41 20.5 205 210 205. .. .
66 5 5 5 2.5 2567 4 4 4 2.0 2068 3 3 3 1.5 1569 2 2 2 1.0 1070 1 1 1 0.5 5
71-75 576-85 4
* = includes THE PLAYERS Championship. Points for regular events during regular seasondecreased by 0.02 points per finishing position past 70.
21
Tab
le2:
Effi
cien
cyM
easu
res
Com
pu
ted
for
all
125
Fed
ExC
up
Pla
yoff
sQ
ual
ifier
s
Panel
A:
Fir
stP
lace
Rate
of
Bes
tP
layer
Panel
D:
Mea
nSquare
dR
ank
Err
or
(Defl
ate
d)
Wei
ght
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
10.4
92
0.5
14
0.5
37
0.5
60
0.5
80
0.4
37
0.7
96
0.7
57
0.7
30
0.7
15
0.7
13
0.7
12
20.4
92
0.5
30
0.5
65
0.5
97
0.6
24
0.4
49
0.7
96
0.7
41
0.7
03
0.6
83
0.6
81
0.6
80
30.4
92
0.534
0.573
0.606
0.633
0.4
54
0.7
96
0.7
35
0.6
93
0.6
71
0.6
68
0.6
68
40.4
92
0.5
22
0.5
62
0.5
99
0.6
28
0.458
0.7
96
0.735
0.691
0.668
0.665
0.665
*5
0.4
92
0.4
94
0.5
45
0.5
85
0.6
18
0.459
0.7
96
0.7
39
0.6
95
0.6
71
0.6
68
0.6
68
Panel
B:
Mea
nSkill
of
Pla
yer
inF
irst
Pla
ceP
anel
E:
Mea
nSquare
dSkill
Err
or
(Defl
ate
d)
Wei
ght
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
168.8
23
68.7
61
68.7
04
68.6
54
68.6
11
68.7
77
0.6
89
0.6
49
0.6
23
0.6
06
0.5
98
0.6
13
268.8
23
68.7
21
68.6
41
68.5
75
68.5
24
68.7
38
0.6
89
0.6
34
0.5
99
0.5
76
0.5
67
0.5
87
368.8
23
68.707
68.620
68.555
68.507
68.7
26
0.6
89
0.631
0.594
0.570
0.561
0.581
468.8
23
68.7
16
68.6
31
68.5
64
68.5
13
68.721
0.6
89
0.6
36
0.5
99
0.5
74
0.5
63
0.5
82
568.8
23
68.7
47
68.6
56
68.5
84
68.5
27
68.722
0.6
89
0.6
45
0.6
08
0.5
82
0.5
70
0.5
88
Panel
C:
Mea
nSkill
Rank
of
Pla
yer
inF
irst
Pla
ceP
anel
F:
Money
-Wei
ghte
dSquare
dSkill
Err
or
(Defl
ate
d)
Wei
ght
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
17.8
14
6.3
42
5.3
69
4.7
66
4.4
55
4.5
82
0.5
08
0.4
47
0.4
03
0.3
69
0.3
47
0.3
94
27.8
14
5.6
86
4.5
23
3.8
76
3.5
90
3.8
71
0.5
08
0.4
17
0.3
57
0.3
15
0.2
88
0.3
57
37.8
14
5.3
64
4.1
88
3.5
56
3.2
87
3.5
93
0.5
08
0.4
03
0.340
0.298
0.273
0.3
44
47.8
14
5.2
02
4.0
69
3.463
3.1
85
3.4
85
0.5
08
0.401
0.340
0.298
0.273
0.341
57.8
14
5.134
4.046
3.459
3.170
3.452
0.5
08
0.4
08
0.3
48
0.3
06
0.2
78
0.3
43
Sta
ge
0=
end
of
regula
rse
aso
n;
Sta
ge
1=
end
of
firs
tP
layoff
sro
und
(Barc
lays)
;Sta
ge
2=
end
of
seco
nd
Pla
yoff
sro
und
(Deu
tsch
eB
ank);
Sta
ge
3=
end
of
thir
dP
layoff
sro
und
(BM
W);
Sta
ge
4N
R=
end
of
final
Pla
yoff
sR
ound
(TO
UR
Cham
pio
nsh
ip)
wit
hno
poin
tsre
set;
Sta
ge
4R
=en
dof
final
Pla
yoff
sR
ound
(TO
UR
Cham
pio
nsh
ip)
wit
hp
oin
tsre
set.
“W
eight”
isth
ew
eighti
ng
of
Fed
ExC
up
poin
tsaw
ard
edp
erto
urn
am
ent
finis
hin
gp
osi
tion
duri
ng
the
Pla
yoff
sre
lati
ve
toth
ose
award
edduri
ng
the
regula
rse
aso
n.
Each
effici
ency
mea
sure
reflec
tsth
em
ean
valu
eco
mpute
dov
er40,0
00
sim
ula
tion
tria
ls,
5,0
00
per
yea
rfo
rea
chyea
r2003-2
010.
For
each
effici
ency
mea
sure
exce
pt
the
firs
t,a
low
erva
lue
isb
ette
r.T
he
bes
tva
lue
per
stage
issh
own
inb
old
.V
alu
esin
each
stage
that
are
not
signifi
cantl
yin
feri
or
stati
stic
ally
ina
one-
sided
test
at
the
0.0
5le
vel
rela
tive
toth
eopti
mal
valu
efo
rth
esa
me
stage
are
show
nin
italics
.T
he
opti
mal
stage-
4va
lue
wit
hno
poin
tsre
set
isalw
ays
signifi
cantl
yb
ette
rat
the
0.0
5le
vel
than
the
opti
mal
valu
ew
ith
ap
oin
tsre
set
exce
pt
wher
enote
dw
ith
an
ast
eris
k,
inw
hic
hca
seth
eopti
mal
valu
ew
ith
rese
tis
bet
ter.
22
Tab
le3:
Effi
cien
cyM
easu
res
Com
pu
ted
Incr
emen
tall
yp
erP
layo
ffs
Sta
ge
Panel
A:
Fir
stP
lace
Rate
of
Bes
tP
layer
Panel
D:
Mea
nSquare
dR
ank
Err
or
(Defl
ate
d)
Wei
ght
Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
11.0
44
1.0
45
1.0
42
1.0
35
0.782
0.9
51
0.9
47
0.9
44
0.9
42
0.919
21.0
77
1.0
64
1.0
58
1.0
46
0.7
51
0.9
31
0.9
26
0.9
23
0.9
31
0.9
26
31.085
1.0
73
1.0
57
1.0
44
0.7
51
0.9
24
0.9
18
0.9
14
0.9
27
0.9
30
41.0
60
1.0
75
1.0
67
1.0
49
0.7
64
0.923
0.915
0.9
10
0.9
25
0.9
31
51.0
04
1.101
1.074
1.057
0.785
0.9
28
0.915
0.908
0.923
0.9
30
Panel
B:
Mea
nSkill
of
Pla
yer
inF
irst
Pla
ceP
anel
E:
Mea
nSquare
dSkill
Err
or
(Defl
ate
d)
Wei
ght
Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
10.9
99
0.9
99
0.9
99
0.9
99
1.002
0.9
43
0.9
37
0.9
31
0.9
28
0.987
20.9
99
0.9
99
0.9
99
0.9
99
1.0
02
0.9
21
0.9
15
0.9
07
0.9
11
1.0
11
30.998
0.9
99
0.9
99
0.9
99
1.0
02
0.916
0.909
0.9
01
0.9
08
1.0
10
40.9
98
0.9
99
0.9
99
0.9
99
1.0
02
0.9
23
0.9
10
0.8
98
0.9
06
0.9
99
50.9
99
0.999
0.999
0.999
1.0
02
0.9
37
0.9
11
0.896
0.902
0.986
Panel
C:
Mea
nSkill
Rank
of
Pla
yer
inF
irst
Pla
ceP
anel
F:
Money
-Wei
ghte
dSquare
dSkill
Err
or
(Defl
ate
d)
Wei
ght
Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
10.8
12
0.8
39
0.8
68
0.896
0.937
0.8
81
0.8
82
0.8
84
0.8
80
1.003
20.7
28
0.7
90
0.8
40
0.895
0.9
97
0.8
21
0.8
36
0.8
43
0.852
1.0
72
30.6
87
0.775
0.837
0.9
00
1.0
12
0.7
94
0.822
0.836
0.854
1.0
92
40.6
66
0.777
0.8
40
0.898
1.0
07
0.790
0.8
29
0.838
0.855
1.0
80
50.657
0.7
83
0.8
45
0.897
0.9
97
0.8
03
0.8
34
0.8
42
0.854
1.0
56
Sta
ge
1=
end
of
firs
tP
layoff
sro
und
(Barc
lays)
;Sta
ge
2=
end
of
seco
nd
Pla
yoff
sro
und
(Deu
tsch
eB
ank);
Sta
ge
3=
end
of
thir
dP
layoff
sro
und
(BM
W);
Sta
ge
4N
R=
end
of
finalP
layoff
sR
ound
(TO
UR
Cham
pio
nsh
ip)
wit
hno
poin
tsre
set;
Sta
ge
4R
=en
dof
final
Pla
yoff
sR
ound
(TO
UR
Cham
pio
nsh
ip)
wit
hp
oin
tsre
set.
“W
eight”
isth
ew
eighti
ng
of
Fed
ExC
up
poin
tsaw
ard
edp
erto
urn
am
ent
finis
hin
gp
osi
tion
duri
ng
the
Pla
yoff
sre
lati
ve
toth
ose
award
edduri
ng
the
regula
rse
aso
n.
Each
effici
ency
mea
sure
reflec
tsth
em
ean
valu
eof
the
rati
oof
the
effici
ency
mea
sure
com
pute
dat
the
end
of
the
stage
toth
eeffi
cien
cym
easu
reco
mpute
dat
the
beg
innin
gof
the
stage
for
stage
part
icip
ants
only
over
40,0
00
sim
ula
tion
tria
ls,
5,0
00
per
yea
rfo
rea
chyea
r2003-2
010.
For
each
effici
ency
mea
sure
exce
pt
the
firs
t,a
low
erva
lue
isb
ette
r.T
he
bes
tva
lue
per
stage
issh
own
inb
old
.V
alu
esin
each
stage
that
are
not
signifi
cantl
yin
feri
or
stati
stic
ally
ina
one-
sided
test
at
the
0.0
5le
vel
rela
tive
toth
eopti
mal
valu
efo
rth
esa
me
stage
are
show
nin
italics
.T
he
opti
mal
stage-
4va
lue
wit
hno
poin
tsre
set
isalw
ays
signifi
cantl
yb
ette
rat
the
0.0
5le
vel
than
the
opti
mal
valu
ew
ith
ap
oin
tsre
set
exce
pt
wher
enote
dw
ith
an
ast
eris
k,
inw
hic
hca
seth
eopti
mal
valu
ew
ith
rese
tis
bet
ter.
23
Tab
le4:
Effi
cien
cyM
easu
res
wit
hR
and
omT
ourn
amen
tO
utc
omes
Panel
A:
Fir
stP
lace
Rate
of
Bes
tP
layer
Panel
D:
Mea
nSquare
dR
ank
Err
or
(Defl
ate
d)
Met
hod
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Bes
t0.4
92
0.5
34
0.5
73
0.6
06
0.6
33
0.4
59
0.7
96
0.7
35
0.6
91
0.6
68
0.6
65
0.6
65
Wors
t0.4
92
0.4
94
0.5
37
0.5
60
0.5
80
0.4
37
0.7
96
0.7
57
0.7
30
0.7
15
0.7
13
0.7
12
Random
0.0
06
0.0
08
0.0
08
0.0
08
0.0
08
0.0
08
1.9
69
1.9
69
1.9
73
1.9
75
1.9
75
1.9
75
Panel
B:
Mea
nSkill
of
Pla
yer
inF
irst
Pla
ceP
anel
E:
Mea
nSquare
dSkill
Err
or
(Defl
ate
d)
Met
hod
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Bes
t68.8
23
68.7
07
68.6
20
68.5
55
68.5
07
68.7
21
0.6
89
0.6
31
0.5
94
0.5
70
0.5
61
0.5
81
Wors
t68.8
23
68.7
61
68.7
04
68.6
54
68.6
11
68.7
77
0.6
89
0.6
49
0.6
23
0.6
06
0.5
98
0.6
13
Random
70.6
64
70.6
54
70.6
53
70.6
52
70.6
51
70.6
51
1.9
67
1.9
69
1.9
74
1.9
75
1.9
75
1.9
75
Panel
C:
Mea
nSkill
Rank
of
Pla
yer
inF
irst
Pla
ceP
anel
F:
Money
-Wei
ghte
dSquare
dSkill
Err
or
(Defl
ate
d)
Met
hod
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Sta
ge
0Sta
ge
1Sta
ge
2Sta
ge
3Sta
ge
4N
RSta
ge
4R
Bes
t7.8
14
5.1
34
4.0
46
3.4
59
3.1
70
3.4
52
0.5
08
0.4
01
0.3
40
0.2
98
0.2
73
0.3
41
Wors
t7.8
14
6.3
42
5.3
69
4.7
66
4.4
55
4.5
82
0.5
08
0.4
47
0.4
03
0.3
69
0.3
47
0.3
94
Random
65.3
62
64.8
81
64.8
86
64.8
00
64.8
01
64.8
01
3.2
69
3.2
47
3.2
48
3.2
46
3.2
45
3.2
45
Sta
ge
0=
end
of
regula
rse
aso
n;
Sta
ge
1=
end
of
firs
tP
layoff
sro
und
(Barc
lays)
;Sta
ge
2=
end
of
seco
nd
Pla
yoff
sro
und
(Deu
tsch
eB
ank);
Sta
ge
3=
end
of
thir
dP
layoff
sro
und
(BM
W);
Sta
ge
4N
R=
end
of
final
Pla
yoff
sR
ound
(TO
UR
Cham
pio
nsh
ip)
wit
hno
poin
tsre
set;
Sta
ge
4R
=en
dof
final
Pla
yoff
sR
ound
(TO
UR
Cham
pio
nsh
ip)
wit
hp
oin
tsre
set.
“M
ethod”
isth
em
ethod
by
whic
hva
lues
are
com
pute
d,
wit
h“O
pti
mal”
den
oti
ng
the
opti
mal
valu
efr
om
the
corr
esp
ondin
gpanel
of
Table
2,
“W
ors
t”den
oti
ng
the
wors
tva
lue
from
the
corr
esp
ondin
gpanel
of
Table
2and
“R
andom
”den
oti
ng
the
valu
ew
hen
all
tourn
am
ent
outc
om
esare
det
erm
ined
random
lyusi
ng
aP
layoff
sw
eight
of
3.
Each
effici
ency
mea
sure
reflec
tsth
em
ean
valu
eco
mpute
dov
er40,0
00
sim
ula
tion
tria
ls,
5,0
00
per
yea
rfo
rea
chyea
r2003-2
010.
For
each
effici
ency
mea
sure
exce
pt
the
firs
t,a
low
erva
lue
isb
ette
r.
24
Tab
le5:
Fed
ExC
up
Win
nin
gP
erce
nta
ges
by
Pla
yoff
sSee
din
gP
osit
ion
and
Rel
ativ
eS
kil
lR
ank
Panel
A:
Posi
tion
Base
don
Pla
yoff
sSee
din
gP
osi
tion
Panel
B:
Posi
tion
Base
don
Rel
ati
ve
Skill
Rankin
gs
Wit
hR
eset
Wit
hout
Res
etW
ith
Res
etW
ithout
Res
etP
osi
tion
Wei
ght
=1
Wei
ght
=5
Wei
ght
=1
Wei
ght
=5
Wei
ght
=1
Wei
ght
=5
Wei
ght
=1
Wei
ght
=5
139.2
35.6
81.8
52.8
43.7
45.9
58.0
61.8
216.6
14.5
10.8
14.4
14.7
13.1
20.1
14.4
39.8
8.0
3.1
6.7
8.7
8.2
6.6
6.1
46.8
5.5
1.4
4.1
5.8
5.3
3.8
3.5
55.2
4.0
0.8
2.8
4.2
4.0
2.2
2.3
63.9
3.2
0.5
2.2
2.8
2.7
1.4
1.6
73.0
2.4
0.3
1.5
2.3
2.1
1.1
1.2
82.4
2.1
0.3
1.3
1.9
2.0
1.1
1.2
91.8
1.8
0.2
1.1
1.8
1.8
0.8
1.1
10
1.4
1.5
0.1
0.9
1.4
1.4
0.7
0.7
11
1.2
1.4
0.1
0.8
1.1
1.2
0.5
0.7
12
1.0
1.3
0.1
0.7
1.0
1.0
0.4
0.5
13
0.8
1.1
0.1
0.7
0.9
0.8
0.4
0.4
14
0.7
1.0
0.0
0.5
0.7
0.8
0.3
0.4
15
0.7
1.0
0.1
0.6
0.7
0.7
0.3
0.3
16
0.6
0.8
0.0
0.5
0.6
0.6
0.2
0.3
17
0.5
0.7
0.0
0.4
0.6
0.6
0.2
0.3
18
0.5
0.7
0.0
0.4
0.6
0.6
0.2
0.3
19
0.4
0.6
0.0
0.3
0.4
0.5
0.2
0.3
20
0.4
0.6
0.0
0.3
0.5
0.5
0.2
0.2
Base
don
40,0
00
sim
ula
tion
tria
ls(8
,000
per
yea
rfo
ryea
rs2003-2
010).
“W
eight”
isth
ew
eighti
ng
of
Fed
ExC
up
poin
tsaw
ard
edp
erto
urn
am
ent
finis
hin
gp
osi
tion
duri
ng
the
Pla
yoff
sre
lati
ve
toth
ose
award
edduri
ng
the
regula
rse
aso
n.
InP
anel
B,
skill
rankin
gs
are
rela
tive
toth
e125
pla
yer
sin
the
Pla
yoff
s.If
dis
pla
yed
toa
pre
cisi
on
of
0.1
%,
all
entr
ies
bel
owp
osi
tion
91
would
equal
zero
.
25
Tab
le6:
Fin
ish
ing
Pos
itio
nP
erce
nta
geR
ates
by
Fin
als
See
din
gP
osit
ion
Panel
A:
Wit
hR
eset
,P
layoff
sW
eight
=1
Panel
C:
Wit
hout
Res
et,
Pla
yoff
sW
eight
=1
Fin
ishin
gP
osi
tion
Fin
ishin
gP
osi
tion
See
d1
23
45
67
89
10
12
34
56
78
910
143.8
37.0
16.7
2.4
0.1
0.0
0.0
0.0
0.0
0.0
92.7
6.9
0.4
0.0
0.0
0.0
0.0
0.0
0.0
0.0
216.6
21.0
39.0
20.5
2.9
0.1
0.0
0.0
0.0
0.0
5.5
80.8
12.4
1.3
0.1
0.0
0.0
0.0
0.0
0.0
39.8
7.8
18.5
40.0
20.7
3.2
0.1
0.0
0.0
0.0
1.2
7.6
71.1
17.5
2.4
0.2
0.0
0.0
0.0
0.0
47.1
5.2
5.6
21.4
39.5
18.5
2.6
0.1
0.0
0.0
0.4
2.5
8.8
63.3
21.0
3.5
0.4
0.0
0.0
0.0
55.5
3.1
3.1
5.5
21.4
37.2
20.3
3.5
0.2
0.0
0.1
1.1
3.3
8.5
57.2
24.0
5.0
0.7
0.1
0.0
63.7
2.8
2.1
3.0
4.7
15.6
36.7
25.9
5.2
0.3
0.1
0.5
1.6
3.6
8.4
52.4
26.1
6.3
0.9
0.1
72.7
1.3
2.2
1.8
2.9
3.8
14.9
37.8
26.6
5.6
0.0
0.3
0.9
2.2
3.6
7.9
48.1
27.9
7.7
1.3
82.2
1.0
1.1
1.8
1.5
2.3
3.4
13.3
32.4
29.6
0.0
0.1
0.6
1.3
2.1
3.5
8.0
44.8
28.9
8.7
91.8
1.1
0.5
1.4
1.2
1.0
2.0
3.3
9.5
26.5
0.0
0.1
0.3
0.8
1.5
2.2
3.3
7.8
41.2
30.0
10
1.1
1.3
0.2
0.5
1.4
1.1
0.7
1.9
2.9
5.6
0.0
0.0
0.2
0.6
1.0
1.5
2.0
3.2
7.9
38.9
Panel
B:
Wit
hR
eset
,P
layoff
sW
eight
=5
Panel
D:
Wit
hout
Res
et,
Pla
yoff
sW
eight
=5
Fin
ishin
gP
osi
tion
Fin
ishin
gP
osi
tion
See
d1
23
45
67
89
10
12
34
56
78
910
145.8
36.2
15.6
2.2
0.1
0.0
0.0
0.0
0.0
0.0
79.4
16.7
3.4
0.5
0.1
0.0
0.0
0.0
0.0
0.0
214.5
20.8
41.3
20.5
2.7
0.1
0.0
0.0
0.0
0.0
9.6
50.4
28.8
9.1
1.7
0.3
0.0
0.0
0.0
0.0
39.2
6.9
18.2
41.2
21.0
3.3
0.1
0.0
0.0
0.0
3.9
9.6
34.4
33.9
14.2
3.4
0.6
0.1
0.0
0.0
46.8
4.7
5.3
20.8
40.5
19.0
2.8
0.1
0.0
0.0
2.2
5.2
7.7
26.1
34.7
17.9
5.1
1.0
0.1
0.0
55.5
3.3
3.2
5.1
19.7
37.6
21.4
4.0
0.2
0.0
1.3
3.6
4.4
6.6
21.6
33.2
20.2
7.1
1.6
0.3
63.9
3.0
2.0
3.0
4.8
14.2
35.5
27.4
5.9
0.3
1.0
2.7
3.2
3.6
6.1
18.6
30.8
21.7
8.9
2.7
72.7
1.4
2.4
1.9
2.9
3.8
14.0
36.8
27.5
6.2
0.6
2.0
2.4
2.7
3.5
5.7
16.6
28.4
22.6
10.7
82.3
1.2
1.2
1.8
1.5
2.4
3.3
12.4
31.1
30.6
0.5
1.6
1.9
1.9
2.3
3.3
5.9
15.2
26.3
23.1
91.9
1.1
0.5
1.5
1.2
1.0
2.1
3.5
9.1
24.9
0.3
1.3
1.5
1.6
1.9
2.4
3.2
5.9
14.0
24.5
10
1.3
1.4
0.2
0.5
1.5
1.2
0.7
2.0
3.0
5.4
0.2
1.1
1.3
1.3
1.4
2.0
2.4
3.3
5.9
13.4
Base
don
40,0
00
sim
ula
tion
tria
ls(8
,000
per
yea
rfo
ryea
rs2003-2
010).
“P
layoff
sW
eight”
isth
ew
eighti
ng
ofF
edE
xC
up
poin
tsaw
ard
edp
erto
urn
am
ent
finis
hin
gp
osi
tion
duri
ng
the
Pla
yoff
sre
lati
ve
toth
ose
award
edduri
ng
the
regula
rse
aso
n.
26
References
Berry, S. M, “How Ferocious is Tiger?,” Chance 14:3 (2001) 51-56.
Berry, S. M., C. S. Reese and P. D. Larkey, “Bridging Different Eras in Sports,” Journal of the AmericanStatistical Association 94:3 (1999) 661-676.
Brown, Jennifer,“Quitters Never Win: The (Adverse) Incentive Effects of Competing with Superstar,” Jour-nal of Political Economy 119:5 (2011), 982-1013.
Connolly, Robert A. and Richard J. Rendleman, Jr., “Skill, Luck and Streaky Play on the PGA Tour,” TheJournal of The American Statistical Association 103:1 (2008), 74-88.
Connolly, Robert A. and Richard J. Rendleman, Jr., “What it Takes to Win on the PGA TOUR (If YourName is “Tiger” or If It Isn’t),” Interfaces published online May 31 (2012).
Hall, Nicholas G. and Chris N. Potts, “A Proposal for Redesign of the FedEx CUp Playoff Series on thePGA TOUR,” Interfaces (2010, forthcoming).
PGA TOUR, “FedExCup Q&A with Commissioner Finchem,” November 25, 2008.http://www.pgatour.com/2008/fedexcup/11/25/tuesday.transcript.finchem/index.html (accessed November21, 2011).
Ryvkin, Dmitry, “The Selection Efficiency of Tournaments,” European Journal of Operational Research 206:3(2010), 667 - 675.
Ryvkin, Dmitry. and Andreas Ortmann, “The Predictive Power of Three Prominent Tournament Formats,”Management Science 54:3 (2008), 492-504.
Taguchi, G. et. al. Taguchi’s Quality Engineering Handbook “Part III : Quality Loss Function.” John Wiley& Sons, NJ. (2005), 171-98.
Wang, Y., “Smoothing Spline Models with Correlated Random Errors,” The Journal of The AmericanStatistical Association 93:1 (1998), 341-348.
27