soccer/world football data collection and … · soccer/world football david r. brillinger...

16
UNCORRECTED PROOFS Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 1 SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football, also known as asso- ciation, is a sport involving two teams of players kicking and heading a medium-sized ball about on a large rectangular field. Each team has a goal that is quite wide and high and seeks to shoot the ball into the other team’s goal. The basic objective is to see which team of the two can score the most goals and thereby win the game. The sport is subject to laws set down by the F´ ed´ eration Interna- tionale de Football Association (FIFA). Typ- ically, there is a group of teams playing against each other in a league or tourna- ment, with a formal schedule of games. A team’s intention then is to win a tourna- ment or championship by scoring the most points, where 3 points are awarded for a win, 1 for a tie, and 0 for a loss. The sport of soccer has a long and full his- tory possibly going back to 2500 BC [1]. In the present time, the teams each have 11 players. One, the goalie, may handle the ball but only within a restricted area in front of his goal. There is a neutral referee whose decisions are final. The grandest tournament, the World Cup, moves amongst different countries and occurs nominally every 4 years. The past world champions are Uruguay, Argentina, Brazil, England, France, Germany, and Italy. The sport has become a multibillion dollar business. Tactics are basic to the games, with many styles and plating formations available. A team’s choice has been based, in part, on studies of match statistics. The pioneer in the field of match performance analysis is Charles Reep [2–4]. Wiley Encyclopedia of Operations Research and Management Science, edited by James J. Cochran Copyright © 2010 John Wiley & Sons, Inc. DATA COLLECTION AND DESCRIPTIVE ANALYSES Data collection and analysis have been basic to soccer for many years. The most common statistics are a game’s result and the number of goals scored by each team. Outputs of the data analyses include reports, tables, graphs, talks, images, and videos. Many things go on during a game. A basic question is what is to be recorded for the analysis and how to do so. Charles Reep [2,3] developed solutions starting in the early 1950s. The study of basic events in a game has been called both as match performance analysis and match analysis [2,5–7]. A recent addition to the type of infor- mation available for individual games is near-continuous high frequency digital spatial-temporal data. Companies come to stadium and set up an array of video cameras. By signal processing, they then develop the spatial-temporal coordinates of the changing locations of the players on the field, the ball, and the referee. Companies doing this include Match Analysis, Prozone Holdings, and Sport-Universal SA. Di Salvo et al. [8] have validated one such system. The basic quantities that may now be tabled directly include counts of tackles, crosses, distances covered, key moments and actions, possessions, passes, interceptions, runs with the ball, incorrect referee decisions, fouls, penalty kicks, challenges, entries into the opponent’s area, ball touches, blocks, forward passes, long balls, high intensity running, ball velocity and accuracy, and balls received. Individual player’s performances may be evaluated directly. The data can lead to changes in tactics during a game [9,10]. Substantial soccer databases have been available for many years. Websites include www.uefa.com, www.fifa.com [11,12], Q1 www.soccerbase.com, www.soccerway.com, and www.soccerpunter.com. 1

Upload: others

Post on 12-Mar-2020

14 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 1

SOCCER/WORLD FOOTBALL

DAVID R. BRILLINGER

Statistics Department, Universityof California, Berkeley,California

Soccer/world football, also known as asso-ciation, is a sport involving two teams ofplayers kicking and heading a medium-sizedball about on a large rectangular field. Eachteam has a goal that is quite wide and highand seeks to shoot the ball into the otherteam’s goal. The basic objective is to see whichteam of the two can score the most goals andthereby win the game. The sport is subjectto laws set down by the Federation Interna-tionale de Football Association (FIFA). Typ-ically, there is a group of teams playingagainst each other in a league or tourna-ment, with a formal schedule of games. Ateam’s intention then is to win a tourna-ment or championship by scoring the mostpoints, where 3 points are awarded for a win,1 for a tie, and 0 for a loss.

The sport of soccer has a long and full his-tory possibly going back to 2500 BC [1]. In thepresent time, the teams each have 11 players.One, the goalie, may handle the ball but onlywithin a restricted area in front of his goal.There is a neutral referee whose decisions arefinal. The grandest tournament, the WorldCup, moves amongst different countries andoccurs nominally every 4 years. The pastworld champions are Uruguay, Argentina,Brazil, England, France, Germany, and Italy.The sport has become a multibillion dollarbusiness.

Tactics are basic to the games, with manystyles and plating formations available. Ateam’s choice has been based, in part, onstudies of match statistics. The pioneer inthe field of match performance analysis isCharles Reep [2–4].

Wiley Encyclopedia of Operations Research and Management Science, edited by James J. CochranCopyright © 2010 John Wiley & Sons, Inc.

DATA COLLECTION AND DESCRIPTIVEANALYSES

Data collection and analysis have been basicto soccer for many years. The most commonstatistics are a game’s result and the numberof goals scored by each team. Outputs of thedata analyses include reports, tables, graphs,talks, images, and videos.

Many things go on during a game. A basicquestion is what is to be recorded for theanalysis and how to do so. Charles Reep[2,3] developed solutions starting in the early1950s. The study of basic events in a gamehas been called both as match performanceanalysis and match analysis [2,5–7].

A recent addition to the type of infor-mation available for individual games isnear-continuous high frequency digitalspatial-temporal data. Companies cometo stadium and set up an array of videocameras. By signal processing, they thendevelop the spatial-temporal coordinates ofthe changing locations of the players on thefield, the ball, and the referee. Companiesdoing this include Match Analysis, ProzoneHoldings, and Sport-Universal SA. Di Salvoet al. [8] have validated one such system.The basic quantities that may now be tableddirectly include counts of tackles, crosses,distances covered, key moments and actions,possessions, passes, interceptions, runswith the ball, incorrect referee decisions,fouls, penalty kicks, challenges, entries intothe opponent’s area, ball touches, blocks,forward passes, long balls, high intensityrunning, ball velocity and accuracy, and ballsreceived. Individual player’s performancesmay be evaluated directly. The data canlead to changes in tactics during a game[9,10]. Substantial soccer databases havebeen available for many years. Websitesinclude www.uefa.com, www.fifa.com [11,12], Q1www.soccerbase.com, www.soccerway.com,and www.soccerpunter.com.

1

Page 2: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 2

2 SOCCER/WORLD FOOTBALL

One early investigation of basic data isthat of Moroney ([13], pp. 101–103). He stud-ied the numbers of goals scored in 480 gamesand prepared a histogram of the counts. Inanother early study, the number of successfulpasses in passing movements were studiedfor the English First Division during theyears 1957–1958 and 1961–1962 [14]. Theresults were presented in tabular form. Reepand Benjamin found a stable correspondencebetween shots on goal and goals scored ofabout 10 to 1. Distances traveled by individ-ual players during a game are graphed byplayer position in Ref. 5. Hughes and Franks[15] present scatter and time series plots.One early discovery was the existence of ahome advantage. Schedules are often set upso that the two teams meet both, at homeand away to deal with this. Jochems [16,17],while addressing the question of whether theDutch football pools were breaking the gam-bling laws, found the percentages to be 46.6for a home win, 31.0 for a visitor win, and22.4 for a draw/tie. Managers make use ofsuch data in their decision-making, more soevery year. Stochastic models are importantin this connection.

STOCHASTIC MODELING

Stochastic models are pertinent to soccerstatistics, because there is much uncertaintyin what happens and what may happen. Dur-ing the preceding 50 years, stochastic modelshave been constructed to address soccer ques-tions.

Between Game Modeling

Some models may be distinguished as to con-cerning goals, win–tie–loss, or points. Spe-cific distributions and models that have beenemployed include bivariate Poisson, expo-nential, extreme value, GARCH, generalizedlinear, logistic, Markov, negative binomial,ordinal, regression, prior, point process, andstate space. Many analyses are based on thePoisson distribution whose probability func-tion is given by

Prob{Y = y} = μye−y/y! for y = 0, 1, 2, . . . .

Here Y might represent the number ofgoals a team scores in a future game. Inan early work, Moroney [13], using the datamentioned above, compared the number ofgoals scored by a team in a game to theirestimated expected frequency assuming thata Poisson distribution held. On examiningthe result, he was led to seek to improveit and fit a negative binomial distribution.This is a generalization of the Poisson. Thisfit was satisfactory. However, Greenhoughet al. [18] found that when data from 169countries were pooled, extreme value distri-butions were needed.

Reep et al. [14,19] work on the numberof passes in successful passing movements.They fit the negative binomial and found itwas good when 0-length cases were excluded.

Suppose next that team i at home isplaying against team j visiting. Denote thefinal score by (Xij, Yij). Various authors haveassumed that Xij and Yij are independentPoissons with respective means,

αiβj, γiδj Ref. 20

exp{α + η + γi + δj},exp{α + γj + δi}

Refs 21 and 22

αiβjη, αjβi Ref. 23

exp{α + η + γi − δj − φ

(εi − εj)}, exp{α + γj − δi+ φ(εi − εj)}

Ref. 24

here η is the home advantage, φ the psycho-logical effect, and εi = γi + δi. In their fitting,Dixon and Coles modify the low score modelprobabilities. The probability that team iwins the game may be estimated having esti-mated the model parameters.

Maher [20] estimates α, β, γ , δ by max-imum likelihood and mentions fitting thebivariate Poisson. Karlis and Ntzoufras [25]give details of fitting a bivariate Poisson,studying the data for 24 leagues. They findthat the assumption of independence is notrejected in 15 out of the 24 cases. They alsoconsider the negative binomial distributionand the inclusion of interaction terms.

Page 3: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 3

SOCCER/WORLD FOOTBALL 3

The goal difference Xij − Yij is also impor-tant. It determines whether a game result iswin, tie or loss (W, T, or L) and is used toresolve conflicts when the points are equal.Stefani [26] considers the model

Xij − Yij = η + ρi − ρj + random error

with η the home advantage and ρi the rank-ings. Karlis and Ntzoufras [27] fit the Poissondifference distribution to observed goal differ-ences including a home effect, attacking, anddefensive parameters.

Fahrmeir and Tutz [28] employ a logisticlatent variable-cutpoint setup to model anordinal-valued variable. Their model is

Prob{Yij = r} = F(θr + αi − αj)

−F(θr−1 + αi − αj)

for r = W, T, L,

with F the logistic and αi the ability of team i.In contrast, Brillinger [29–33] employs anextreme value variable-cutpoint approach. Itleads to the model

Prob{ i wins at home playing j}= 1 − exp{− exp{βi + γj + θ2}},

Prob{ i ties at home against j}= exp{− exp{βi + γj + θ2}}

× − exp{− exp{βi + γj + θ1}},Prob{ i loses at home against j}

= exp{− exp{βi + γj + θ1}},

with θ2 >θ1 cutpoints and the standardiza-tions

∑βi,

∑γi = 0. The parameters βi and

γi represent home and away effects of team i.The extreme value distribution employed islonger tailed to the right. The result of thepreceding game and the distances betweenthe cities involved were considered asexplanatories. The βi − γi can be interpretedas the advantage of team i when playingat home. These values were also consideredin Ref. 34. These authors model the goaldifferences as

Xij − Yij = η + αi + βj + random error,

calling the αi and βj ‘‘forces.’’ Least squaresestimation is employed. Including pastresults did not improve things. Goodard [35]fits an ordered probit model, that is, F aboverelates to the normal, with explanatories.

In a linear regression analysis, Panaretos[36] studies the number of points collectedin the course of a double round robin tour-nament and the effect of the explanatoriesgoals scored, goals conceded, and time of ballpossession. The regression model

points = α

+βlog(goals scored/goals conceded)

+ γ ball possession + random noise

is fitted and a proportion of varianceexplained of 0.971 is found.

Barnett and Hilditch [37] study whetherthe field being artificial turf affects the gameresult. They find that the home team’s advan-tage was increased. The use of artificial turfwas abandoned.

Attention now turns to the case wheretime or round, t, plays an essential role inthe modeling. Jochems [16,17] defines, as ameasure of strength of team i at home andj visiting, λij = Wi − Wj, with Wi the averagepoints of team i in its preceding games. Theprediction of the result for team i is

i wins if λij > 0,

i wins if λij = 0 home advantage,

i loses if λij < 0.

Jochem’s study had been commissioned bythe Netherlands Lottery Commission to see ifany skill was involved on the part of tipsters.

The models listed above can be turned intoforecasting procedures by simply adding a t tothe subscripts of the parameters and estimat-ing them by using the data up to time t. Onethen uses the schedule of remaining games todevelop forecasts. For example, Stefani [26]considers the model

Xijt − Yijt = ηijt + ρi − ρj + random error,

Page 4: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 4

4 SOCCER/WORLD FOOTBALL

with ρi a rating based on i’s past games andηijt = ±η represents a home advantage. Thequantity ρi represents team i’s ability. The fitis by conditional least squares.

Dixon and Coles [23] employ indepen-dent Poissons and introduce an exponentiallydecaying time function to damp out the effectsof previous rounds. The Poisson means areαi(t)βj(t)η and αj(t)βi(t), respectively. The vari-able t is time or round number and η standsfor home advantage. Estimation is by maxi-mizing the likelihood.

Fahrmeir and Tutz [28] work with a latentvariable-motivated ordinal model,

Prob {Rtij = r} = F(θtr − αti − αtj)

−F(θtr−1 − αti − αtj),

with R the result, r = 0, 1, 2 is a win–tie–lossindicator. Further F is the logistic, and αti arerandom walks in t. The computations involvea Kalman filter and simulation. Rue andSalvesen [24] employ a Bayesian dynamicgeneralised linear model with independentnormals having conditional means given thepast

exp {α + η + γ ti − δt

j − φ(εti − εt

j )},exp {α + γ t

j − δti + φ(εt

i − εtj )},

to predict next weekend’s results. Theyemploy a directed graph to describe thecausal structure of the model as a function oftime.

Brillinger [32,33], for each round, t,uses previous round results. He fits alatent variable-based model and then usessimulation to make projections of the finalpoints and standings of the teams using theknowledge of the remaining games in theschedule. Estimates are then available forfuture rounds.

Harvey and Fernandes [38] study theseries of goals scored by England againstScotland in a biyearly match. In a state spaceapproach, they assume a negative binomialmodel with mean an exponentially decayingfunction of past values.

Within Game Modeling

Next, consider modeling the progress of asingle game. One approach breaks the course

of a game into states corresponding to thezones in the field. Pollard and Reep [39]carry out a logistic regression analysis tomodel the probability of scoring from variouspositions on the field. In a series of papers[40–42], Markov chains with a finite numberof states are employed. Hirotsu and Wright[40] use a four-state model to determine theoptimal timing of substitution and tacticaldecisions. The expected number of points tobe gained from a change in tactics is consid-ered an objective, and dynamic programmingis employed. The model includes transitionrates of scoring and conceding, and ratesof gaining and losing possession. Hirotsuand Wright found evidence for replacing adefender with an attacker when losing.

A common problem is to look for changetaking place. Croucher [43] studied the effectof changing the points awarded. The objectof this change is to generate more attackingplay. Ridder et al. [44] study the effect of ared card in game. They infer that the scoring Q2rate does change after the ejection, and has anegative effect on the team with fewer play-ers. Other studies that deal with the effectare given in Refs 45–47. The latter take apoint process approach. When team i is play-ing team j, the times of goals are modeled byPoisson processes with log rates

νij(t) = λi(t) exp {αj + βzij(t)} and

νji(t) = λj(t) exp {αi + βzji(t)},

respectively, t being time. The function, λi(t),is referred to as the attack intensity for teami and αi as the defense parameter. The zij arethe explanatories

zij1(t) = 1 when rival player off with red

card, 0 otherwise;zij2(t) = 1 if actual score positive;zij3(t) = 1 if actual score negative;

where t is time, β and zij are the 3 vectors.The parameters are estimated using datafrom the 2006 World Cup. The fitted modelmay be employed to generate simulations.Dixon and Robinson [48] also take a point

Page 5: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 5

SOCCER/WORLD FOOTBALL 5

process approach, employing a birth process,to model the scoring.

The match video data being collectedthese days may be processed to determinetrajectories of players and the ball. Thenmodels can be developed. Xie et al. [49] takea video of a game, break it into segments,for example, ‘‘play’’ or ‘‘break.’’ They dealwith camera pans and zooms using thegreen grass ratio and motion intensity.Their analysis uses dynamic programmingand spatial-temporal hidden Markov pro-cesses. Min et al. [50] work on the sameproblem making use of rule-based reason-ing, Bayesian inference, simulation, andexpert systems. Palavi et al. [51] developsa graph-based multiplayer detection andtracking system. In the simplest case of anoteable goal scoring movement in one game,Brillinger [31] develops a potential functionapproach to obtain a stochastic model.

Berument et al. [52] look for evidence ofsoccer results affecting the Turkish stockmarket. They do find an effect for one teamas well as a day of the week effect.

Garvia [53] asks, ‘‘Is soccer dying?’’ Heuses a Markov switching model to look fora break in the series of yearly average goalsper game. He finds a drop for England, Italy,and Spain around 1965.

RANKING

Rankings/ratings/seedings are numericalvalues meant to describe the relativeperformances of teams in a league. Theyare used for a variety of specific purposes.One is the preparation of schedules forknock-out tournaments. Another is for themaximization of the probability that the bestcompetitors meet in a tournament’s laterstages. If there are N teams the collectionof ranking values might be the sequenceof integers 1 to N. A famous exampleis the FIFA/Coca-Cola world ranking[www.fifa.com.worldfootball/ranking/procedure/men.html [54]], where the values derivefrom a formula involving major games duringthe previous 4 years, strength of opponent,W–T–L points, and several other items. Itorders the teams of all the countries that are

members of the FIFA and is updated steadilyas more game results become available.

Jochems [16,17] employed the differencesof average game scores and compared this totipsters’ predictions. Hill [55] uses Kendall’stau to compare tipster results with the endof season standings and finds association.Stefani [26] suggests the model

Xij − Yij = η + ρi − ρj + random error,

with the ρi’s rankings. Stefani [56] surveyedthe major world sports rating systems. Bas-sett [57] suggests improving the estimatesby employing a robust estimation methodto handle long tails. Forrest and Simmons[58,59] investigate the predictive quality ofnewspaper tipsters match results. Stefaniand Pollard [60] provide a critical survey ofranking procedures. Gelade and Dobson [61]relate the international standing of a coun-try’s team to various social factors includingnumber playing regularly, wealth, and cli-mate. McHale and Davies [62] focus on theFIFA rankings.

There are papers on ratings for othersports; for example, Refs 63–65.

TOURNAMENTS AND SCHEDULING

Scheduling is a basic step involved whengames need to be organized amongst themembers of a group of teams. Two basictypes of tournament design are the roundrobin and the knock-out. In the round robin,each pair of teams plays against each otherthe same number of times, equally at homeand away. The champion of the group is theteam with the most points at the end of allthe games. In the knock-out case, there isa seeding order and the first round is basedon it. The tournament progresses with twoteams playing in pair setup by the seedingorder and the winner going on to the follow-ing round. This continues until only one teamis left, the champion.

Sometimes there are numerical-valuedobjective functions that can be employedin developing schedules. Their forms caninclude distance traveled, total expenses,numbers of consecutive home or away

Page 6: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 6

6 SOCCER/WORLD FOOTBALL

games, and referees’ time. There are oftenconstraints that need to be taken noteof in the scheduling. For example, theremay be multiple venues, television sched-ules, desired dates for two teams fromthe same city. Analytic tools employed inan optimization include integer program-ming, maximization, knapsack algorithms,simulation and stochastic models.

Kuonen [21,66] employs a logistic regres-sion model and shows how output can beused to estimate probabilities of interest for aknock-out tournament. Unlike a round robin,future opponents for the following rounds areonly known probabilistically. Kuonen workswith European Cup Winners Cup data. Heinvestigates three methods for calculatingseeding coefficients based on the data for 3preceding years.

Urban and Russell [67] consider schedul-ing competitions on multiple venues, venuesnot associated with any of the participants.Della Croce and Oliveri [68] present aninteger linear programming approach forscheduling the Italian League. They had todeal with cable TV and with games betweenteams in the same city.

Scheduling the Chilean League has alsoposed special problems. As described in Refs69 and 70, these were dealt with via usingmathematical programming, recognizedexisting weaknesses, stadiums availability,international requirements, and reducingtravel distance.

Objectives may be described probabilisti-cally and then stochastic models are involved.When the goal is to determine a champion,it is natural to ask questions like: what isthe probability that the ‘‘best’’ team actuallywins the tournament, or how much better isthe first team than the second? Appleton [71]provides definitions and reviews many tour-nament structures. He employs simulations.

Scarf et al. [72] provide an extensivereview including discussion of: tournamentmetrics, how the design influences outcomeuncertainty, the use of simulation, robust-ness (to teams dropping out), effects of rulechanges, and probability model employed.

He defines the parameter

PqR = Prob{team in top 100q

pretournament rank percentile progresses};

here R is round achieved and 0 < q < 1. Tounderstand the PqR values, he graphs themagainst q.

It is necessary to schedule the refereesalso. Yavuz et al. [73] provide an extensivereview. The topics covered include leaguesin different countries requiring differentobjective functions and constraints, tensionbetween referees and clubs caused by pastincidents, reducing frequent assignment ofthe same referee to a team’s games. In ama-teur leagues, there may be several gamesthe same day for same referee, perhaps atdifferent places even.

GAME THEORY

The expression ‘‘game theory’’ has both a gen-eral and a technical meaning. The ‘‘technical’’comes from the classic [74] setup. The ‘‘gen-eral’’ refers to others, particularly tactics.Both meanings provide general frameworkswith which to study soccer. A difficulty aris-ing in the theories’ applications to soccer isthat the game is highly dynamic. Jones andTrantner’s [75] book is unusual in that, itstarts by emphasizing the defensive forma-tions. Wilkinson’s [76] book is devoted to thetopic of tactics.

The formation selected would be controlledby the coach. The 2-3-5 was very common formany years. In it, there are two defensive spe-cialists, five attacking, and three midfieldersin between. Newer formations include 3-5-2, 5-4-1, 4-5-1, 4-3-3, and even 4-1-4-1 withan extra row. The formation 4-4-2 is favoredby various famous teams. The two attackerscan drop back to assist in obtaining the ball.When the team has possession of the ball, thetwo outside midfielders can move forward toincrease pressure on the opponent. Wilson[77] provides a review and insightful anal-ysis of many of the formations employed ingames dating from 1878 to 2008.

Pollard and Reep [39] seek to quantify theeffectiveness of different playing strategies.

Page 7: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 7

SOCCER/WORLD FOOTBALL 7

They break the action of a game down intodiscrete events, for example, pass, center,shoot. They define the ‘‘yield’’ as the probabil-ity a goal is scored minus the probability oneis conceded. This quantity is used to evaluatethe expected outcome of a team possession.

The book, The Science of Soccer, has achapter on ‘‘game theory’’ [78]. It discusseshow the strength and deployment of the teammoderates the apparent random motion anddiscusses the fact that low scoring benefitsthe weaker team.

One of the momentous occasions in a soc-cer game is the awarding of a penalty shot.There have been a number of formal studies.Greenless et al. [79] study how the penaltytakers’ uniform color and prepenalty gazeaffect the impressions formed by the goal-keeper. When penalties take place to breaka tie, McGarry and Franks [80] identify anoptimal order for a team to take the shots in.Their conclusion is to begin by ranking theplayers 1 (best) to 11 (worst). Then use theorder: 5, 4, 3, 2, 1, and if necessary after that6, 7, 8, 9, 10, and 11. Jordet et al. [81] consid-ers the roles of stress, skill, and fatigue foranswering why the English are poor at penal-ties. He analyzed 200 shots from World Cupand European Championship. In an analysisof many penaltues, Franks et al. [82] learnedthat in 80% of the cases studied, if the non-kicking foot points to the left, the ball willbe shot to the left. They developed a trainingprogram to test this discovery.

Larsen [2] describes the arrival of Reep’sideas in Norway and the success that teamsadopting it enjoyed.

The theory of von Neumann and Morgen-stern provides general definitions and leadsto random strategies. Haigh [83] provides thefollowing intuitive example. The table showsestimated percentages of successful penaltyshots based on 1876 attempts.

KickerGoalie Left Center Right

Left 60 90 93

Center 100 30 100

Right 94 85 60

The situation can be considered is a two-person zero-sum game. If the kicker wins byscoring, the goalie loses. If the goalie winsby saving, the kicker loses. Haigh finds thatNash equilibrium theory leads to the goalie’schosing to move left, center or right withthe respective probabilities of 0.44, 0.13, and0.43. For the kicker, they are 0.37, 0.29, and0.34, respectively. If either player uses theirstrategy, 80% of the shots would be scored inthe long run. This turns out to be close to theactual percentage.

Other references to applications of the for-mal theory include Refs 84–86. Hirotsu andWright [85] consider a zero-sum game focus-ing on who wins the game, while Hirotsu et al.[87] focus on points gained and where one hasa nonzero-sum game. Bennett et al. [88] usehypergame analysis, involving the rankingof various forms of hooligans’ actions andauthorities’ reactions followed by hooligan’sreactions and so on, followed by exploring theimplications of such scenarios.

ECONOMICS AND MANAGEMENT

Reilly and Williams [89] write, ‘‘In the 1980sit became apparent that the football (soccer)industry and professionals in the game couldno longer rely on the traditional methodsof previous decades. Methods of managementscience were applied to organizing the big soc-cer clubs and the training of players could beformulated on a systematic basis.’’ Desbordes[90, Chapter 10] advocates the introductionof modern management tools into soccer. Thismakes sense if for no other reason than thefact that billions of dollars are now involvedin the sport. One does need to remember thatat the youth and amateur levels, the amountof money involved is little. Really, all that isneeded for a game is a ‘‘ball’’ and a field or abeach. This is the charm of the game.

Management refers to a sport’s structure,owners, sponsors, marketers, financiers,schedulers, and others with some form ofcontrol. Management provides the super-structure. The fans expect both their team’splayers and its management to be successfulin acquiring players, providing facilities,and hiring the coach. Management assists

Page 8: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 8

8 SOCCER/WORLD FOOTBALL

by supporting the enterprise generally. Itprovides the infrastructure. Other thingsthat management is responsible for includecash inflows–outflows, advertising, man-power planning and hiring, planning newand keeping up old stadiums, and sign-ing players. The stadium owners providefacilities for security, ticket collection, food,souvenirs, and parking. Management studiesthe performance of their players and others[10,91].

Sometimes, there is a numerical-valuedobjective function for the management towork with. Then optimization methods maybe employed to good advantage. Other con-cerns are loss functions, earnings/revenue,prestige/honor, recruitment, salary negotia-tion, scouting, referees, merchandise, com-mercialism, productivity, profitability, travel,and television rights.

There is a common wish to improve indi-vidual player’s and a team’s performance. Anaccompanying question is how to assess per-formance and efficiency. One paper on thetopic for English soccer is Ref. 92. Others areRefs 93 and 94. The latter concerns the Span-ish Soccer League. There have been analyses.Audas et al. [95] study the impact of man-agerial change on team performance usingthe data of English football match resultssince the 1970s, while Partovi and Corre-doira [96] investigate models for prioritizingand designing rule changes.

In a study of competitive balance in theDutch League, Koning [97] employs the val-ues Cij, corresponding to W, T, L, generatedvia

Dij = ϕi − ϕj + εij with the ε’s mean 0,

common variance normals,

Cij = 1 if Dij > c2

= 0 if Dij = 0

= −1 if Dij < c1.

It is used to study changes in competitivebalance in Holland.

A novel application of programming the-ory is presented in Ref. 98. He applies aninteger program to the generalized knapsack

problem to find a ‘‘Dream Team.’’ Player val-ues were established by a public poll withthe participants given a budget of 2.5 milliondollars each. A knapsack algorithm was thenemployed to establish the best team at thelowest price.

Calster et al. [99] study an unwanted hap-pening in soccer, the scoreless draw. Theoccurrence is related to indices of a team’soffensive performance including total goalsand earned points per game.

SOME REMAINING TOPICS

The book, The Inner Game of Soccer, pro-vides details of the game from a referee’sstandpoint [100]. It discusses the laws andthe mechanics and talks of the characteris-tics of the best referees and the ones who canturn benign games into ugly unsportsman-like confrontations.

Reference 78 lays out some of the physicsof soccer. For example, the section titled Q3‘‘Tournaments and Scheduling’’ analyzes themovement of a ball spinning in flight and howplayers can make it bend. Hall [101] elabo-rates on that describing the physical forces atwork in a double banana shot, which curvesin one direction then swerves in another.Such a shot is very difficult for the goalie tohandle.

Sports biomechanics concerns physicalactions that occur in sport that can beobserved and improved, for example, toreduce injuries. The Refs 102 and 103 areon the topic of biomechanics in soccer. Grim-pampi et al. [104] concern computationalbiomechanics. Bradley et al.’s study [105] isa human biology study. It refers to results onhigh-intensity running using data collectedvia an array of television cameras.

There are many articles on employingstochastic models in evaluating bets. Mostof the articles referred above go on to studythe efficiency of various gambling strategies.

What is ahead for the game? Wright [106]writes on 50 years of operations research insport to provide a background. The futureis discussed in Ref. 107 and by the sportswriter Garner [108] in World Soccer. Hesuggests the following changes: bigger goals,

Page 9: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 9

SOCCER/WORLD FOOTBALL 9

smaller penalty area, a revised offside rule,fouls denoted contact or technical, red-cardedplayer punished but replacement allowed.Reviewing the literature and practice sug-gest that one can expect to see much more inthe way of the collection and analysis of thespatial-temporal data collected in a game.

REFERENCES

1. Henshaw R. The encyclopedia of world soc-cer. Washington, DC: New Republic; 1979.

2. Larsen O. Charles Reep: a major influenceon British and Norwegian football. SoccerSoc 2001;2:55–78.

3. Pollard R. Charles Reep (1904–2002): pio-neer of notational and performance analysisin football. J Sports Sci 2002;20:853–855.

4. Pollard R. Home advantage in soccer: varia-tions in its magnitude and a literature reviewof the inter-related factors associated with itsexistence. J Sport Behav 2006;29:169–189.

5. Reilly T, Thomas V. A motion analysis ofwork-rate in differential positional roles inprofessional football match-play. J Hum MovStud 1976;2:87–97.

6. McGarry T, Franks IM. The science of matchanalysis. In: Science and soccer, 2nd ed. New

Q4

York: Routledge; 2003. pp. 265–275.7. Hughes M. Notational analysis. In: Science

and soccer, 2nd ed. London: Routledge; 2003.pp. 245–264.

8. Di Salvo V, Collins A, McNeill B, CardinaleM. Validation of Prozone: a new video-basedperformance analysis system. Int J PerformAnal Sport 2006;6:108–119.

9. Carling C, Reilly T, Williams AM. Handbookof soccer match analysis. London: Routledge;2005.

10. Carling C, Reilly T, Williams AM. Perfor-mance assessment for field sports. London:Routledge; 2009.

11. FIFA. Laws of the game. (2008/2009). www.fifa.com.

12. FIFA. Disciplinary code. 2009. www.fifa.com/mm/50/02/75/disco_2009_en.pdf.

13. Moroney M. Facts from Figures. London: Pel-ican; 1951.

14. Reep C, Benjamin B. Skill and chancein association football. J R Stat Soc A1968;131:581–585.

15. Hughes M, Franks I. Analysis of passingsequence, shots and goals in soccer. J SportsSci 2005;23:509–514.

16. Jochems DB. Voetbalprognoses. Stat Neerl1958;12:17–31.

17. Jochems DB. Forecasting the outcomes ofsoccer matches: a statistical appraisal of theDutch experience. Metrika 1962;5:194–207.

18. Greenhough J, Birch PC, Chapman SC,Rowlands G. Football goal distributionsand extremal statistics. Phys A 2002;316:615–624.

19. Reep C, Pollard R, Benjamin B. Skill andchance in ball games. J R Stat Soc A1971;134:623–629.

20. Maher MJ. Modelling association footballscores. Stat Neerl 1982;36:109–118.

21. Kuonen D.. Modelling the success of soccerteams in European championships. Techni-cal Report 96.1, Department of Mathematics,Swiss Federal Institute of Technology, Lau-sanne; 1996.

22. Lee AJ. Modelling soccer in the PremierLeague: is Manchester United really thebest? Chance 1997;15–19. Q5

23. Dixon MJ, Coles SC. Modelling associa-tion football scores and inefficiencies inthe football betting market. Appl Stat1997;46:265–280.

24. Rue H, Salvesen O. Prediction and retrospec-tive analysis of soccer matches in a league.J R Stat Soc D 2000;49:399–418.

25. Karlis D, and Ntzoufras I. Analysis of sportsdata using bivariate Poisson models. Statis-tician 2003;52:381–393.

26. Stefani RT. Improved least squares foot-ball, basketball, and soccer predictions. IEEETrans Syst, Man, Cybern 1980;10:116–123.

27. Karlis D, Ntzoufras I. Bayesian modelling offootball outcomes: using the Skellam’s dis-tribution for goal difference. IMA J ManagMath 2009;20:133–145.

28. Fahrmeir L, Tutz G. Dynamic stochasticmodels for time-dependent ordered pairedcomparison systems. J Am Stat Assoc1994;89:1438–1449.

29. Brillinger DR An analysis of an ordinal-valued time series. In: Athens conference onapplied probability and time series analysis.Volume 2, Lecture Notes in Statistics Volume115. Springer; 1996. pp. 73–87.

30. Brillinger DR Modelling some Norwegiansoccer data. In: Advances in statistical mod-elling and inference. Singapore: World Sci-entific; 2006. pp. 3–20.

31. Brillinger DR A potential function approachto the flow of play in soccer. J Quant Anal

Page 10: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 10

10 SOCCER/WORLD FOOTBALL

Sports 2007;3(1):3.32. Brillinger DR. Modelling game outcomes of

the Brazilian 2006 Series A Championshipas ordinal-valued. Brazilian. J Probab Stat2008;22:89–104.

33. Brillinger DR. An analysis of Chinese SuperLeague partial results. Sci China A: Math2009;52:1139–1156.

34. Leeflang PSH, van Praag BMS. A procedureto estimate relative powers in binary contactsand an application to Dutch Football Leagueresults. Stat Neerl 1971;25:203–208.

35. Goodard J. Regression models for forecast-ing goals and match results in associationfootball. Int J Forecast 2005;21:331–340.

36. Panaretos VM (2002). A statistical analysisof the European Soccer Champions League.ASA Proceedings of the Joint Statistics Met-ings. pp. 2600–2602.

37. Barnett V, Hilditch S. The effect of an arti-ficial pitch surface on home team perfor-mance in football (soccer). J R Stat Soc A1993;156:39–50.

38. Harvey AC, Fernandes C. Time series modelsfor count data of qualitative observations.J Bus Econ Stat 1989;7:407–422.

39. Pollard R, Reep C. Measuring the effective-ness of playing strategies at soccer. J R StatSoc D 1997;46:541–550.

40. Hirotsu N, Wright M. Using a Markov pro-cess model of an association football matchto determine the optimal timing of substitu-tion and tactical decisions. J Oper Res Soc2002;53:88–96.

41. Hirotsu N, Wright M. An evaluation of char-acteristics of teams in association football byusing a Markov process model. Statistician2003;52:591–602.

42. Hirotsu N, Wright M. Determining thebest strategy for changing the configura-tion of a football team. J Oper Res Soc2003;54:878–887.

43. Croucher JS. The effect of changing compe-tition points in the English football league.Teach Stat 1984;6:39–42.

44. Ridder G, Cramer JS, Hopstaken P. DownQ6to ten: estimating the effect of a red card insoccer. J Am Stat Assoc 1994;89:1124–1127.[see also Albert J, Bennett J, Cochran JJ.Anthology of statistics in sports. Phildelphia:SIAM; 2005. pp. 299–302.]

45. Wright M, Hirotsu N. The professional foulin football: tactics and deterrents. J Oper ResSoc 2003;54:213–221.

46. Vecer J, Kopriva F, Ichiba T. Estimatingthe effect of a red card in soccer: when tocommit an offense in exchange for prevent-ing a goal opportunity. J Quant Anal Sports2009;5(1):8.

47. Volf P. A random point process model forscore in matches. IMA J Manag Math2009;20:121–131.

48. Dixon MJ, Robinson ME. A birth pro-cess model for association football matches.Statistician 1998;47:523–538.

49. Xie L, Xu P, Chang S-F, Divarkanm A, SunH. Structure analysis of soccer video withdomain knowledge and hidden Markov mod-els. Pattern Recognit Lett 2004;25:76–775.

50. Min B, Kim J, Choe C, Eom H, McKay RI. Acompound framework for sports prediction:a football case study. Knowledge-Based Syst2008;21:551–562.

51. Palavi V, Mukherjee J, Majumdar AK, SuralS. Graph-based multiplayer detection andtracking in broadcast soccer videos. IEEETrans Multimedia 2008;10:794–805.

52. Berument H, Ceylan NB, Gozpinar E. Perfor-mance of soccer on the stock market: evidencefrom Turkey. Social Sci J 2006;43:695–699.

53. Garvia A. Is soccer dying? A time seriesapproach. Appl Econ Lett 2007;7:275–278.

54. FIFA. Ranking. 2006. www.fifa.com.worldfootball/ranking/procedure/men.html.

55. Hill ID. Association football and statisticalinference. Appl Stat 1974;23:203–208.

56. Stefani RT. Survey of the major world spor-ts rating systems. J Appl Stat 1997;24:635–646.

57. Bassett GW. Robust sports ratings based onleast absolute errors. Am Stat 1997;51(2). Q7

58. Forrest D, Simmons R. Making up theresults: the work of the Football Pools Panel,1963–1997. Statistician 2000;49:253–260.

59. Forrest D, Simmons R. Forecasting sport:the behaviour and performance of footballtipsters. Int J Forecast 2000;16:317–331

60. Stefani R, Pollard R. Football rating systemsfor top-level competition: a critical survey.J Quant Anal Sports 2007;3(3):3.

61. Gelade GA, Dobson P. Predicting the com-parative strengths of national football teams.Social Sci Q 2007;88:244–258.

62. McHale I, Davies S. Statistical analysis ofthe effectiveness of the FIFA world rankings.In: Albert J, Koning RH, editors. Statistical Q8thinking in sports. New York: Chapman andHall; 2008. pp. 77–90.

Page 11: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 11

SOCCER/WORLD FOOTBALL 11

63. Leake RJ. A method for ranking teams withan application to 1974 college football. NorthHolland: Management Science in Sports;1976.

64. Harville D. The use of linear model method-ology to rate high school or college footballteams. J Am Stat Assoc 1977;72:278–289.

65. Stern H. On the probability of winning a foot-ball game. Am Stat 1992;45:179–183. [seeQ9also Albert J, Bennett J, Cochran JJ. Anthol-ogy of statistics in sports. Phildelphia: SIAM;2005. pp. 53–57].

66. Kuonen D. Statistical models for knock-out soccer tournaments. Technical Report97.3. Lausanne: Department of Mathemat-ics, Swiss Federal Institute of Technology;1997.

67. Urban TL, Russell RA. Scheduling sportscompetitions on multiple venues. Eur J OperRes 2003;148:302–311.

68. Della Croce F, Oliveri D. Scheduling the Ital-ian Football League: an ILP-based approach.Comput Oper Res 2006;33:1963–1974.

69. Noronha TF, Ribeiro CC, Duran G, Souyris S,Weintraub A. A branch-and-cut algorithm forscheduling the highly-constrained Chileansoccer tournament. Lect Notes Comput Sci2007;3867:174–186.

70. Duran G, Guajardo M, Weintraub A, Wolf R.Scheduling the Chilean League using math-ematical programming. OR/MS Today 2009.Q10

71. Appleton DR. May the best man win? Statis-tician 1995;44:529–538.

72. Scarf P, Yusof MMN, Bilbao M. A numericalstudy of designs for sporting contest. EurJ Oper Res 2009;198:190–198.

73. Yavuz M, Inan UH, Figlah A. Fair ref-eree assignments for professional foot-ball leagues. Comput Oper Res 2008;35:2937–2951.

74. Von Neumann J, Morgenstern O. Theoryof games and economic behavior. Princeton:Princeton University Press; 1944.

75. Jones R, Trantner T. Soccer Strategies:defensive and attacking tactics. Reedswain:Spring City; 1979.

76. Wilkinson WHG. Soccer tactics: top teamstrategies explained. Crowood: Marlborough;1988.

77. Wilson J. Inverting the pyramid: a history offootball tactics. London: Orion; 2008.

78. Wesson J. The science of soccer. New York:Taylor and Francis; 2002.

79. Greenless I, Leyland A, Thelwell R, FilbyW. Soccer penalty takers’ uniform colourand pre-penalty gaze affects the impressionsformed by opposing goal-keepers. J SportsSci 2008;26:569–576.

80. McGarry T, Franks IM. On winning thepenalty shoot-out in soccer. J Sports Sci2000;18:401–409.

81. Jordet G, Hartman E, Vischer C, LemminkKAPM. Kicks from the penalty mark in soc-cer: the roles of stress, skill, and fatigue forkick outcomes. J Sport Sci 2007;25:121–129.

82. Franks I, McGarry T, Hanvey T. From nota-tion to training: analysis of the penalty kick.Insight 1999;2:24–26.

83. Haigh J. Uses and limitations of math-ematics in sport. IMA J Manag Math2009;20:97–108.

84. Pacacios-Huerta I. Professionals play mini-max. Rev Econ Studies 2003;70:395–415.

85. Hirotsu N, Wright M. Modelling tacticalchanges of formation in association footballas a zero-sum game. J Quant Anal Sports2006;2(2):4.

86. Coloma G. Penalty kicks in soccer:an alternative methodology for testingmixed-strategy equilibria. J Sports Econ2007;8:530–545.

87. Hirotsu N, Ito M, Miyaji C, Hamano K,Taguchi A. Modelling tactical changes of for-mation in association football as a non-zero-sum game. J. Quant Anal Sports 2009;5(3):2.

88. Bennett PG, Dando MR, Sharp RG. Usinghypergames to model difficult social issues:an approach to the case of soccer hooligan-ism. J Oper Res Soc 1980;31:621–635.

89. Peiser B, Minten J. Soccer violence. In: Reilly Q11T, Williams AM, editors. Science and soc-cer. 2nd ed. Abingon: Routledge; 2003. pp.230–242.

90. Desbordes M. Marketing and football:an international perspective. Butterworth-Heinemann; 2007. Q12

91. Covell D, Walker S, Siciliano J, Hess PW.Managing sports organizations: responsibil-ity for performance, 2nd ed. Portsmouth:Butterworth-Heinemann; 2007.

92. Dawson P, Dobson S, Gerrard B. Stochasticfrontiers and the temporal structure of man-agerial efficiency in English soccer. J SportsEcon 2000;1–4:341–362.

93. Haas DJ. Technical efficiency of MajorLeague Soccer. J Sports Econ 2003;04–03:203–215.

Page 12: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 12

12 SOCCER/WORLD FOOTBALL

94. Sala-Garrido R, Carrion VL, Esteve AM,Bosca JE. Analysis and evolution of efficiencyin the Spanish Soccer League. J Quant AnalSports 2009;5(1):3.

95. Audas R, Dobson S, Goddard J. The impactof managerial change on team perfor-mance in professional sports. J Econ Bus2002;54:633–650.

96. Partovi FY, Corredoira RA. Quality deploy-ment for the good of soccer. Eur J Oper Res2002;137:642–656.

97. Koning RH. Balance in competition in Dutchsoccer. Statistician 2000;49:419–431.

98. Mosheiov G. The solution of the soccer‘Dream League’ game. Math Comput Model1998;27:79–83.

99. Calster BV, Smits T, Van Huffel S. The curseof scoreless draws in soccer: the relation-ship with a team’s offensive, defensive, andoverall performance. J Quant Anal Sports2008;4(1):4.

100. Sellin E. The inner game of soccer. MountainView: World Publications; 1976.

101. Hall LE. Laws of motion. New York: Rosen;2005.

102. Lees A, Nolan L. The biomechanics of soccer:a review. J Sports Sci 1998;16:211–234.

103. Lees A. Biomechanics applied to soccer skills.2003. pp. 109–119 in Reilly et al. (2003).Q13

104. Grimpampi E, Pasculli A, Sacripanti A.Computational biomechanics, stochasticmotion and team sports. 2008. arxiv.org/pdf/0811.3659

105. Bradley PS, Sheldon W, Wooster B, Olsen P,Boanas P, Krustrup P. High-intensity run-ning in English FA Premier League soccermatches. J Sports Sci 2009;27:159–168.

106. Wright MB. 50 years of OR in Sport. J OperRes Soc 2009;60:S161–S168.

107. The Orange Future of Football Report. 2008.http://www.orange.co.uk/sport/football/pics/3395_1.htm

108. Garner P. The future of football. World Soc-cer; 2009.

109. Archontakis F, Osborne E. Playing it safe?Q14A Fibonacci strategy for soccer betting.J Sports Econ 2007;8:295–308.

110. Baxter M, Stevenson R. Discriminatingbetween the Poisson and negative bino-mial distributions: an application to goalscoring in association football. J Appl Stat1988;15:347–354.

111. Bellos A. Futebol, the Brazilian Way of Life.London: Bloomsbury; 2002.

112. Boeri T, Severgnini B. The Italian job: matchrigging, career concerns and media concen-tration in ‘Series A’. IZA Discussion PaperNo. 3745. 2008.

113. Brimson D. March of the Hooligans: Soccer’sbloody fraternity. London: Virgin; 2007.

114. Brocas I, Carrillo JD. Do the three-point vic-tory and golden goal rules make soccer moreexciting? J Sports Econ 2004;5:169–185.

115. Cantor A, Arcucci D. Goooal: a celebration ofsoccer. New York: Fireside; 1997.

116. Carmichael F, Thomas D, Ward R. Team per-formance: the case of English PremiershipFootball. Manag Dec Econ 2000;21:31–45.

117. Comeron M. The prevention of violence insport. Strasbourg: Council of Europe; 2002.

118. Cordeiro SE. El futbol y la corrupcion.Volume 5. In: Carrion F, editor. Bibliotecadel Futbol Ecuatoriano, 5 Volumes. Quito: Q15Flacso; 2006. pp. 233–250.

119. Crowder M, Dixon M, Ledford A, Robinson M.Dynamic modelling and prediction of EnglishFootball League matches for betting. Statis-tician 2002;51:157–168.

120. David FN. Games, gods and gambling, Lon-don: Griffin; 1962.

121. Dobson S, Goddard J. The economics offootball. Cambridge: Cambridge UniversityPress; 2001.

122. Dobson S, Goddard J. Forecasting scores and Q16results and testing the efficiency of the fixedodds betting market in Scotish League foot-ball. In: Albert J, Koning RH, editors. Statis-tical thinking in sports. New York: Chapmanand Hall; 2008. pp. 91–109.

123. Dyte D, Clarke SR. A ratings based Pois-son model for World Cup soccer simulation.J Oper Res Soc 2000;51:993–998.

124. Eastaway R, Haigh J. Beating the odds: thehidden mathematics of sport, London: Rob-son; 2007.

125. Emonet B. Revisiting statistical aspects ofsoccer, STS Report. Lausanne: Swiss Tech-nical University; 2000.

126. England C. No more budha, only football,London: Sceptre; 2003.

127. Espita-Escuer M, Garcia-Cebrianm LI. Mea-suring the efficiency of Spanish First-Division soccer teams. J Sports Econ2004;04–03:203–215.

128. Fernandez-Cantelli E, Meeden G. Animproved award system for soccer. Chance2002;7:275–278.

Page 13: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 13

SOCCER/WORLD FOOTBALL 13

129. Foer F. How soccer explains the world: anunlikely theory of globalization, New York:Harper Collins; 2004.

130. Giulianotti R. Football: a sociology of theglobal game, Polity: Bodmin; 1999.

131. Giulianotti R, Bonney N, Hepworth M. Foot-ball, violence and social identity, London:Routledge; 1994.

132. Goddard J, Asimakopoulos I. Forecastingfootball results and the efficiency of fixed-odds betting. J Forecast 2004;23:51–66.

133. Holey AJ, Johnson BS. Menaces to manage-ment: a developmental view of British SoccerHooligans, 1961–1986. Sports J 1998;1(1).

134. Hornby N. Fever pitch. Riverhead trade;1992.

135. Hughes M, Bartlett R. The use of perfor-mance indicators in performance analysis.J Sports Sci 2002;20:739–754.

136. Hurley WJ. Special issue on operationsresearch in sport. Comput Oper Res2006;33:1871–1873.

137. Jackson DA. Index betting on sports. Statis-tician 1994;43:309–315.

138. Karlis D, Ntzoufras I. On modelling soccerdata. Student 2000;3:229–244.

139. Kern M, Sussmuth B. Managerial efficiencyin German top league soccer: an econometricanalysis of club performances on and off thepitch. German Econ Rev 2005;6:485–506.

140. Knorr-Held L. Dynamic rating of sportsteams. Statistician 2000;49:261–276.

141. Kristan M, Pers J, Perse M, Kovacic S.Q17Towards fast and efficient methods for track-ing players in sports. In: Pers J, Magee DR,editors. Proceedings of ECCV Workshop onComputer Vision Based Analysis in SportEnvironments. 2006. pp. 14–25.

142. Kuonen D, Roehrl ASA. Was Frances’sWorld Cup win pure chance? Student2000;3:153–166. [ranking, knockout – for-mula].

143. Kuypers T. Information and efficiency: anempirical study of a fixed odds betting mar-ket. Appl Econ 2000;32:1353–1363.

144. Ladany SP, Machol RE, editors. Optimalstrategies in sports. Amsterdam: North-Holland; 1977.

145. Lewis MM. Moneyball: the Art of Winningan Unfair Game. New York: Norton; 2003.

146. Lucena CJP, Noronha S, Urrutia S, RibeiroCC. A multi-agent framework to retrieveand publish information on qualification andelimination data in sports tournaments. Rio

de Janeiro, Brazil: Monografias em Cienciada Computacao; 2006.

147. Machol RE, Ladany SP, Morrison DG.Management science in sports. Amsterdam:North-Holland; 1976.

148. Magnum. Soccer. London: Phaidon; 2002.

149. Mason T. Association football and englishsociety 1863–1915. Brighton: Harvester;1980.

150. McCullough P, Nelder JA. Generalized linearmodels, 2nd ed. London: Chapman and Hall;1989.

151. McGuire EJ, Courneya KS, Widmeyer WN,Carron AV. Aggression as a potentialmediator of the home advantage in pro-fessional hockey. J Sport Exerc Physiol1992;14:148–158.

152. Morris D. The soccer tribe. Jonathan Cape;1981.

153. Needham CJ. Tracking and modelling ofteam game interactions. Ph.D. Thesis, Uni-versity of Leeds; 2003.

154. Osborne MJ. An introduction to gametheory.Oxford: Oxford University Press; 2004.

155. Ribeiro CC, Urrutia S. OR on the ball: appli-cations in sports scheduling and manage-ment. OR/MS Today 2004; (June).

156. Sadovskii AL, Sadovskii LE. Mathematicsand sports. Providence: American Mathe-matical Society; 1993.

157. Simons R. Bamboo goalposts. London:MacMillan; 2008.

158. Sloane PJ. The economics of profes-sional football: the football club as autility maximiser. Scottish J. Polit Econ1971;8:121–146.

159. Sloane PJ editor. The economics of sport.Econ Affairs 1997;17:Special issue.

160. Stefani R. Measurement and interpretation Q18of home advantage. In: Albert J, KoningRH, editors. Statistical thinking in sports.New York: Chapman and Hall; 2008. pp.203–216.

161. Stefani RT. Predicting score difference ver-sus score total in rugby and soccer. IMA JManag Math 2009;20:147–158.

162. SUP AMISCO PRO. http://213.30.139.108/sport-universal/uk/amiscopro.htm Q19

163. Turnbull J, Raab A, Satterlee T. The globalgame: writers on soccer. Bison; 2008.

164. Wright DB. Football standings and measure-ment levels. Statistician 1997;46:105–110.

Page 14: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M. P. 14

14 SOCCER/WORLD FOOTBALL

165. Yue Z, Broich H, Seifritz F, Mester J. Math-ematical analysis of a soccer game. Part 1:individual and collective behaviors. StudiesAppl Math 2008;121:223–243.

166. Yue Z, Broich H, Seifritz F, Mester J. Part 2:energy, spectral and correlation analyses.Studies Appl Math 2008;121:245–261.

Page 15: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M.

Queries in Article eorms0791Q1. The reference ‘‘FIFA (2009) Disciplinary Code” has been cited in this sentence. Please

confirm.

Q2. Please check if the edit made in this sentence retains the intended meaning of thesentence.

Q3. The cross-reference ‘‘chapter” has been changed to ‘‘section”. Please confirm if the changeis ok.

Q4. Please provide the editors names for references 6, 7, and 30.

Q5. Please provide the volume number for reference 22.

Q6. The references ‘‘Ridder et al. (1994)” and ‘‘Albert et al. (2005)” have been clubbed as onereference. Please confirm if it is ok.

Q7. Please provide the page range for references 57,133 and 159.

Q8. Since the chapter title ‘‘Statistical analysis of the effectiveness of the FIFA worldrankings” is part of the book ‘‘Statistical thinking in sports”, we have clubbed thereferences ‘‘Albert and Koning (2008)” and ‘‘McHale and Davies (2008)” as one reference.Please confirm if it is ok.

Q9. The references ‘‘Stern (1992)” and ‘‘Albert et al. (2005)” have been clubbed as onereference. Please confirm if it is ok.

Q10. Please provide volume number and page range for references 70 and 155.

Q11. Since the chapter title ‘‘Soccer violence” is part of the book ‘‘Science and soccer”, we haveclubbed the references ‘‘Peiser and Minten (2003)” and ‘‘Reilly and Williams (2003)” asone reference. Please confirm if it is ok.

Q12. Please provide the place of publication for references 90, 108, 134, 152 and 163.

Q13. Please provide complete details of ‘‘Reilly et al. 2003” in reference 103.

Q14. References ‘‘109 to 166” have not been cited in the text. Please clarify as to where theseshould be cited.

Q15. Since the chapter title ‘‘El futbol y la corrupcion” is part of the book ‘‘Biblioteca delFutbol Ecuatoriano”, we have clubbed the references ‘‘Carrion (2006)” and ‘‘Cordeiro(2006)” as one reference. Please confirm if it is ok.

Q16. Since the chapter title ‘‘Forecasting scores and results and testing the efficiency of thefixed odds betting market in Scotish League football” is part of the book ‘‘Statisticalthinking in sports”, we have clubbed the references ‘‘Albert and Koning (2008)” and‘‘Dobson and Goddard (2008)” as one reference. Please confirm if it is ok.

Q17. Please provide complete publication details for the reference (141).

Q18. Since the chapter title ‘‘Measurement and interpretation of home advantage” is part ofthe book ‘‘Statistical thinking in sports”, we have clubbed the references ‘‘Albert andKoning (2008)” and ‘‘Stefani (2008)” as one reference. Please confirm if it is ok.

Q19. Please provide the access year for reference 162.

Q20. Please provide abstract for this article.

Page 16: SOCCER/WORLD FOOTBALL DATA COLLECTION AND … · SOCCER/WORLD FOOTBALL DAVID R. BRILLINGER Statistics Department, University of California, Berkeley, California Soccer/world football,

UNCORRECTED PROOFS

Cochran eorms0791.tex V1 - April 29, 2010 3:56 P.M.

Please note that the abstract and keywords will not be included in the printed book,but are required for the online presentation of this book which will be published onWiley’s own online publishing platform.

If the abstract and keywords are not present below, please take this opportunity toadd them now.The abstract should be a short paragraph upto 200 words in length and keywordsbetween 5 to 10 words.

Abstract:

Keywords: association football; efficiency; FIFA; game theory; laws of the game; match anal-

Q20

ysis; notation analysis; performance; ranking; sports science; spatial-temporal data; tactics;time series