« Typos show the value of statistics | Main | Small numbers and scams »


Feed You can follow this conversation by subscribing to the comment feed for this post.


Was this for me? I seem to remember requesting a statistician's view on the likely World Cup winner.

Melih Onvural

I think the approach ought to be:

  1. Find a set of statistics about soccer
  2. Look at what they seem to tell you blind
  3. Extrapolate whether they in turn describe a given style

The one major mind shift I would make here, is to not look at the probability of a team to beat another team, but instead look at the probability of a team to execute against a given game plan. Treat it like a chess match where a perfect game by white should always lead to victory. The statistic should show us which team is more likely to approach the perfect game. From there, maybe we could extrapolate the most likely winner

Tom Hopper

There are a two general tactics that are very effective in soccer: moving the ball by passing and challenging possession of the ball. Passing is important because it moves the ball up the field faster than players can run; players can always beat another player, but they can never beat the ball. Challenging is important because it doesn't give the other team time to set up or think; you force your opponent into sub-optimal decisions.

For forwards and mid-fielders, attempts on goal is also important; the more attempts you make, the more you'll score.

Since, as you note, there are very few opportunities to collect stats on the national teams, it seems that we might do well to collect stats on individual players and then aggregate them to compare teams.

Some naive possibilities for player-level stats: passes complete per game; passes received per game; attempts on goal per game; fraction of goals to attempts. Measuring "challenges" at the player level seems more difficult. Perhaps average distance and speed while defending. Maybe also average duration of possession, with less being better.

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.


Post a comment

Marketing and advertising analytics expert. Author and Speaker. Currently at Vimeo and NYU. See my full bio.

Next Events

Mar: 26 Agilone Webinar "How to Build Data Driven Marketing Teams"

Apr: 4 Analytically Speaking Webcast, by JMP, with Alberto Cairo

May: 19-21 Midwest Biopharmaceutical Statistics Workshop, Muncie, IN

May: 25-28 Statistical Society of Canada Conference, Toronto

June: 16-19 Predictive Analytics World (Keynote), Chicago

Past Events

Feb: 27 Data-Driven Marketing Summit by Agilone, San Francisco

Dec: 12 Brand Innovators Big Data Event

Nov: 20 NC State Invited Big Data Seminar

Nov 5: Social Media Today Webinar

Nov: 1 LISA Conference

Oct: 29 NYU Coles Science Center

Oct: 9 Princeton Tech Meetup

Oct: 8 NYU Bookstore, NYC


Jul: 30 BIG Frontier, Chicago

May: 30 Book Expo, NYC

Apr: 4 New York Public Library Labs and Leaders in Software and Art Data Viz Panel, NYC

Mar: 22 INFORMS NY Student-Practitioner Forum on Analytics, NYC

Oct: 19 Predictive Analytics World, NYC

Jul: 30 JSM, Miami

Junk Charts Blog

Link to junkcharts

Graphics design by Amanda Lee


  • only in Big Data