« This headline writer has the secret to everlasting life | Main | Counting is hard, especially when you don't have theories »


Feed You can follow this conversation by subscribing to the comment feed for this post.

Chris K

I know this is all for fun, but let's remember that for a given movie there are actually multiple posters. There are pre-release "teaser" posters as well as different versions for different markets.


Chris: you beat me to it. i have a comment on this in my next post.


This is a good example of how it is essential to understand the data. I remember a colleagues data from some years ago, a time series of pollution data. When plotted there was a consistent pattern of missing data, which resulted from them removing observations which were outside the calibration range of the sensor. One way of getting the average down.


Isn't this true of much of the data that would be used in climate models ? Historical data either wasn't measured(if you go back far enough) or was measured very differently(for more recent data). I know you can use ice cores and tree rings, etc to get a picture of what the climate was like before we started keeping records, but that is all computed or modeled data, right ? How have climate scientists addressed this? Have they used ice cores and tree rings from the last 30 years to calibrate their models of how that data corresponds to global temperature? No matter how carefully they have, the resulting historical data can't be anything other than an estimation, a range of what the temperature might have been? So, then when you try to build a climate model using that data do you effectively randomly sample your confidence interval of what the temps were at any particular point in time? Or do they just take the mean or median value of the interval and use that as THE value ? Just wondering....

The comments to this entry are closed.

Get new posts by email:
Kaiser Fung. Business analytics and data visualization expert. Author and Speaker.
Visit my website. Follow my Twitter. See my articles at Daily Beast, 538, HBR, Wired.

See my Youtube and Flickr.


  • only in Big Data
Numbers Rule Your World:
Amazon - Barnes&Noble

Amazon - Barnes&Noble

Junk Charts Blog

Link to junkcharts

Graphics design by Amanda Lee

Next Events

Jan: 10 NYPL Data Science Careers Talk, New York, NY

Past Events

Aug: 15 NYPL Analytics Resume Review Workshop, New York, NY

Apr: 2 Data Visualization Seminar, Pasadena, CA

Mar: 30 ASA DataFest, New York, NY

See more here

Principal Analytics Prep

Link to Principal Analytics Prep