
Comments


Bretwood Higman

I was surprised you didn't include a discussion of proportional change as opposed to absolute change here. Both types of change can be important. If you are looking to measure the human toll of the virus, the absolute change (e.g. 651 deaths in Italy yesterday) should reasonably be what you want to model most accurately - the difference between 10,000 and 12,000 is more important than the difference between 100 and 120. But if you want to understand the nature of the system, proportional change may be a more useful way of looking at the data. Due to the nature of disease spread, it is reasonable to conceptualize the system as one where exponential growth is expected, and deviation from that exponential growth might reasonably be interpreted as something meaningful - a model/observation mismatch of 20% is similarly important, whether that's 100 vs. 120 or 10,000 vs. 12,000.
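
To make the distinction concrete, here is a minimal sketch using the two pairs of numbers from the comment above. It assumes the smaller number in each pair is the model value and the larger is the observation (the comment does not say which is which), and the function names are illustrative only.

```python
import math

def absolute_gap(model, observed):
    """Difference in raw counts (e.g. deaths)."""
    return observed - model

def proportional_gap(model, observed):
    """Difference as a fraction of the model value."""
    return (observed - model) / model

def log_gap(model, observed):
    """Difference on a log scale; equal ratios give equal gaps."""
    return math.log(observed / model)

# Pairs taken from the comment; which value is "model" is an assumption.
pairs = [(100, 120), (10_000, 12_000)]

for model, observed in pairs:
    print(
        f"model={model:>6}  observed={observed:>6}  "
        f"absolute={absolute_gap(model, observed):>5}  "
        f"proportional={proportional_gap(model, observed):.0%}  "
        f"log={log_gap(model, observed):.4f}"
    )
```

The absolute gap differs by a factor of 100 between the two pairs, while the proportional gap and the log-scale gap are the same for both. That is why a log scale suits questions about the growth process and a linear scale suits questions about the toll.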

Kaiser

BH: In the last part, you anticipated a post that currently sits in my head. It will probably appear this week or next. When there is a discrepancy between a model and the observed data, the modeler has to make a judgment call: how much of the gap is due to a mis-specified model and how much of it is due to poorly measured data? It can be some of each. Nevertheless, it's important to recognize that the exponential curve is an analytical solution to a theoretical setup so there is some basis for it to be "true".
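
For readers wondering what "analytical solution to a theoretical setup" could look like, here is one simple possibility. It assumes the most basic setup, in which the number of new cases per unit time is proportional to the current number of cases; the comment does not name a specific model, and the symbols $I(t)$, $I_0$ and $r$ are introduced here for illustration.

```latex
\frac{dI}{dt} = r\,I(t), \quad I(0) = I_0
\;\;\Longrightarrow\;\;
I(t) = I_0\,e^{rt},
\qquad
\log I(t) = \log I_0 + r\,t .
```

Under this assumption the log of the case count is a straight line in time, so a constant proportional mismatch between model and data shows up as a constant vertical offset on a log scale.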


Kaiser Fung. Business analytics and data visualization expert. Author and Speaker.