« Round up of book news | Main | What is Numbersense about? »


Feed You can follow this conversation by subscribing to the comment feed for this post.


I don't mind that they are using observational data, as it would be almost impossible to run a trial. Unfortunately it all goes downhill from there. It wouldn't be surprising if coffee had an effect on cardiovascular mortality, as caffeine is a stimulant, but they didn't find that, they found an effect for all cause mortality, but then they didn't look at what particular form of mortality.

From the paper there are obvious differences in the coffee groups, which indicates that there are probably employment, educational and lifestyle differences, which they don't have as covariates. Several of their covariates aren't very accurate. Physical activity, alcohol consumption and smoking deserve more than a binary. Year of entry to the study can also be an important confounder. There are probably other things. I dislike categorising things unless necessary, so using actual cups of coffee as a covariate would be my preference, although lots of medical journals seem to be OK with the idea that at 28 cups per week people suddenly start dying.

One thing that will amuse anyone with a good knowledge of survival analysis is "proportional
hazards assumption was tested by Martingale-based residuals". I hope not.

Jon Peltier

My own quick and admittedly dirty analysis:
I can draw a single horizontal line that passes through all ten sets of error bars in that chart.
This tells me the effect isn't very strong or particularly significant, despite any patterns I may think I see.


I currently drink about 11 cups of coffee a week. I should increase this to 18 cups if I want to live!

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.


Post a comment

Your Information

(Name is required. Email address will not be displayed with the comment.)


Link to Principal Analytics Prep

See our curriculum, instructors. Apply.
Business analytics and data visualization expert. Author and Speaker. Founder of Principal Analytics Prep, MS Applied Analytics at Columbia. See my full bio.

Next Events

May: 2 New York Marketing Association Big Data Workshop, NYC

May: 5 NYPL Analytics Careers Talk, NYC

May: 8 Data Visualization Seminar, Denver, CO

May: 15 Data Visualization Seminar, Cambridge, MA

May: 17 Data Visualization Seminar, Philadelphia, PA

May: 22 Data Visualization Seminar, San Ramon, CA

Past Events

See here

Future Courses (New York)

Summer: Statistical Reasoning & Numbersense, Principal Analytics Prep (4 weeks)

Summer: Applied Analytics Frameworks & Methods, Columbia (6 weeks)

Junk Charts Blog

Link to junkcharts

Graphics design by Amanda Lee


  • only in Big Data