



I don't mind that they are using observational data, as it would be almost impossible to run a trial. Unfortunately, it all goes downhill from there. It wouldn't be surprising if coffee had an effect on cardiovascular mortality, as caffeine is a stimulant, but they didn't find that. They found an effect on all-cause mortality, and then didn't look at which particular causes of death were behind it.

From the paper, there are obvious differences between the coffee groups, which suggests there are probably employment, educational and lifestyle differences that they don't have as covariates. Several of their covariates aren't very accurate: physical activity, alcohol consumption and smoking deserve more than a binary. Year of entry to the study can also be an important confounder. There are probably other things. I dislike categorising things unless necessary, so using actual cups of coffee as a covariate would be my preference, although lots of medical journals seem to be OK with the idea that at 28 cups per week people suddenly start dying.

One thing that will amuse anyone with a good knowledge of survival analysis is "proportional hazards assumption was tested by Martingale-based residuals". I hope not.

Jon Peltier

My own quick and admittedly dirty analysis:
I can draw a single horizontal line that passes through all ten sets of error bars in that chart.
This tells me the effect isn't very strong or particularly significant, despite any patterns I may think I see.
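
The horizontal-line check described above can be sketched in a few lines of Python. This is only an illustration: the function name `common_horizontal_band` and the interval values are hypothetical and are not taken from the paper's chart. A horizontal line crosses every error bar exactly when the largest lower bound is no greater than the smallest upper bound. Note that this is a rough eyeball check, not a formal significance test.

```python
from typing import List, Optional, Tuple

Interval = Tuple[float, float]  # (lower bound, upper bound) of one error bar


def common_horizontal_band(intervals: List[Interval]) -> Optional[Interval]:
    """Return the range of heights at which a horizontal line crosses
    every error bar, or None if no such line exists."""
    if not intervals:
        raise ValueError("need at least one interval")
    lowest_allowed = max(lower for lower, _ in intervals)
    highest_allowed = min(upper for _, upper in intervals)
    if lowest_allowed <= highest_allowed:
        return (lowest_allowed, highest_allowed)
    return None


# Hypothetical error bars, made up for illustration (NOT the study's numbers).
example_bars = [(0.85, 1.20), (0.90, 1.35), (0.95, 1.50), (1.00, 1.60)]

band = common_horizontal_band(example_bars)
if band is None:
    print("No single horizontal line passes through all error bars.")
else:
    print(f"Any horizontal line between {band[0]} and {band[1]} crosses every bar.")
```

With the real interval endpoints read off the chart, a non-empty band would match the observation that one flat line fits inside all the error bars.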


I currently drink about 11 cups of coffee a week. I should increase this to 18 cups if I want to live!


