



I have some data that was previously analysed with a clustering algorithm. I suspect that one of the resulting classes consists of subjects with data entry errors. Unfortunately, the data is from 40 years ago, so it is impossible to check. A more robust clustering method eliminates that cluster, so I expect that I am right.


Marketing and advertising analytics expert. Author and Speaker. Currently at Vimeo and NYU. See my full bio.



Graphics design by Amanda Lee

