This may be cynical, but it seems true that Google, Facebook, etc. knew about this and did nothing because it didn't reduce their bottom line.

Jason Borg

Cheers for the read! In my job, I've noticed some telltale signs of fake data (a heap of traffic lasting for a second, with a high bounce rate, coming from a specific hosting service or location at a given time). Is there anything more I should be looking out for?


JB: I should invite Augustine over to talk about this since he's the expert. Some of the things he mentioned are websites or apps that have machine-generated names, ads that have 100% click-through rates, and segments that have almost the same proportions (indicating that some random number generation is deployed!). There is also "ghost traffic", but that is a different matter.
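
For readers who want to try these checks on their own data, here is a minimal sketch in Python. It is not Augustine's method, only one way to encode three of the signs mentioned in this thread: click-through rates near 100%, very short sessions with a high bounce rate, and segments with almost identical shares. The `SourceStats` record, the `fraud_flags` function, and every threshold are hypothetical, and "almost the same proportions" is read here as near-equal shares across segments, which is an assumption.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class SourceStats:
    """Aggregated stats for one traffic source (hypothetical layout)."""
    name: str
    impressions: int
    clicks: int
    sessions: int
    bounces: int
    avg_session_seconds: float
    segment_counts: Dict[str, int]  # e.g. visits per age band


def fraud_flags(
    s: SourceStats,
    ctr_limit: float = 0.95,      # assumed cutoff for "close to 100%" CTR
    bounce_limit: float = 0.95,   # assumed cutoff for "high" bounce rate
    min_seconds: float = 2.0,     # assumed cutoff for "lasting a second"
    spread_limit: float = 0.02,   # assumed tolerance for near-equal shares
) -> List[str]:
    """Return the names of the heuristics that this source trips."""
    flags = []

    # Ads with a click-through rate near 100%.
    if s.impressions > 0 and s.clicks / s.impressions >= ctr_limit:
        flags.append("ctr_near_100_percent")

    # Traffic that bounces almost always and lasts about a second.
    if (
        s.sessions > 0
        and s.bounces / s.sessions >= bounce_limit
        and s.avg_session_seconds <= min_seconds
    ):
        flags.append("short_bouncy_sessions")

    # Segment shares that are almost identical to one another.
    total = sum(s.segment_counts.values())
    if total > 0 and len(s.segment_counts) > 1:
        shares = [count / total for count in s.segment_counts.values()]
        if max(shares) - min(shares) <= spread_limit:
            flags.append("near_uniform_segments")

    return flags
```

A flag is a reason to look closer, not proof of fraud; the cutoffs would need tuning against traffic you already trust.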

Kaiser Fung. Business analytics and data visualization expert. Author and Speaker.