« More poll fodder | Main | Moviepass stock plunges »


Feed You can follow this conversation by subscribing to the comment feed for this post.

Thomas Ball

Wish I had known of this event! But for all the bandwagon hype, smoke and mirrors was there any discussion -- beyond bias -- of the methodological limits of AI, DL, NNs, etc.?

Antonio Rinaldi

I remember a tweet about the double (un)known classification. Data is the known known. Sampling error is an example of known unknown. The bias speakers deal on is the known unknown. But there is also another bias, the unknown unknown, that cannot be repaired. The first one cannot be repaired only when assuming politically correctness as a dogma.


AR: Thanks for the add. I recall that comment. I think that quote is only approximately useful. "Data is the known known" doesn't cut it for me as it does not deal with fake data, bad data, missing data, etc. Sampling error is not a known unknown; it is a measure of variability caused by random sampling, which does not apply to "data exhaust" type of data.

I recommend people read Chapter 3 of my book Numbers Rule Your World - the section about dealing with racial bias in SAT testing - to understand what the issue of "bias" is about. Also, the Prologue of Numbersense also deals with this issue.

TB: That meeting is not the place to go to hear about limitations. Under the "Real Challenges" section, I collected the several episodes in which someone pointed out limitations. It does appear that the one problem acknowledged by all is "bias". The general mood in the room is that there is an engineering solution to all problems.

The comments to this entry are closed.


Link to Principal Analytics Prep

See our curriculum, instructors. Apply.
Kaiser Fung. Business analytics and data visualization expert. Author and Speaker. Founder of Principal Analytics Prep, MS Applied Analytics at Columbia. See my website.

Next Events

Oct: 31 Webinar on Data Visualization, online at JMP

Nov: 1 NYU unCOMMON Salon Public Lecture, New York, NY

Nov: 8 Tufts Gordon Institute: A Conversation with Kaiser Fung, Facebook Live

Nov: 8 Tufts TGI Careers & Networking Night panel, Somerville, MA

Nov: 26 Data Visualization New York Meetup, New York, NY

Nov: 27 NYPL Data Analytics Resume Workshop, New York, NY

Nov: 30 Purdue School of Engineering Seminar, West Lafayette, IN

Dec: 1 Purdue Mathematics, Data Science, and Industry Conference, West Lafayette, IN

Past Events

See here

Future Courses (New York)

Summer: Statistical Reasoning & Numbersense, Principal Analytics Prep (4 weeks)

Summer: Applied Analytics Frameworks & Methods, Columbia (6 weeks)

Junk Charts Blog

Link to junkcharts

Graphics design by Amanda Lee


  • only in Big Data