Comments on WannabellTypePad2008-07-25T07:10:31Zjunkchartshttps://junkcharts.typepad.com/junk_charts/tag:typepad.com,2003:https://junkcharts.typepad.com/junk_charts/2008/07/wannabell/comments/atom.xml/bookworm commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553e7a89a88332008-08-15T16:09:54Z2008-08-15T16:09:56ZbookwormI agree with the EVD comment. It looks like a classic Fisher-Tippet distribution.<p>I agree with the EVD comment. It looks like a classic Fisher-Tippet distribution. </p>Alex commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553d8386088342008-07-27T05:15:57Z2008-07-27T05:15:57ZAlexhttp://www.awblocker.comThat would be a very odd sounding bell indeed. I think that it would have been more informative to a...<p>That would be a very odd sounding bell indeed. I think that it would have been more informative to a least include a line showing the median. On a more technical note, I would wager that this could actually be approximated well as a mixture two or three Poisson distributions.</p>anon commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553b907dd88332008-07-25T19:57:52Z2008-07-25T19:57:53ZanonFolks, cut the title some slack. It says "bell curve", not "normal distribution", and I think that term is perfectly...<p>Folks, cut the title some slack. It says "bell curve", not "normal distribution", and I think that term is perfectly appropriate for informal usage. (I mean, it does look like a bell!) The difference between a true normal distribution and this curve is mildly interesting, but does not affect the point of the story.</p>
<p>For the people advocating a cumulative distribution chart: I think you'd find that only about 1% of the readers of that article would understand that diagram. A cumulative distribution graph is highly unfamiliar to most people--and if you diaagree, I challenge you to find even one example of such a chart in a mainstream press article.</p>Tony K commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553b8e07e88332008-07-25T18:47:47Z2008-07-25T18:47:47ZTony Khttp://www.emotionsforengineers.comWe can all speculateon what the distribution of the data is relative to standard algorithmic distributions. But the only thing...<p>We can all speculateon what the distribution of the data is relative to standard algorithmic distributions. But the only thing we will agree on without data is that it is not a normal distribution.</p>
<p>In any case, I agree with the folks who say that a cumulative distribution would be much more useful in this case.</p>
<p>Cheers,<br />
T</p>numen commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553b8a07d88332008-07-25T17:12:02Z2008-07-25T17:12:03ZnumenLognormal curve...logarithms of data are normally distributed... Standard form of normal distribution when range is from zero to infinity, instead...<p>Lognormal curve...logarithms of data are normally distributed...</p>
<p>Standard form of normal distribution when range is from zero to infinity, instead of negative to positive infinity.</p>Aaron commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553d50c1888342008-07-25T17:10:09Z2008-07-25T17:10:09ZAaronIt looks a bit like an extreme-value distribution (EVD), which is a distribution that looks like Normal, but with an...<p>It looks a bit like an extreme-value distribution (EVD), which is a distribution that looks like Normal, but with an asymmetrically "fat" tail on one side or the other. Given the what the numbers represent (the total/max number of trips taken on a given card), an EVD might make perfect sense.</p>derek commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553d4ea5388342008-07-25T15:58:29Z2008-07-25T15:58:29ZderekOops, I just wrote exactly what Kaiser already did in the article! So much for my reading comprehension.<p>Oops, I just wrote exactly what Kaiser already did in the article! So much for my reading comprehension. </p>derek commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553b861a488332008-07-25T15:42:34Z2008-07-25T15:42:35ZderekIt should have been a cumulative curve of percentage against trips. The median, quartiles, and percentiles could be read off...<p>It should have been a cumulative curve of percentage against trips. The median, quartiles, and percentiles could be read off by anyone who cared to, or alternatively the percentage of users who took up to 40, up to 100 trips etc. By reading the percentage scale backwards from 100% the reader could say who took more than 40, more than 100 etc. Who's really interested in who took exactly 40 trips, no more and no less, or exactly 100? </p>
<p>The mean of 56 trips could be marked, as it would not be trivial to read it by inspection alone, and the total number of users represented by the "100%" would be given in nearby explanatory text. Multiplying the read-off percentages by the total number would give anyone inspecting the graph the absolute number of users taking more than 40 trips and so on. </p>kris commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553d45b6c88342008-07-25T11:55:59Z2008-07-25T11:55:59Zkrishttp://krist0ph3r.blogspot.comlooking at the long smooth stretches interspersed with tiny jagged sections, the distribution of these jagged sections, combined with the...<p>looking at the long smooth stretches interspersed with tiny jagged sections, the distribution of these jagged sections, combined with the smoothness of the tails, it looks like something someone drew in photoshop...but i guess i could be wrong.</p>
<p>it makes it's point, though.</p>Smith commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553d4345a88342008-07-25T09:55:56Z2008-07-25T09:55:56ZSmithIn addition to the 40-trip mark noted on the graph, a better x-axis, and a right-hand vertical axis which has...<p>In addition to the 40-trip mark noted on the graph, a better x-axis, and a right-hand vertical axis which has been normalized, I'd like to know the total number of riders below the 40-trip mark and the total above.</p>
<p>As for the technical note, I think the curve is real data. There are about a million data points in the graph, so I can believe that it ends up being fairly smooth.</p>Smith commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553d432f088342008-07-25T09:50:09Z2008-07-25T09:50:09ZSmithIt looks to me like an exponential drop off to the left side of the break-even 40-trip mark. Then the...<p>It looks to me like an exponential drop off to the left side of the break-even 40-trip mark. Then the right hand side looks like it could be fit by a piece of a Gaussian.</p>
<p>To me, this says that people are pretty good at deciding whether or not to buy the 30-day card. Although, I think some credit should go to NYC in selling the right product (30-days = about 20 work days = 40 one-way trips to and from work) for a price that is easily computed in your head ($81/$2 = about 40).</p>Ken commented on 'Wannabell'tag:typepad.com,2003:6a00d8341e992c53ef00e553b7ab5488332008-07-25T08:03:10Z2008-07-25T08:03:10ZKenVariability in the numbers will be low due to the high counts so it wouldn't have needed much if any...<p>Variability in the numbers will be low due to the high counts so it wouldn't have needed much if any smoothing. Even if the tails the counts are probably about 400 so 95% CI are about 10%, hardly noticeable and the graphic artist probably just smoothed it out.</p>
<p>Unless there is another option many people may feel it is worthwhile to buy a ticket for less than 40 trips simply to be able to buy only one ticket. I've done a similar thing with day tickets in non-English speaking countries to avoid the problems due to not understanding currency. Others may have a ticket supplied by an employer and there are always sudden holidays and illness that prevent fully using a ticket.</p>