« Losing the tune | Main | Dizzy display »



A common logarithmic scale would have been a better choice. The decrease in David and Mary is still obvious, and the scale better reflects the importance of a change.

Jorge Camoes

For many, charts are the "lighter side" of data analysis. Thanks God for that, because statistics are so boring... That's why they can't resist to have some fun, even at the cost of a less (and less and less) clear message.


as ET says...if your numbers are boring, you've got the wrong numbers.


I wonder how they'd look as sparklines? (apologies for the crude simulation of actual sparklines)

I realise that sparklines are not well equipped to deal with logarithmic scales.

Jon Peltier

If you presented a lengthy table of names (so more people could find their own name, of course), rather than just a few, you would need sparklines. This would be a two-word display for each name: the name and the sparkline.


When I saw this in the original WSJ article, I thought it made the point well, and I still do. Logs would be confusing to the audience and the mindset of those reading a "baby name" article, even in the WSJ.

The graphs don't really need to be aligned to make the point in this case.

Full disclosure: My old grad school friend, Cleveland Kent Evans, is quoted in this article. Cleve's one of the new people named after two cities in Ohio. There's no sign of this becoming a trend.


I don't have a problem with the reciprocal scale used, but I do have a problem with the filled area under the curve. That area is meaningless, and the curve should have been a line instead.


I disagree. The area under the curve is arguably a measure of the name's overall popularity. Given a series of these graphs at the same scale (like the first four here), a comparison of those areas offers a quick way to guage relative popularities (for instance, there's a lot more red for "Nicole" than for "Farrah"). Though I can't prove it, I would imagine having that area filled in (against a background color) allows our visual system to do the "integration" quicker.

Again, I think this would be useful when eyeballing a series of these graphs, such as the table suggested above. At worst, at adds nothing (if you don't care about "overall popularity"). At best, it facilitates the perception of another related metric.


But wait, the area together with the reverse rank is dangerous! You can arbitrary make the area as large as you want by picking an arbitrarily large end-point for the rank scale. In the case of David, one could choose a y-axis of 1 to 200 for example.

Andrew Gelman

I'd prefer plotting proportion of total names (as is done on that baby names website), rather than ranks.

Ally sal

I think an illustration with more names would make more of a point. This only shows that Mary and David are getting uncommon. Oh and statistics get so boring!

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.


Post a comment

Your Information

(Name is required. Email address will not be displayed with the comment.)


Link to Principal Analytics Prep

See our curriculum, instructors. Apply.
Marketing analytics and data visualization expert. Author and Speaker. Currently at Columbia. See my full bio.

Book Blog

Link to junkcharts

Graphics design by Amanda Lee

The Read

Good Books

Keep in Touch

follow me on Twitter