« Mid-week entertainment: Pity grapefruit | Main | Lunar eclipse »

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d8341e992c53ef00e55097306c8833

Listed below are links to weblogs that reference Chart cleanup:

Comments

Chris P

I would think the the ranks are somewhat arbitrary--why would all three commuting and transportation elements be together? In the second graph, it is a little hard to decode which line goes with which label. If the categories were ordered by some rank, then you might have blank lines to group them together better or maybe every fourth or sixth line could be half height and blank.

Rosie Redfield

Ranks against what? Other big cities? (It's hard to imagine a competition that would have Boston ranking first for local food and agriculture.)

derek

The vertical ticks would be a good choice if they were in a crowd of ticks, such that a fine line was needed to avoid frequent overlapping. But where each data symbol is guaranteed a row all to itself, I think it can be allowed to spread in two dimensions for visibility. A diamond shape would ensure that the exact position of the symbol was still established, without the danger of invisibility represented by the thin line.

ZBicyclist

I searched in vain for more info on the methodology used, but the links seemed to lead back to the Sustainlane ad network site.

I'm not sure suboptimal graphics are the main problem with this "ranking". Looks like GIGO.

derek

Sustainlane's account of its methodology is here, but it just amounts to "we did stuff, and trust us, it was all straight". It's lacking a table of the actual inputs that informed the rankings.

Someone with the patience to gather the rankings for all 50 cities could construct a parallel coordinates graph, and perhaps also use the colored labels to mimic a Spotfire-style "brushing" technique.

But, you know, rankings and not quantities. It takes all the interest out of such a project. Rankings are teh suck.

Zuil Serip

There are two separate issues here. Regarding the first - the arbitrary and fuzzy nature of the data gathering methodology - I am in full agreement with previous comments. The value of the data itself seems quite suspect.

For the sake of argument, however, let's pretend the data is valid and interesting. I am more interested in the question of how to best represent data of this type. And here I find myself in slight disagreement with the post.

The revised, junk chart version, is clearly better, but I am not convinced that 'redundancy' is always bad in graphic representation (pace Tufte). For example, I'd like to be able to glance at the labels and quickly identify the couple of categories that are in the 'danger' zone without having to take the extra step of looking at each respective bar.

Also, I'd like to be able to get the value for any category without needing to look down at the horizontal axis - I like having the actual value label by the bar even if technically it is redundant with the length of the bar.

The bar lengths and the bar labels serve two very different purposes: The length allows me to intuitively process relative sizes (I can immediately perceive "roughly twice as much as" by looking at the relative lengths, comparing '42' and '17' requires that my conscious mind get involved.) At the same time, once I've intuitively absorbed the big picture, I'd like to be able to drill down and look at a few select individual components. For that, I need the exact labels.

Here is a quick attempt at trying to achieve both of these goals:
http://www.flickr.com/photos/24579696@N05/2328117979/

curioser

I find it really difficult to track which tick marks go with which category in the revised chart.

Zuil Serip

Better link to chart mentioned above:
http://farm4.static.flickr.com/3263/2329410262_7726457637.jpg

derek

I think Kaiser was right not to re-order the categories, since the Boston table is just one of 50 cities, and they won't all be in the same order.

I'd also like to see the bars be longest for the categories where Boston is ranked #1 and shortest where the city is ranked worst.

Gary Klass

Your chart is a little better, but the whole thing would be much better if the data were sorted.

-- Gary Klass

http://lilt.ilstu.edu/jpda/

Kaiser

Good to see all the comments. It's clear everyone has opinions on how the ratings should have been done.

I'd echo Derek's comment, which is to bear in mind the bigger picture... that this one chart needs to be considered as part of a "small multiples" layout of 50 such charts. This challenges the usual advice of sorting, color labels, etc.

Georgia Sam

IMHO, you should not encourage people to "analyze" data by hacking interval-scaled measures up into categories & sticking arbitrary labels on the categories. Just show the rankings & let them speak for themselves.

Kaiser

Georgia Sam: Point taken. Thanks for bringing it up. By converting the original scale to ranks, the differences between cities have already been eliminated, and now to assign bins based on ranks just makes it worse.

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Marketing analytics and data visualization expert. Author and Speaker. Currently at Vimeo and NYU. See my full bio.

Book Blog



Link to junkcharts

Graphics design by Amanda Lee

The Read



Good Books

Keep in Touch

follow me on Twitter

Residues