

Comments

josh

I have never really understood tag clouds. To me it seems like a pretty ineffectual way of presenting the data. Why do tag clouds sort alphabetically? If the concept is to highlight the distribution of various tags, why is alphabetic order important?

And with the use of font size as an indicator of frequency, the issue of linear size versus area size crops up - not to mention the mess it makes of text alignment.
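The linear-versus-area issue can be made concrete with a small sketch. It assumes that a word's rendered area grows roughly with the square of its font size, so scaling the size by the square root of the count keeps area roughly proportional to frequency. The function name `font_size` and the 40 pt maximum are illustrative choices, not taken from any tag cloud tool.

```python
import math

def font_size(count, max_count, max_pt=40.0, by_area=True):
    """Font size in points for a tag that occurs `count` times.

    by_area=False: size is proportional to count, so the glyph area
                   grows with the square of the count.
    by_area=True:  size is proportional to sqrt(count), so the glyph
                   area grows roughly in proportion to the count.
    Word length also changes the area; this sketch ignores it.
    """
    share = count / max_count
    if by_area:
        share = math.sqrt(share)
    return max_pt * share

# A word used a quarter as often as the top word.
linear = font_size(25, 100, by_area=False)
area = font_size(25, 100, by_area=True)
print(linear, area)
```

Under linear scaling the less frequent word gets a quarter of the height and therefore about a sixteenth of the area, which understates it; square-root scaling gives it half the height and about a quarter of the area.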

What is wrong with a simple list of words, sorted by frequency, with a bar chart? The same information, easily parsed, and importantly, all the words are then easily read.
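A frequency-sorted list with bars takes only a few lines to produce. The sketch below is a minimal text-only version; the tokenizing rule, the sample text, and the name `frequency_bars` are assumptions made for illustration.

```python
import re
from collections import Counter

def frequency_bars(text, top=5, width=20):
    """Return 'word count bar' lines, most frequent word first."""
    # Crude tokenizer: lowercase runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words).most_common(top)
    if not counts:
        return []
    longest = max(len(word) for word, _ in counts)
    peak = counts[0][1]
    lines = []
    for word, n in counts:
        # Bar length is proportional to the count; at least one mark.
        bar = "#" * max(1, round(n / peak * width))
        lines.append(f"{word.ljust(longest)} {n:>3} {bar}")
    return lines

sample = "tax jobs tax war jobs tax health"
for line in frequency_bars(sample):
    print(line)
```

Every word is set at the same size and left-aligned, so the bar length alone carries the frequency.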

derek

josh, alphabetical order does do one thing: it puts words with a similar root next to each other, so TEACHER and _teachers_ are adjacent instead of at opposite ends of the list, as they could be if sorted by frequency.

I sort of agree that alphabetical order isn't great, but then, what opportunity for portraying a second dimension of info are they precluding? None that I can see.

Now, what would really make your suggestion fly, IMHO, would be if next to each word was a sparkline charting instances of that word through the minutes of the speech. That would add a true extra dimension, that of time, as the list showed the words appearing earlier or later in the speech.
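As a rough sketch of the sparkline idea, the code below bins the mentions of one word by minute and draws the counts with Unicode block characters. It assumes the minute of each mention is already known; the names `per_minute` and `sparkline` and the sample data are hypothetical.

```python
BLOCKS = "▁▂▃▄▅▆▇█"

def per_minute(mention_minutes, total_minutes):
    """Count mentions in each minute of the speech (0-based minutes)."""
    counts = [0] * total_minutes
    for minute in mention_minutes:
        counts[minute] += 1
    return counts

def sparkline(counts):
    """Map each count to one of eight block heights, scaled to the peak."""
    peak = max(counts, default=0)
    if peak == 0:
        return BLOCKS[0] * len(counts)
    top = len(BLOCKS) - 1
    return "".join(BLOCKS[c * top // peak] for c in counts)

# Hypothetical data: the word is said twice in minute 0 and once in minute 3.
print("tax", sparkline(per_minute([0, 0, 3], total_minutes=4)))
```

Each sparkline here is scaled to its own peak, so it shows when a word was used rather than how often; the count beside the word still carries the total.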

Even better, if it was a true interactive debate, would be to present the lists for each candidate alongside each other (now arranged in order of total occurrences for all candidates, rather than for that candidate). Reading horizontally, the sparklines of one word for each candidate would show who first used the word early on, and who picked it up and ran with it.
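The shared ordering can be sketched as follows: sum each word's count over all candidates, sort by that total, and emit one row per word with a column per candidate. The candidate labels and counts are made up, and `side_by_side` is an illustrative name.

```python
from collections import Counter

def side_by_side(counts_by_candidate):
    """Rows of (word, count for each candidate), ordered by combined total."""
    total = Counter()
    for counts in counts_by_candidate.values():
        total.update(counts)  # adds this candidate's counts to the totals
    names = list(counts_by_candidate)
    rows = []
    for word, _ in total.most_common():
        per_candidate = [counts_by_candidate[name].get(word, 0) for name in names]
        rows.append((word, *per_candidate))
    return rows

# Hypothetical word counts for two candidates.
debate = {
    "A": {"tax": 3, "war": 1},
    "B": {"war": 4, "jobs": 2},
}
for row in side_by_side(debate):
    print(row)
```

Because every candidate's column uses the same row order, reading across a row compares one word between candidates.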

Kaiser

Josh: excellent points. I actually discussed these in a much older post. There was even a bar chart included for comparison!

The alphabetical order is typically meaningless and to be avoided. This case, I believe, is an exception. Any kind of random order would spread things out more than the order by frequency. The alphabetical just happens to induce some "randomness" (although I'm sure we can find examples when it doesn't).

Derek: introducing the time dimension would be exciting. Especially if these debates were unmoderated; otherwise, we'd end up looking at the moderator's preference. Allowing free flowing debate would be akin to leaving the mike on; how embarrassing.

derek

Something that could actually be done with the raw numbers gathered by this exercise (although hard to extract from these graphics - a point to josh), would be a scatter graph of word frequency between two debaters. Top right, of concern to both; bottom left, neglected by both; bottom right, B talked about more than A; top left, vice versa.
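The four regions of that scatter graph can be sketched without plotting by labelling each word according to which side of a cutoff its two counts fall on. This assumes debater B is on the horizontal axis and A on the vertical, which matches the corners as described; the cutoff value and the name `quadrant` are assumptions.

```python
def quadrant(a_count, b_count, cutoff):
    """Label a word by its position on a B (x-axis) versus A (y-axis) scatter."""
    high_a = a_count >= cutoff
    high_b = b_count >= cutoff
    if high_a and high_b:
        return "top right: of concern to both"
    if high_b:
        return "bottom right: B more than A"
    if high_a:
        return "top left: A more than B"
    return "bottom left: neglected by both"

# Hypothetical (A count, B count) pairs for a few words.
words = {"tax": (9, 8), "war": (1, 7), "jobs": (6, 0), "trade": (1, 1)}
for word, (a, b) in words.items():
    print(word, "->", quadrant(a, b, cutoff=5))
```

A single cutoff is the crudest possible split; on the actual scatter graph, distance from the diagonal shows how lopsided each word is.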

Closets

I was sceptical when starting to read this article. However, the conclusions do seem consistent with my own intuitive reaction to the debate.

ars

Not sure if you've already covered this, but you might be interested in Chirag Mehta's tag clouds of presidential speeches going back to Washington. I think that was the first such application of tag clouds.

http://chir.ag/phernalia/preztags/

Kaiser

Ars: thanks for the link. Good site. I'm not sure I understand the "recency" dimension since the date of the speech is fixed.

derek

That reminds me of the State of the Union parsing tool on Jonathan Corum's style.org.


Marketing analytics and data visualization expert. Author and Speaker. Currently at Vimeo and NYU. See my full bio.
