Stone-age graphic
Aug 17, 2010
This Economist chart on the history of world GDP throws the art of graphics back several hundred years. (Thanks Tyler A. for the link.)
And I can't really re-make it since I can't make heads or tails of it.
- How are the columns sorted? (on second thought, maybe the 70 should read 1870, 13 is 1913, and so on?)
- Why are there differing gaps between columns?
- Italy was not a country, and the US was definitely not in existence in AD1 so what does it mean to have values for those on the chart? If this is created by taking current-day boundaries and projecting back in time, why are today's boundaries treated as sacrosanct?
- If the columns are sorted chronologically, a line chart would be much more readable. At the minimum, it will reduce the number of colors to 1. Note that multiple colors are necessitated by the choice of a stacked column chart.
- A stacked column chart with percentages should always extend to 100%. The current chart is very misleading if we want to know the percentage of world GDP produced by "other countries".
- How are the countries ordered within a column? It's neither alphabetical, nor by the starting or ending distributions.
- Don't challenge readers by having vertically stacked categories and a horizontal legend.
- It would also be much better if there are annotations to help the reader understand the chart, e.g. collapse of the Roman Empire, Renaissance, Great Depression, Big Fire, etc.
PS. [8/18/10] Dustin linked to a line-chart version of this chart, from the World Bank site, via Chartporn.
I think the evidence is right here as to why the Economist execution leaves a ton to be desired. The use of lines allows the reader to easily trace the rise and fall of different economies, which is the point of the data set. The stacked-column chart draws attention to a point-in-time distribution of GDP among different countries, which is of secondary importance.
There are other differences: this plots the share of "growth" as opposed to the share of total GDP. It also plots regions rather than countries (well, except for China and Japan). It does not presuppose that the US was in existence before its founding. It could have (should have) included an "rest of the world" line.
The spacing of the years is still problematic but it's an Excel inconvenience, really. But it's ok to stretch the axis on a line chart, it's a problem to do it with a column chart, as demonstrated above. The gaps between columns should be proportional to the years between the data but this is impossible to do in a column chart.
I assume that the gap between the tops of the bars and 100% represents the "other" category. I also assume that they were using historical correlates such as the Italia province of the Roman Empire for Italy or Gallia for France (in the case of AD1). Obviously it would be nice to know how they decided to project the economic activity of the hunter-gatherers populating the stretch of land that is now the United States.
Overall, I agree that the execution of this chart was poor, but I don't think that a column chart is the worst choice here. They are taking somewhat arbitrary slices of time and a line chart would (at least to me) carry with it a greater implication of continuity between the times they chose (especially AD1, 1000, and 1500). In the end, they leave far too much uncertainty and force the readers to fill in blanks that they should have filled themselves.
Posted by: Meng Bomin | Aug 17, 2010 at 11:41 PM
I think the differing gaps between columns are meant to indicate the differing time gaps. Although it's still pretty ad-hoc -- I guess they couldn't come up with a spacing that would make the differences in recent years visible, whilst still allowing them to start with 1ad.
Posted by: Repton | Aug 17, 2010 at 11:44 PM
why does it matter how the countries are sorted within a column as long as they are the same sort for each column?
Posted by: thomas | Aug 18, 2010 at 07:08 AM
Oh come on. I actually really like this one! Many of the issues you raised have been addressed by others. Regarding the ordering within a column, it looks quasi-geographical to me. Asia at the bottom, then the US, then European powers. I think your only actually valid point is that the legend should be vertical and should align with the country data.
Posted by: HarlanH | Aug 18, 2010 at 07:50 AM
This chart is artistic. it requires processing and thought, as a painting or sculpture would.
We are taught that professional and academic charts should be clearly described such that there is no equivocal information.
This chart does not belong in a boardroom or classroom, but its puzzle features leave you longing for answers. Have questions? Go hunting!
Posted by: Rhett | Aug 18, 2010 at 10:01 AM
I'd say that the different gaps is a good decision, since they represent (although not accurately, due to the huge difference between the gaps) how many time passed between one bar and another.
And I'm not a fan at all of stacked bars, though I've been using them only when I want to show only the most contrasting values, USA and China in this case. All other data are hardly comparable. It can be an editorial decision, even in detriment of other data, which may be a pity.
Rather than a complete redesign, I think this chart need some retouching. Maybe I'll try later.
Posted by: Matias | Aug 18, 2010 at 12:30 PM
FYI: here's another visualization of the same dataset:
http://chartporn.org/2010/03/29/2000-years-of-economic-growth/
Posted by: Dustin | Aug 18, 2010 at 01:26 PM
The horizontal axis in the line graph is even more problematic since with this category-style axis (as Excel calls it) the first-order derivatives of the growth rate (i.e. the change in growth rate) have no meaning any more. Smoothing only makes thinks worse, in my opinion.
Posted by: Martijn | Aug 23, 2010 at 05:49 AM
"The gaps between columns should be proportional to the years between the data but this is impossible to do in a column chart." Not impossible, but not worth doing. You COULD put 998 blank columns between year 1 and year 1000, another 499 between year 1000 and year 1500, etc. You end up with columns too thin to detect the color of the stacked components, so what's the point? The line chart is much better, you're right.
Posted by: John | Aug 27, 2010 at 09:01 PM