« Social networking | Main | The punch line »

TrackBack

TrackBack URL for this entry:
http://www.typepad.com/services/trackback/6a00d8341e992c53ef00e54fa12a488834

Listed below are links to weblogs that reference A dangerous equation:

Comments

Jens

Interesting, thanks for the reminder.

However, I think in this case, there is an important point that you have overlooked. What they did here was to split 12 large schools into 47 smaller schools. This means that the underlying 'base' of students (or the pool from which they are drawn) has remained the same for each comparative group. BTW, one has to assume that the average percentage of graduation was calculated on the basis of individuals (not taking averages of several schools) and thus, the population size per school was probably the same.

One of the things that I am missing in the analysis is the expenditure per student prior and after the split, as this might be substantially different. Provided the OPEX per student is similar, than this 'experiment' would actually proof that small schools are much more effective than larger ones at teaching students.

derek

It's a nice demonstration of the principle, but Dr Wainer spoils his point a bit by going on to say:

"the regression line shows a significant positive slope; overall, students at bigger schools do better. This too is not unexpected, since very small high schools cannot provide as broad a curriculum or as many highly specialized teachers as can large schools."

He has not presented the evidence that that's the reason for the positive slope, only told a just-so story about why it might be the reason. The small-schools movement have their own just-so stories that "explain" why their ideas are better, like smaller schools allowing teachers to know their students better. It doesn't make it true. Dr Wainer should have just let the graph speak for itself, instead of trying to tell a story about the graph that it wasn't equipped to confirm.

Alternatively, if he wanted to make the graph confirm the story, he ought to have graphed "breadth of curriculum" or "number of highly specialized teachers" against school size and PSSA score. The extra data would have been welcome context, but words alone are not; which I think I will take as a valuable check on my own tendency to lard my graphs about with words "adding context".

John S.

Andrew Gelman has a nice example of this. He shows a map of kidney cancer deaths by U.S counties. Shade the counties with the highest death rates, and sparsely populated Midwestern counties stand out. He asks his students to speculate as to why this might be. Lack of access to health care? Polluted groundwater?

Then he shows another map on which counties with the lowest death rates are shaded. Once again, sparsely populated Midwestern counties stand out again. See Section 3 of this paper:

http://www.stat.columbia.edu/~gelman/research/published/bayesdemos.pdf

This example is also included in his book "Bayesian Data Analysis".

Verify your Comment

Previewing your Comment

This is only a preview. Your comment has not yet been posted.

Working...
Your comment could not be posted. Error type:
Your comment has been posted. Post another comment

The letters and numbers you entered did not match the image. Please try again.

As a final step before posting your comment, enter the letters and numbers you see in the image below. This prevents automated programs from posting comments.

Having trouble reading this image? View an alternate.

Working...

Post a comment

Marketing analytics and data visualization expert. Author and Speaker. Currently at Vimeo and NYU. See my full bio.

Book Blog



Link to junkcharts

Graphics design by Amanda Lee

The Read



Good Books

Keep in Touch

follow me on Twitter

Residues