Comments on Primer on Regression Adjustments 1Kaiser starts a series of posts about regression adjustments.TypePad2021-08-25T21:34:00Zjunkchartshttps://junkcharts.typepad.com/numbersruleyourworld/tag:typepad.com,2003:https://junkcharts.typepad.com/numbersruleyourworld/2021/08/primer-on-regression-adjustments-1/comments/atom.xml/Kaiser commented on 'Primer on Regression Adjustments 1'tag:typepad.com,2003:6a00d8341e992c53ef02788043e89d200d2021-08-27T19:59:19Z2021-08-28T16:38:09ZKaiser https://junkcharts.typepad.com/numbersruleyourworldJK: Thank you for persisting. I see what you and MD are complaining about. It's the word "person". I've changed...<p>JK: Thank you for persisting. I see what you and MD are complaining about. It's the word "person". I've changed it to "sample". </p>Jason Kerwin commented on 'Primer on Regression Adjustments 1'tag:typepad.com,2003:6a00d8341e992c53ef02788043e85e200d2021-08-27T19:52:43Z2021-08-28T16:38:09ZJason Kerwinhttp://jasonkerwin.comKaiser: I think your calculation is right for the percentiles of the distribution of sample-average heights. I am fairly certain...<p>Kaiser: I think your calculation is right for the percentiles of the distribution of sample-average heights. I am fairly certain it's wrong for the percentiles of the distribution of people's actual heights. 50th to 97.5th percentile for the latter distribution should be 2 SDs, not 2 SEs.</p>Kaiser commented on 'Primer on Regression Adjustments 1'tag:typepad.com,2003:6a00d8341e992c53ef026bdeebfbf2200c2021-08-27T15:13:50Z2021-08-27T15:27:50ZKaiserhttps://junkcharts.typepad.com/numbersruleyourworldMD: Let me think this through. SE = 0.25 inch. Margin of error is 2*SE on each side of the...<p>MD: Let me think this through. SE = 0.25 inch. Margin of error is 2*SE on each side of the mean. 2*SE = 0.5 inch. For a normal distribution, median = mean, and the margin of error is the middle 95%, spanning the 2.5th percentile to the 97.5th percentile. 
So from the 50th to 97.5th percentile is half the margin of error, which is 2*SE. Did I screw something up?</p>Michael Droy commented on 'Primer on Regression Adjustments 1'tag:typepad.com,2003:6a00d8341e992c53ef026bdeebf848200c2021-08-27T13:57:49Z2021-08-27T15:09:36ZMichael DroyKaiser: I agree that 0.5 is a big error. It was specifically the following statement: "Half an inch is the...<p>Kaiser:<br />
I agree that 0.5 is a big error.<br />
It was specifically the following statement:<br />
"Half an inch is the difference between the median person and the 97.5th percentile <b>person</b>."<br />
That seems odd to me. </p>
<p>Enjoying this series.</p>Clur commented on 'Primer on Regression Adjustments 1'tag:typepad.com,2003:6a00d8341e992c53ef026bdeebe659200c2021-08-27T07:46:49Z2021-08-27T07:47:34ZClurhttps://profile.typepad.com/6p026bded48b34200cIt has always been difficult for me to understand the limits of the regression analysis so I am very happy...<p>It has always been difficult for me to understand the limits of the regression analysis so I am very happy that you are making this series of posts!<br />
Thanks a lot for taking the time to write it and even more to share it with us here!<br />
I am looking forward to reading it! </p>Kaiser commented on 'Primer on Regression Adjustments 1'tag:typepad.com,2003:6a00d8341e992c53ef02788043a6c8200d2021-08-26T20:39:22Z2021-08-26T23:46:59ZKaiserhttps://junkcharts.typepad.com/numbersruleyourworldJK: The 0.25 inch gap is on the sampling distribution, not the population distribution. So, reverse the SE formula, 0.25...<p>JK: The 0.25 inch gap is on the sampling distribution, not the population distribution. So, reversing the SE formula, 0.25 inch * sqrt(900) = 7.5 inches is an estimate of the population SD of heights. I took the population values from this CDC <a href="https://www.cdc.gov/nchs/data/ad/ad347.pdf" rel="nofollow">report</a> (Table 8), and assumed a normal distribution on heights. </p>
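<p>A rough way to check the "reverse the SE formula" step is to simulate it. The sketch below assumes normally distributed heights; the population mean of 66 inches, the seed, and the 500 repetitions are arbitrary illustrative choices, not values from the post. Only the SE of 0.25 inch and the sample size of 900 come from the thread.</p>

```python
import math
import random
import statistics

N = 900           # sample size quoted in the thread
SE = 0.25         # standard error of the sample mean, in inches (from the thread)
POP_MEAN = 66.0   # hypothetical population mean height; arbitrary, not from the source

# Reverse the SE formula: SE = SD / sqrt(n), so SD = SE * sqrt(n).
pop_sd = SE * math.sqrt(N)

rng = random.Random(0)  # fixed seed so the run is repeatable


def sample_mean():
    """Draw one sample of N normally distributed heights and return its average."""
    return statistics.fmean(rng.gauss(POP_MEAN, pop_sd) for _ in range(N))


# Repeat the draw many times and measure how much the sample averages vary.
means = [sample_mean() for _ in range(500)]
sd_of_means = statistics.stdev(means)

print(f"implied population SD: {pop_sd} inches")
print(f"SD of the sample averages: {sd_of_means:.3f} inches (the SE is {SE})")
```

<p>The spread of the simulated sample averages should land close to 0.25 inch, while individual heights vary with an SD of 7.5 inches.</p>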
<p>Instead of "almost everyone is nearly exactly my height", think almost every sample average height (from repeated drawing of 900 people) is nearly exactly the same value as the sample we're looking at. </p>Jason Kerwin commented on 'Primer on Regression Adjustments 1'tag:typepad.com,2003:6a00d8341e992c53ef0282e11c1b61200b2021-08-26T19:30:38Z2021-08-26T23:46:59ZJason Kerwinhttp://jasonkerwin.comDoes the population the sample is drawn from have an SD of 0.25 inches? That's what the gap between the...<p>Does the population the sample is drawn from have an SD of 0.25 inches? That's what the gap between the mean and the 97.5th percentile being 0.5 inches implies.</p>
<p>A quick google search shows that the SD of human height is 3 inches, so the difference between the mean and the 97.5th percentile is 6 inches. That is consistent with my experience, too - I am around 70 inches tall, and an SD of 0.25 would imply that almost everyone is nearly exactly my height.</p>Kaiser commented on 'Primer on Regression Adjustments 1'tag:typepad.com,2003:6a00d8341e992c53ef026bdeebb993200c2021-08-26T15:59:07Z2021-08-26T18:45:55ZKaiserhttps://junkcharts.typepad.com/numbersruleyourworldMD: You raised a common point of confusion. To clarify the situation, think about the overall objective of a statistical...<p>MD: You raised a common point of confusion. To clarify the situation, think about the overall objective of a statistical study. We have a sample of data, and we want to extrapolate from that sample to the unknown population. So we don't know what the SD of the population is. All we have is data on 900 people, with a sample SD of 7.7. The sample SD is not a good estimate of the population SD - not surprising because 900 vs 25,000 people. One of the most magical formulas in all of statistics is the standard error, which measures the variability of the sample average from sample to sample. </p>
<p>Given our objective, the error is defined as how far our sample average is from the population average. We therefore care about the variability of the sample average, so the relevant quantity is the standard error. </p>Michael Droy commented on 'Primer on Regression Adjustments 1'tag:typepad.com,2003:6a00d8341e992c53ef0282e11c0ebe200b2021-08-26T15:33:01Z2021-08-26T18:45:55ZMichael DroyInteresting topic and looking forward to the next episode. A little nitpicking: "For later reference, just remember that 0.5 inch (12...<p>Interesting topic and looking forward to the next episode.</p>
<p>A little nitpicking:<br />
"For later reference, just remember that 0.5 inch (12 mm) is a big error on this scale. Half an inch is the difference between the median person and the 97.5th percentile person. So our tolerance for inaccuracy is described in small fractions of an inch."</p>
<p>Is this right? The standard deviation for persons (as opposed to samples of 900) is 7.8". So the median to 97.5th percentile should be 15.6".<br />
The 97.5th percentile is where this sample is relative to all other samples of 900 persons. A person in the sample 0.5 inches above the median would be at the 52.5th percentile.</p>
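<p>The two readings of "97.5th percentile" can be compared numerically. This is a minimal sketch assuming heights are normally distributed and measured relative to the median. The SD of 7.8 inches and the sample size of 900 are the figures quoted in this comment; the variable names are illustrative.</p>

```python
from statistics import NormalDist

PERSON_SD = 7.8     # SD of individual heights quoted in the comment, in inches
SAMPLE_SIZE = 900   # people per sample

# Standard error of the sample average: SD / sqrt(n).
se = PERSON_SD / SAMPLE_SIZE ** 0.5

# Both distributions are centered at 0, i.e. heights relative to the median.
people = NormalDist(mu=0.0, sigma=PERSON_SD)   # individual heights
sample_means = NormalDist(mu=0.0, sigma=se)    # averages of 900 people

# Distance from the median to the 97.5th percentile in each distribution.
gap_people = people.inv_cdf(0.975)
gap_means = sample_means.inv_cdf(0.975)

# Percentile of one person standing 0.5 inch above the median.
pct_person_half_inch = people.cdf(0.5) * 100

print(f"people: median to 97.5th percentile = {gap_people:.2f} inches")
print(f"sample averages: median to 97.5th percentile = {gap_means:.2f} inches")
print(f"a person 0.5 inch above the median sits near percentile {pct_person_half_inch:.1f}")
```

<p>The exact normal multiplier is about 1.96 rather than 2, so the gaps come out slightly under 15.6 and 0.52 inches.</p>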
<p>And on 7.8/sqrt(900):<br />
From memory, shouldn't this be 7.8/sqrt(899)? (Serious nitpicking!!)<br />
</p>
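<p>On the n versus n-1 question, the usual convention puts n-1 inside the sample SD and then divides by sqrt(n) to get the standard error. Dividing an SD computed with n by sqrt(n-1) gives the same number. The sketch below checks that identity on a small set of hypothetical heights (not data from the post), then shows how little the choice matters at n = 900.</p>

```python
import math
import statistics

# Hypothetical heights in inches, used only to illustrate the identity.
heights = [61.0, 64.5, 66.0, 67.5, 70.0, 72.5]
n = len(heights)

# Convention 1: sample SD (n - 1 in the denominator), then divide by sqrt(n).
se_from_sample_sd = statistics.stdev(heights) / math.sqrt(n)

# Convention 2: population-style SD (n in the denominator), then divide by sqrt(n - 1).
se_from_population_sd = statistics.pstdev(heights) / math.sqrt(n - 1)

# With the figures from the comment, the two divisors barely differ.
gap = 7.8 / math.sqrt(899) - 7.8 / math.sqrt(900)

print(f"SE via stdev / sqrt(n): {se_from_sample_sd:.6f}")
print(f"SE via pstdev / sqrt(n - 1): {se_from_population_sd:.6f}")
print(f"7.8/sqrt(899) - 7.8/sqrt(900) = {gap:.6f} inches")
```

<p>So whether sqrt(899) is right depends on how the 7.8 was computed, and at this sample size the difference is far below the fractions of an inch under discussion.</p>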