18.2 Effects of Nonresponse

Textbook example: Example reported in W.G. Cochran, Sampling Techniques, 3rd edition, 1977, ch. 13

Consider data that come form an experimental sampling of fruit orcharts in North Carolina in 1946. Three successive mailings of the same questionnaire were sent to growers. For one of the questions the number of fruit trees, complete data were available for the population…

Ave. # trees	# of growers	% of pop’n	Ave # trees/grower
1st mailing responders	300	10	456
2nd mailing responders	543	17	382
3rd mailing responders	434	14	340
Nonresponders	1839	59	290
	——–	——–	——–
Total population	3116	100	329

The overall response rate was very low.
The rate of non response is clearly related to the average number of trees per grower.
The estimate of the average trees per grower can be calculated as a weighted average from responders \(\bar{Y_{1}}\) and non responders \(\bar{Y_{2}}\).

Bias: The difference between the observed estimate \(\bar{y}_{1}\) and the true parameter \(\mu\).

\[ \begin{aligned} E(\bar{y}_{1}) - \mu & = \bar{Y_{1}} - \bar{Y} \\ & = \bar{Y}_{1} - \left[(1-w)\bar{Y}_{1} - w\bar{Y}_{2}\right] \\ & = w(\bar{Y}_{1} - \bar{Y}_{2}) \end{aligned} \]

Where \(w\) is the proportion of non-response.

The amount of bias is the product of the proportion of non-response and the difference in the means between the responders and the non-responders.
The sample provides no information about \(\bar{Y_{2}}\), the size of the bias is generally unknown without information gained from external data.