Celebratio Mathematica

D. Blackwell and J. L. Hodges, Jr.: “Design for the control of selection bias,” Ann. Math. Stat. 28 : 2 (1957), pp. 449–460. MR 0088849 Zbl 0081.36403 article

Suppose an experimenter \( E \) wishes to compare the effectiveness of two treatments, \( A \) and \( B \), on a somewhat vaguely defined population. As individuals arrive, \( E \) decides whether they are in the population, and if he decides that they are, he administers \( A \) or \( B \) and notes the result, until \( nA \)’s and \( nB \)’s have been administered. Plainly, if \( E \) is aware, before deciding whether an individual is in the population, which treatment is to be administered next, he may, not necessarily deliberately, introduce a bias into the experiment. This bias we call selection bias.

We propose to investigate the extent to which a statistician \( S \), by determining the order in which treatments are administered, and not revealing to \( E \) which treatment comes next until after the individual who is to receive it has been selected, can control this selection bias. Thus a design \( d \) is a distribution over the set \( T \) of the \( \binom{2n}{n} \) sequences of length \( 2n \) containing \( nA \)’s and \( nB \)’s.

We shall measure the bias of a design by the maximum expected number of correct guesses which an experimenter can achieve, knowing \( d \), attempting to guess the successive elements of a sequence \( t \in T \) selected by \( d \), and being told after each guess whether or not it is correct. The distribution of the number \( G \) of correct guesses depends both on \( d \) and on the prediction method \( p \) used by the experimenter. We shall consider particularly two designs, the truncated binomial, in which the successive treatments are selected independently with probability \( 1/2 \) each until \( n \) treatments of one kind have occurred, and the sampling design, in which all \( \binom{2n}{n} \) sequences are equally likely.

We shall consider particularly two prediction methods, the convergent prediction, which predicts that treatment which has hitherto occurred less often, and the divergent prediction, which predicts that treatment which has hitherto occurred more often, except that after \( n \) treatments of one kind have been administered, the divergent prediction agrees with the convergent predictions that the other treatment will follow; when both treatments have occurred equally often, either method predicts \( A \) or \( B \) by tossing a fair coin, independently for each case of equality.

We find that among all designs, the truncated binomial minimizes the maximum expected number of correct guesses. For this design, the expected number of correct guesses is independent of the prediction method, and is \[ n + n \binom{2n}{n} \big/ 2^{2n} \sim n + \Bigl(\frac{n}{\pi}\Bigr)^{1/2} .\] With the truncated binomial design, the variance in the number of correct guesses is largest for the divergence strategy and is \[ \frac{3n}{2} - D - \frac{D^2}{4} \sim \frac{(3\pi - 2)n}{2\pi} - 2\Bigl(\frac{n}{\pi}\Bigr)^{1/2}, \] where \( D = n \binom{2n}{n} \big/ 2^{2n - 1} \), and is smallest for the convergence strategy, and is \[ \frac{n}{2} - \frac{D^2}{4} \sim \frac{(\pi - 1)n}{2\pi} .\] For the sampling design, convergent prediction maximizes the expected number of correct guesses; this maximum is \[ n + 2^{2n - 1} \!\big/ \binom{2n}{n} - \frac{1}{2} \sim n + \Bigl(\frac{\pi n}{4}\Bigr)^{1/2}. \] Finally we note that, if treatments are selected independently at random, bias of the kind we discuss disappears, but the treatment numbers can no longer be preassigned. Three such designs are considered: the fixed total design, in which the total number of treatments is a fixed number \( s \), the fixed factor design, in which we continue until \[ \frac{1}{X} + \frac{1}{Y} \leq \frac{2}{n} ,\] where \( X \) is the number of \( A \) treatments and \( Y \) is the number of \( B \) treatments administered, and the fixed minimum design, in which we continue until \( \min (X, Y) = n \). For the fixed total design, we find that, for \( s = 2n + 4 \), \[ \mathrm{Pr}\Bigl(\frac{1}{X} + \frac{1}{Y} \leq \frac{2}{n}\Bigr) \sim 0.955 \] for large \( n \); at the expense of 4 extra observations, we have a bias-free design whose variance factor will with probability \( 0.955 \) be smaller than that in which treatment numbers are preassigned. For the fixed factor design, the additional number of observations required to achieve the given precision has for large \( n \) the distribution of the square of a normal deviate. For the fixed minimum design, in which we guarantee precision for the estimated effect of each treatment, the expected number of additional observations is roughly \( 1.13 (n)^{1/2} \).

D. Blackwell and J. L. Hodges, Jr.: “The probability in the extreme tail of a convolution,” Ann. Math. Stat. 30 : 4 (1959), pp. 1113–1120. MR 0112197 Zbl 0099.35105 article

Let \( X_1, X_2, \dots \) be independent and identically distributed random variables with possible values that are integers whose differences have g.c.d. one. Assume the m.g.f. of \( X_1 \) exists in an interval about 0, let \( a \) be any number such that \[ E(X_1) < a < \sup X_1 ,\] and let \[ \varphi(a, t) = E\,e^{t(X_{1-a})} .\] There exists a unique value \( t^{\ast}(a) \) of \( t \) which minimizes \( \varphi(a, t) \) with respect to \( t \); write \[ m(a) = \varphi[ a, t^{\ast}(a)] \quad\text{and}\quad z = e^{-t^{\ast}(a)} .\] Let \( Y_1,Y_2, \dots \) be independent and identically distributed random variables such that \( Y_1 \) and \( X_1 \) have the same range and \[ \Pr(Y_1 = x) = \Pr(X_1 = x) \cdot \frac{e^{t^{\ast}(a)\,(x-a)}}{m(a)} ,\] and let \( \mu_2 = \sigma^2, \mu_3, \mu_4 \) be central moments of \( Y_1 \). We show that \[ \Pr \{X_1 + \dots + X_n = na\} = [ m(a) ]^n \Pr \{Y_1 + \dots + Y_n = na\} ,\] and use this to establish the approximation \[ \Pr \{X_1 + \dots + X_n = na\} = \pi^{\ast\ast}_n[ 1 + 0(n^{-2})] ,\] where \( na \) is a possible value of \( X_1 + \dots + X_n \) and \[ \pi^{\ast\ast}_n = \frac{[ m(a)]^n}{\sigma\sqrt{2\pi n}} \Bigl[ 1 + \frac{1}{8n} \Bigl(\frac{\mu_4}{\mu^2_2} - 3 - \frac{5}{3} \frac{\mu^3_2}{\mu^3_2}\Bigr)\Bigr]. \] Similarly we find that \[ \Pr \{X_1 + \dots + X_n \geq na\} = \Pi^{\ast\ast}_n[ 1 + 0(n^{-2})] ,\] where \[ \Pi^{\ast\ast}_n = \pi^{\ast\ast}_n \cdot \frac{1}{1 - z}\Bigl\{1 - \frac{1}{2n}\Bigl[\frac{(z\mu_3/\mu_2) + z(1 + z)/(1 - z)}{(1 + z)\mu_2}\Bigr]\Bigr\}. \] We provide some numerical illustrations of the accuracy of these approximations, and give a conjectured analog of the leading term of \( \Pi^{\ast\ast}_n \) for nonlattice variables.

D. Blackwell and J. L. Hodges, Jr.: “Elementary path counts,” Am. Math. Mon. 74 : 7 (August–September 1967), pp. 801–804. Zbl 0155.02903 article

D. Blackwell: “A hypothesis-testing game without a value,” pp. 79–82 in A Festschrift for Erich L. Lehmann. Edited by P. J. Bickel, K. A. Doksum, and J. L. Hodges. Wadsworth Statistics/Probability Series. Wadsworth (Belmont, CA), 1983. MR 689739 Zbl 0525.62006 incollection

Kjell Andreas Doksum	Related
Joseph Lawson Hodges, Jr.	Related
Erich Leo Lehmann	Related
Peter J. Bickel	Related

M. Proschan: “A note on D. H. Blackwell and J. L. Hodges, Jr.: ‘Design for the control of selection bias’, and P. Diaconis and R. L. Graham: ‘The analysis of sequential experiments with feedback to subjects’,” Ann. Statist. 19 : 2 (1991), pp. 1106–1108. The paper of Diaconis and Graham is Ann. Statist. 9:1 (1981) pp. 3–23. Commentary on Ann. Math. Stat. 28:2 (1957). MR 1105868 Zbl 0747.62021 article

Ronald Lewis Graham	Related
Joseph Lawson Hodges, Jr.	Related
Michael Proschan	Related
Persi Diaconis	Related

year	title	people

			clear

David H. Blackwell

Complete Bibliography

Works connected to Joseph Lawson Hodges, Jr.

Filter the Bibliography List