Deviations from the mean of a sum of independent, non-identically-distributed Bernoulli variables

September 29, 2011

Let Bi, i = 1, …, n be independent Bernoulli variables with parameters qi, i = 1, …, n, respectively. Let S be their sum. For convenience, assume q1 ≥ q2 ≥ ··· ≥ qn. I wish to bound tightly from above the probability that S is greater or equal to some l, having the bound depend solely on ES = q1 + ··· + qn.

Clearly, if l ≤ ES, then the tightest bound is 1. This is attained by setting q1 = ··· = ql = 1.

This example shows that while the variance of S is maximized by setting qi = ES / n, i = 1, ···, n, at least for some values, l, P(S ≥ l) is maximized by having the Bi not identically distributed.

Proposition 1:

For every l, P(S ≥ l) is maximized when q1 = ··· = qmo = 1, qmo+1 = ··· = qn-mz = q, and qn-mz+1 = ··· = qn = 0, for some mo and mz, and for q = (ES – mo) / (n – mo – mz).


Assume that 1 > qi > qj > 0. Let S’ = S – Bi – Bj. Then

P(S ≥ l) = P(S’ ≥ l) + p1 (qi + qj – qi qj) + p2 qi qj,

where p1 = P(S’ = l – 1) and p2 = P(S’ = l – 2). Thus, keeping qi + qj fixed, but varying the proportion between them, P(S ≥ l) is a linear function of qi qj. Unless p1 = p2, P(S ≥ l) will be increased by varying qi and qj – with a maximum either when they are equal or when one of them is zero or one.

Therefore, P(S ≥ l) cannot be at a maximum if there exist 1 > qi > qj > 0, unless p1 = p2. But in that case, the same probability can be achieved by setting the parameter of Bi to be q’i = 0 (if qi + qj < 1) or q’i = 1 (otherwise), and the parameter of Bj to be q’j = qi + qj – q’i. Therefore, in that case, there would exist a set of parameters, q’1, …, q’n, that would achieve that same probability but with fewer parameters that are not equal to zero or one. Thus, in the set of parameter settings maximizing P(S ≥ l), there exists a solution – namely the one which maximizes the number of parameters with extreme values (zeros and ones) – in which there is only one non-extreme value. ¤

The next step is to investigate which specific parameter setting correspond to various combinations of l and ES.


9 Responses to “Deviations from the mean of a sum of independent, non-identically-distributed Bernoulli variables”

  1. Arkadas Says:

    Interesting post, Yoram, looking forward to the continuation. Was this inspired by a real-life problem?

  2. Yoram Gat Says:

    Hi Arkadas,

    Thanks for your interest.

    This is connected in some way to my 2008 EJS paper, Rigorous LCB for the expectation of a positive r.v..

    In that paper I assumed that a sample from a set of IID r.v.’s is available. I have been trying (on-and-off) to extend the treatment to handling a sample from a set of independent non-ID r.v.’s. The real life application would be similar to the one in the paper – survey sampling – but would allow handling stratified sampling.

  3. Arkadas Says:

    Thanks, Yoram. I’ve read your paper (skipping the proofs), and learned quite a bit. It looks like a fundamental contribution—I am surprised that there are still research opportunities on such basic problems. This also got me curious about the original studies on Iraq war casualties, I’ve just checked the Wikipedia page, which was huge.

  4. Yoram Gat Says:

    I am also rather perplexed with the fact that basic questions are still open while the journals are filled with esoteric papers.

    I was half expecting the editor of EJS to return my paper saying that I was covering some well-known ground, but clearly he was unfamiliar with such prior work. Also, I guess Mark var der Laan – a well respected Berkeley professor – would not have put forward his rather absurd argument in the note I refer to in paper had he been aware that a rigorous finite sample LCB can be constructed.

    I would still not be that surprised if it turns out that a similar result was published before, but even if this is the case, the mere fact that two active academic statisticians were not aware of such existing work is puzzling.

    The area of independent but not identically distributed variables also seems to be under-researched, I think. It is a natural extension of the classical IID setup. Again, maybe I am just not aware of existing literature, but even if this is the case, it is surprising that it is not part of the standard textbook material.

  5. Yoram Gat Says:

    I just realized, to my surprise, that the argument made here applies much more widely than to the indicator function. It applies, in fact, to any function of a sum of independent Bernoulli r.v.’s.

    That is, the expectation of the function evaluated at the sum of independent Bernoulli r.v.’s (whose sum of expectations is given) is maximized when the sum is a shifted Binomial or a shifted Poisson.

  6. Yoram Gat Says:

    I have just become aware of some relevant prior work. Y.H. Wang’s On the number of successes in independent trials (1993) has various results about Poisson Binomial distributions. I have not read this paper yet, but it does note that “recent” study of Poisson Binomial distributions was initiated by Wassily Hoeffding in his 1956 paper On the Distribution of the Number of Successes in Independent Trials. It appears that my results here largely reproduce Hoeffding’s results.

    Such a fundamental result is not new then. The question of why this is not part of the standard probability curriculum remains.

  7. Yoram Gat Says:

    Another paper cited by Wang, a 1965 paper by Samuels, On the number of successes in independent trials, is also relevant.

  8. Yoram Gat Says:

    The results in Hoeffding’s, Samuels’s and Wang’s papers not withstanding it still appears to me that there are a lot of questions about PB distributions that have not been discussed.

    Most prominent I think is the question of characterizing the family of PB distributions. The family of PB distributions with the range {0, 1, …, n} covers a non-zero-measure subset of the [0, 1]^n parameter space of the general distribution over that range. How does this subset look like?

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s