Pairwise independence

Inprobability theory,apairwise independentcollection ofrandom variablesis a set of random variables any two of which areindependent.^[1]Any collection ofmutually independentrandom variables is pairwise independent, but some pairwise independent collections are not mutually independent. Pairwise independent random variables with finitevarianceareuncorrelated.

A pair of random variablesXandYareindependentif and only if the random vector (X,Y) withjointcumulative distribution function (CDF) $F_{X,Y}(x,y)$ satisfies

F_{X,Y}(x,y)=F_{X}(x)F_{Y}(y),

or equivalently, their joint density $f_{X,Y}(x,y)$ satisfies

f_{X,Y}(x,y)=f_{X}(x)f_{Y}(y).

That is, the joint distribution is equal to the product of the marginal distributions.^[2]

Unless it is not clear in context, in practice the modifier "mutual" is usually dropped so thatindependencemeansmutual independence.A statement such as "X,Y,Zare independent random variables "means thatX,Y,Zare mutually independent.

Example

Pairwise independence does not imply mutual independence, as shown by the following example attributed to S. Bernstein.^[3]

SupposeXandYare two independent tosses of a fair coin, where we designate 1 for heads and 0 for tails. Let the third random variableZbe equal to 1 if exactly one of those coin tosses resulted in "heads", and 0 otherwise (i.e., $Z=X\oplus Y$ ). Then jointly the triple (X,Y,Z) has the followingprobability distribution:

(X,Y,Z)=\left\{{\begin{matrix}(0,0,0)&{\text{with probability}}\ 1/4,\\(0,1,1)&{\text{with probability}}\ 1/4,\\(1,0,1)&{\text{with probability}}\ 1/4,\\(1,1,0)&{\text{with probability}}\ 1/4.\end{matrix}}\right.

Here themarginal probability distributionsare identical: $f_{X}(0)=f_{Y}(0)=f_{Z}(0)=1/2,$ and $f_{X}(1)=f_{Y}(1)=f_{Z}(1)=1/2.$ Thebivariate distributionsalso agree: $f_{X,Y}=f_{X,Z}=f_{Y,Z},$ where $f_{X,Y}(0,0)=f_{X,Y}(0,1)=f_{X,Y}(1,0)=f_{X,Y}(1,1)=1/4.$

Since each of the pairwise joint distributions equals the product of their respective marginal distributions, the variables are pairwise independent:

XandYare independent, and
XandZare independent, and
YandZare independent.

However,X,Y,andZarenotmutually independent,since $f_{X,Y,Z}(x,y,z)\neq f_{X}(x)f_{Y}(y)f_{Z}(z),$ the left side equalling for example 1/4 for (x,y,z) = (0, 0, 0) while the right side equals 1/8 for (x,y,z) = (0, 0, 0). In fact, any of $\{X,Y,Z\}$ is completely determined by the other two (any ofX,Y,Zis thesum (modulo 2)of the others). That is as far from independence as random variables can get.

Probability of the union of pairwise independent events

Bounds on theprobabilitythat the sum ofBernoulli random variablesis at least one, commonly known as theunion bound,are provided by theBoole–Fréchet^[4]^[5]inequalities. While these bounds assume onlyunivariateinformation, several bounds with knowledge of generalbivariateprobabilities, have been proposed too. Denote by $\{{A}_{i},i\in \{1,2,...,n\}\}$ a set of $n$ Bernoullievents withprobabilityof occurrence $\mathbb {P} (A_{i})=p_{i}$ for each $i$ .Suppose thebivariateprobabilities are given by $\mathbb {P} (A_{i}\cap A_{j})=p_{ij}$ for every pair of indices $(i,j)$ .Kounias^[6]derived the followingupper bound:

\mathbb {P} (\displaystyle {\cup }_{i}A_{i})\leq \displaystyle \sum _{i=1}^{n}p_{i}-{\underset {j\in \{1,2,..,n\}}{\max }}\sum _{i\neq j}p_{ij},

which subtracts the maximum weight of astar spanning treeon acomplete graphwith $n$ nodes (where the edge weights are given by $p_{ij}$ ) from the sum of themarginalprobabilities $\sum _{i}p_{i}$ .
Hunter-Worsley^[7]^[8]tightened thisupper boundby optimizing over $\tau \in T$ as follows:

\mathbb {P} (\displaystyle {\cup }_{i}A_{i})\leq \displaystyle \sum _{i=1}^{n}p_{i}-{\underset {\tau \in T}{\max }}\sum _{(i,j)\in \tau }p_{ij},

where $T$ is the set of allspanning treeson the graph. These bounds are not thetightestpossible with generalbivariates $p_{ij}$ even whenfeasibilityis guaranteed as shown in Boros et.al.^[9]However, when the variables arepairwise independent( $p_{ij}=p_{i}p_{j}$ ), Ramachandra—Natarajan^[10]showed that the Kounias-Hunter-Worsley^[6]^[7]^[8]bound istightby proving that the maximum probability of the union of events admits aclosed-form expressiongiven as:

\max \mathbb {P} (\displaystyle {\cup }_{i}A_{i})=\displaystyle \min \left(\sum _{i=1}^{n}p_{i}-p_{n}\left(\sum _{i=1}^{n-1}p_{i}\right),1\right)

(1)

where theprobabilitiesare sorted in increasing order as $0\leq p_{1}\leq p_{2}\leq \ldots \leq p_{n}\leq 1$ .Thetightbound inEq. 1depends only on the sum of the smallest $n-1$ probabilities $\sum _{i=1}^{n-1}p_{i}$ and the largest probability $p_{n}$ .Thus, whileorderingof theprobabilitiesplays a role in the derivation of the bound, theorderingamong the smallest $n-1$ probabilities $\{p_{1},p_{2},...,p_{n-1}\}$ is inconsequential since only their sum is used.

Comparison with theBoole–Fréchet union bound

It is useful to compare the smallest bounds on the probability of the union with arbitrarydependenceandpairwise independencerespectively. Thetightest Boole–Fréchet upper union bound(assuming onlyunivariateinformation) is given as:

\displaystyle \max \mathbb {P} (\displaystyle {\cup }_{i}A_{i})=\displaystyle \min \left(\sum _{i=1}^{n}p_{i},1\right)

(2)

As shown in Ramachandra-Natarajan,^[10]it can be easily verified that the ratio of the twotightbounds inEq. 2andEq. 1isupper boundedby $4/3$ where the maximum value of $4/3$ is attained when

\sum _{i=1}^{n-1}p_{i}=1/2

,

p_{n}=1/2

where theprobabilitiesare sorted in increasing order as $0\leq p_{1}\leq p_{2}\leq \ldots \leq p_{n}\leq 1$ .In other words, in the best-case scenario, the pairwise independence bound inEq. 1provides an improvement of $25\%$ over theunivariatebound inEq. 2.

Generalization

More generally, we can talk aboutk-wise independence, for anyk≥ 2. The idea is similar: a set ofrandom variablesisk-wise independent if every subset of sizekof those variables is independent.k-wise independence has been used in theoretical computer science, where it was used to prove a theorem about the problemMAXEkSAT.

k-wise independence is used in the proof thatk-independent hashingfunctions are secure unforgeablemessage authentication codes.

References

^Gut, A. (2005)Probability: a Graduate Course,Springer-Verlag.ISBN 0-387-27332-8.pp. 71–72.
^Hogg, R. V., McKean, J. W., Craig, A. T. (2005).Introduction to Mathematical Statistics(6 ed.). Upper Saddle River, NJ: Pearson Prentice Hall.ISBN 0-13-008507-3.{{cite book}}:CS1 maint: multiple names: authors list (link)Definition 2.5.1, page 109.
^Hogg, R. V., McKean, J. W., Craig, A. T. (2005).Introduction to Mathematical Statistics(6 ed.). Upper Saddle River, NJ: Pearson Prentice Hall.ISBN 0-13-008507-3.{{cite book}}:CS1 maint: multiple names: authors list (link)Remark 2.6.1, p. 120.
^Boole, G. (1854).An Investigation of the Laws of Thought, On Which Are Founded the Mathematical Theories of Logic and Probability.Walton and Maberly, London. See Boole's "major" and "minor" limits of a conjunction on page 299.
^Fréchet, M. (1935). Généralisations du théorème des probabilités totales.Fundamenta Mathematicae25:379–387.
^^a ^bE. G. Kounias (1968)."Bounds for the probability of a union, with applications".The Annals of Mathematical Statistics.39(6): 2154–2158.doi:10.1214/aoms/1177698049.
^^a ^bD. Hunter (1976). "An upper bound for the probability of a union".Journal of Applied Probability.13(3): 597–603.doi:10.2307/3212481.JSTOR 3212481.
^^a ^bK. J. Worsley (1982). "An improved Bonferroni inequality and applications".Biometrika.69(2): 297–302.doi:10.1093/biomet/69.2.297.
^Boros, Endre;Scozzari, Andrea; Tardella, Fabio; Veneziani, Pierangela (2014). "Polynomially computable bounds for the probability of the union of events".Mathematics of Operations Research.39(4): 1311–1329.doi:10.1287/moor.2014.0657.
^^a ^bRamachandra, Arjun Kodagehalli; Natarajan, Karthik (2023). "Tight Probability Bounds with Pairwise Independence".SIAM Journal on Discrete Mathematics.37(2): 516–555.arXiv:2006.00516.doi:10.1137/21M140829.

[1] Gut, A. (2005)Probability: a Graduate Course,Springer-Verlag.ISBN 0-387-27332-8.pp. 71–72.

[2] Hogg, R. V., McKean, J. W., Craig, A. T. (2005).Introduction to Mathematical Statistics(6 ed.). Upper Saddle River, NJ: Pearson Prentice Hall.ISBN 0-13-008507-3.{{cite book}}:CS1 maint: multiple names: authors list (link)Definition 2.5.1, page 109.

[3] Hogg, R. V., McKean, J. W., Craig, A. T. (2005).Introduction to Mathematical Statistics(6 ed.). Upper Saddle River, NJ: Pearson Prentice Hall.ISBN 0-13-008507-3.{{cite book}}:CS1 maint: multiple names: authors list (link)Remark 2.6.1, p. 120.

[boole54-4] Boole, G. (1854).An Investigation of the Laws of Thought, On Which Are Founded the Mathematical Theories of Logic and Probability.Walton and Maberly, London. See Boole's "major" and "minor" limits of a conjunction on page 299.

[frechet35-5] Fréchet, M. (1935). Généralisations du théorème des probabilités totales.Fundamenta Mathematicae25:379–387.

[Kounias-6] E. G. Kounias (1968)."Bounds for the probability of a union, with applications".The Annals of Mathematical Statistics.39(6): 2154–2158.doi:10.1214/aoms/1177698049.

[Hunter-7] D. Hunter (1976). "An upper bound for the probability of a union".Journal of Applied Probability.13(3): 597–603.doi:10.2307/3212481.JSTOR 3212481.

[Worsley-8] K. J. Worsley (1982). "An improved Bonferroni inequality and applications".Biometrika.69(2): 297–302.doi:10.1093/biomet/69.2.297.

[Boros2014-9] Boros, Endre;Scozzari, Andrea; Tardella, Fabio; Veneziani, Pierangela (2014). "Polynomially computable bounds for the probability of the union of events".Mathematics of Operations Research.39(4): 1311–1329.doi:10.1287/moor.2014.0657.

[Ramachandra-Natarajan-10] Ramachandra, Arjun Kodagehalli; Natarajan, Karthik (2023). "Tight Probability Bounds with Pairwise Independence".SIAM Journal on Discrete Mathematics.37(2): 516–555.arXiv:2006.00516.doi:10.1137/21M140829.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

Example

Probability of the union of pairwise independent events

Comparison with theBoole–Fréchetunion bound

Generalization

See also

References

Comparison with theBoole–Fréchet union bound