Distribution of the number of significant effect sizes

James E. Pustejovsky

A while back, I posted the outline of a problem about the number of significant effect size estimates in a study that reports multiple outcomes. This problem interests me because it connects to the issue of selective reporting of study results, which creates problems for meta-analysis. Here, I’ll re-state the problem in slightly more general terms and then make some notes about what’s going on.

Consider a study that assesses some effect size across $m$ different outcomes. (We’ll be thinking about one study at a time here, so no need to index the study as we would in a meta-analysis problem.) Let $T_{i}$ denote the effect size estimate for outcome $i$ , let $V_{i}$ denote the sampling variance of the effect size estimate for outcome $i$ , and let $θ_{i}$ denote the true effect size parameter for corresponding to outcome $i$ . Assume that the study outcomes ${[T_{i}]}_{i = 1}^{m}$ follow a correlated-and-hierarchical effects model, in which $T_{i} = μ + u + v_{i} + e_{i},$ where the study-level error $u \sim N (0, τ^{2})$ , the effect-specific error $v_{i} \overset{i i d}{\sim} N (0, ω^{2})$ , and the vector of sampling errors ${[e_{i}]}_{i = 1}^{m}$ is multivariate normal with mean $0$ , known variances $Var (e_{i}) = σ^{2}$ , and compound symmetric correlation structure $cor (e_{h}, e_{i}) = ρ$ .

Define $A_{i}$ as an indicator that is equal to one if $T_{i}$ is statistically significant at level $α$ based on a one-sided test, and otherwise equal to zero. (Equivalently, let $A_{i}$ be equal to one if the effect is statistically significant at level $2 α$ and in the theoretically expected direction.) Formally, $A_{i} = I (\frac{T_{i}}{σ} > q_{α})$ where $q_{α} = Φ^{- 1} (1 - α)$ is the critical value from a standard normal distribution (e.g., $q_{.05} = 1.645$ , $q_{.025} = 1.96$ ). Let $N_{A} = \sum_{i = 1}^{m} A_{i}$ denote the total number of statistically significant effect sizes in the study. The question is: what is the distribution of $N_{A}$ .

Compound symmetry to the rescue

As I noted in the previous post, this set-up means that the effect size estimates have a compound symmetric distribution. We can make this a bit more explicit by writing the sampling errors in terms of the sum of a component that’s common acrosss outcomes and a component that’s specific to each outcome. Thus, let $e_{i} = f + g_{i}$ , where $f \sim N (0, ρ σ^{2})$ and $g_{i} \overset{i i d}{\sim} N (0, (1 - ρ) σ^{2})$ . Let me also define $ζ = μ + u + f$ as the conditional mean of the effects. It then follows that the effect size estimates are conditionally independent, given the common components: $(T_{i} | ζ) \overset{i i d}{\sim} N (ζ, ω^{2} + (1 - ρ) σ^{2})$ Furthermore, the conditional probability of a significant effect is $Pr (A_{i} = 1 | ζ) = Φ (\frac{ζ - q_{α} σ}{\sqrt{ω^{2} + (1 - ρ) σ^{2}}})$ and $A_{1}, . . ., A_{m}$ are mutually independent, conditional on $ζ$ . Therefore, the conditional distribution of $N_{A}$ is binomial, $(N_{A} | ζ) \sim B i n (m, π)$ where $π = Φ (\frac{ζ - q_{α} σ}{\sqrt{ω^{2} + (1 - ρ) σ^{2}}}) .$ What about the unconditional distribution?

To get rid of the $ζ$ , we need to integrate over its distribution, which leads to $Pr (N_{A} = a) = E [Pr (N_{A} | ζ)] = \int f_{N_{A}} (a | ζ, ω, σ, ρ, m) \times f_{ζ} (ζ | μ, τ, σ, ρ) d ζ,$ where $f_{N_{A}} (a | ζ, ω, σ, ρ)$ is a binomial density with size $m$ and probability $π = π (ζ, ω, σ, ρ)$ and $f_{ζ} (ζ | μ, τ, σ, ρ)$ is a normal density with mean $μ$ and variance $τ^{2} + ρ σ^{2}$ .

This distribution is what you might call a binomial-normal convolution or a random-intercept probit model (where the random intercept is $ζ$ ). As far as I know, the distribution cannot be evaluated analytically but instead must be calculated using some sort of numerical integration routine.

Just the moments, please

If all we care about is the expectation of $N_{A}$ , we don’t need to bother with all the conditioning business and can just look at the marginal distribution of the effect size estimates taken individually. Marginally, $T_{i}$ is normally distributed with mean $μ$ and variance $τ^{2} + ω^{2} + σ^{2}$ , so $Pr (A_{i} = 1) = ψ$ , where $ψ = Φ (\frac{μ - q_{α} σ}{\sqrt{τ^{2} + ω^{2} + σ^{2}}}) .$ By the linearity of expectations, $E (N_{A}) = \sum_{i = 1}^{m} E (A_{i}) = m ψ .$

We can also get an approximation for the variance of $N_{A}$ by working with its conditional distribution above. By the rule of variance decomposition, $\begin{aligned} Var (N_{A}) & = E [Var (N_{A} | ζ)] + Var [E (N_{A} | ζ)] \\ = m \times E [π (1 - π)] + m^{2} \times Var [π] \\ = m \times E [π] (1 - E [π]) + m (m - 1) \times Var [π], \end{aligned}$ where $π$ is, as defined above, a function of $ζ$ and thus a random variable. Now, $E (π) = ψ$ and we can get something close to $Var (π)$ using a first-order approximation: $Var (π) \approx {({\frac{δ π}{δ ζ} |}_{ζ = μ})}^{2} \times Var (ζ) = {[ϕ (\frac{μ - q_{α} σ}{\sqrt{ω^{2} + (1 - ρ) σ^{2}}})]}^{2} \times \frac{τ^{2} + ρ σ^{2}}{ω^{2} + (1 - ρ) σ^{2}} .$ Thus, $\begin{array}{r} Var (N_{A}) \approx m \times ψ (1 - ψ) + m (m - 1) \times {[ϕ (\frac{μ - q_{α} σ}{\sqrt{ω^{2} + (1 - ρ) σ^{2}}})]}^{2} \times \frac{τ^{2} + ρ σ^{2}}{ω^{2} + (1 - ρ) σ^{2}} . \end{array}$ If the amount of common variation is small, so $τ^{2}$ is near zero and $ρ$ is near zero, then the contribution of the second term will be small, and $N_{A}$ will act more or less like a binomial random variable with size $m$ and probability $ψ$ . On the other hand, if the amount of independent variation in the effect sizes is small, so $ω^{2}$ is near zero and $ρ$ is near 1, then the term on the right will approach $m (m - 1) ψ (1 - ψ)$ and $Var (N_{A})$ will approach $m^{2} ψ (1 - ψ)$ , or the variance of $m$ times a single Bernoulli variate. So you could say that $N_{A}$ has anywhere between $1$ and $m$ variate’s worth of information in it, depending on the degree of correlation between the effect size estimates.

Interactive distribution

Here is an interactive graph of the probability mass function of $N_{A}$ , with probability points calculated using Gaussian quadrature. Below the graph, I also report $ψ$ , the exact mean and variance of $N_{A}$ , and the first-order approximation to the variance (denoted $V_{approx}). When $τ > 0$ and $ρ > 0$ , the approximate variance is not all that accurate because the first-order approximation to $Var (π)$ isn’t that good.

Code

math = require("mathjs")
norm = import('https://unpkg.com/norm-dist@3.1.0/index.js?module')

quad_points = JSON.parse(all_quad_points).at(qp - 1)

sigma = 2 / math.sqrt(ESS)

zeta_sd = math.sqrt(tau**2 + rho * sigma**2)
ID_sd = math.sqrt(omega**2 + (1 - rho) * sigma**2)

crit = norm.icdf(1 - alpha)

binomial_coefs = Array(m+1).fill(null).map((x,index) => {
  return math.combinations(m, index);
})

probs = quad_points.map(zeta => {
  let Z = (zeta[0] * zeta_sd + mu - crit * sigma) / ID_sd;
  return [norm.cdf(Z), zeta[1]];
})

p_binom_norm = binomial_coefs.map((coef, a) => {
  let p = probs.map((x) => {
    return (x[0]**a) * ((1 - x[0])**(m - a)) * x[1];
  });
  return coef * math.sum(p);
})

math = Object {isNumber: ƒ(e), isComplex: ƒ(e), isBigNumber: ƒ(e), isBigInt: ƒ(e), isFraction: ƒ(e), isUnit: ƒ(e), isString: ƒ(e), isArray: ƒ(), isMatrix: ƒ(e), isCollection: ƒ(e), isDenseMatrix: ƒ(e), isSparseMatrix: ƒ(e), isRange: ƒ(e), isIndex: ƒ(e), isBoolean: ƒ(e), isResultSet: ƒ(e), isHelp: ƒ(e), isFunction: ƒ(e), isDate: ƒ(e), isRegExp: ƒ(e), …}

norm = Module {Z: ƒ(…), cdf: ƒ(z), icdf: ƒ(…), intE: ƒ(a, b), pdf: ƒ(z), Symbol(Symbol.toStringTag): "Module"}

quad_points = Array(21) [Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), …]

sigma = 0.22360679774997896

zeta_sd = 0.19999999999999998

ID_sd = 0.17320508075688773

crit = 1.959963986120195

binomial_coefs = Array(7) [1, 6, 15, 20, 15, 6, 1]

probs = Array(21) [Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), Array(2), …]

p_binom_norm = Array(7) [0.3632983110161876, 0.18642239532408006, 0.13279056012006735, 0.10429406708513911, 0.0848789783106351, 0.0702147931381166, 0.058003374919324134]

Code

psi = norm.cdf((mu - crit * sigma) / math.sqrt(tau**2 + omega**2 + sigma**2))
psi_print = psi.toFixed(3)

E_NA = m * psi 
E_NA_print = E_NA.toFixed(3)

dpi_dzeta = norm.pdf((mu - crit * sigma) / ID_sd)
V_pi_approx = (dpi_dzeta * zeta_sd / ID_sd)**2
V_approx = m * psi * (1 - psi) + m * (m - 1) * V_pi_approx
V_approx_print = V_approx.toFixed(3)

V_NA = {
  let V_NA = 0 - E_NA**2;
  for (let i = 0; i <= m; i++) {
    V_NA += i**2 * p_binom_norm[i];
  }
  return V_NA;
}
V_NA_print = V_NA.toFixed(3)

psi = 0.3006337901985208

psi_print = "0.301"

E_NA = 1.803802741191125

E_NA_print = "1.804"

dpi_dzeta = 0.290096539705762

V_pi_approx = 0.11220800313234232

V_approx = 4.627758780306626

V_approx_print = "4.628"

V_NA = 3.60408188896073

V_NA_print = "3.604"

Distribution of $N_{A}$

Code

Plot.plot({
  x: {
    label: "Number of significant effect sizes"
  },
  y: {
    domain: [0, 1],
    label: "Probability"
  },
  marks: [
    Plot.ruleY(0),
    Plot.barY(p_binom_norm, {
      fill: "steelblue"
    }),
  ]
})

Moments of $N_{A}$

\begin{aligned} \psi &= 0.301 \\ \mathbb{E}\left(N_A\right) &= 1.804 \\ \mathbb{V}\left(N_A\right) &= 3.604 &V_{approx} &= 4.628 \end{aligned}

Code

viewof m = Inputs.range(
  [1, 30], 
  {value: 6, step: 1, label: "Number of effect sizes (m):"}
)

viewof ESS = Inputs.range(
  [4, 300], 
  {value: 80, step: 1, label: "Effective sample size:"}
)

viewof mu = Inputs.range(
  [-2, 2], 
  {value: 0.3, step: 0.01, label: "Average effect size (mu):"}
)

viewof tau = Inputs.range(
  [0, 1], 
  {value: 0.1, step: 0.01, label: "Between-study SD (tau):"}
)

viewof omega = Inputs.range(
  [0, 1], 
  {value: 0.1, step: 0.01, label: "Within-study SD (omega):"}
)

viewof rho = Inputs.range(
  [0, 1], 
  {value: 0.6, step: 0.01, label: "Sampling error correlation (rho):"}
)

viewof alpha = Inputs.range(
  [0.005, 0.995], 
  {value: 0.025, step: .005, label: "One-sided significance threshold (alpha):"}
)

viewof qp = Inputs.range(
  [1, 30], 
  {value: 21, step: 1, label: "Number of quadrature points:"}
)

m = 6

ESS = 80

mu = 0.3

tau = 0.1

omega = 0.1

rho = 0.6

alpha = 0.025

qp = 21

Back to top

Compound symmetry to the rescue

Just the moments, please

Interactive distribution

Distribution of NA

Moments of NA

Distribution of $N_{A}$

Moments of $N_{A}$