Negative Binomial Distribution

율 · March 19, 2025


The Negative Binomial (NB) distribution you’re using (the one fitted by MASS::glm.nb()) is parameterized by two quantities:

$μ$ = expected count (mean)
$θ$ = size parameter, $θ > 0$ (often loosely called the dispersion parameter; note that a smaller $θ$ means more overdispersion)

🔢 Probability Mass Function (PMF)

For $y = 0,1,2,\dots$, the NB‑2 PMF (MASS’s “size–mu” parameterization) is

$$
P(Y = y) = \frac{\Gamma(y + θ)}{\Gamma(θ)\,y!} \left(\frac{θ}{θ + μ}\right)^{θ} \left(\frac{μ}{θ + μ}\right)^{y}.
$$
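The PMF above can be evaluated directly. The following is a minimal sketch in Python using only the standard library; the function name `nb_pmf` and the values `mu = 3`, `theta = 2` are illustrative assumptions, not part of the original post. It works on the log scale with `math.lgamma` so that large $y$ or $θ$ do not overflow the gamma function. In R, the same quantity should be available as `dnbinom(y, size = theta, mu = mu)`.

```python
import math


def nb_pmf(y: int, mu: float, theta: float) -> float:
    """NB-2 PMF in the (mu, theta) parameterization; requires mu > 0, theta > 0."""
    if y < 0:
        return 0.0
    # log of Gamma(y + theta) / (Gamma(theta) * y!)
    log_coef = math.lgamma(y + theta) - math.lgamma(theta) - math.lgamma(y + 1)
    # log of (theta / (theta + mu))^theta * (mu / (theta + mu))^y
    log_tail = theta * math.log(theta / (theta + mu)) + y * math.log(mu / (theta + mu))
    return math.exp(log_coef + log_tail)


# Illustrative parameter values (assumed for this example only)
mu, theta = 3.0, 2.0
probs = [nb_pmf(y, mu, theta) for y in range(500)]
total = sum(probs)

print(f"P(Y=0) = {probs[0]:.4f}")
print(f"sum of PMF over y = 0..499: {total:.6f}")
```

With these values, $P(Y=0) = (θ/(θ+μ))^{θ} = (2/5)^2$, which gives a quick hand check on the implementation.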

$\Gamma(\cdot)$ = gamma function
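$θ/(θ+μ)$ = probability of “success” (the prob argument in R’s dnbinom() convention, where $Y$ counts the failures before the $θ$‑th success)
$μ/(θ+μ)$ = probability of “failure” in the same convention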

📈 Mean & Variance

$$
E[Y] = μ, \qquad \mathrm{Var}(Y) = μ + \frac{μ^2}{θ}.
$$

Because $\frac{μ^2}{θ} > 0$ whenever $μ > 0$ and $θ$ is finite, Var(Y) > μ, which makes the NB an inherently overdispersed generalization of the Poisson.
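One way to see where the variance formula comes from is the Poisson–Gamma mixture representation of the NB, in which $Y \mid \lambda \sim \mathrm{Poisson}(\lambda)$ and $\lambda$ follows a Gamma distribution with shape $θ$ and scale $μ/θ$, so that $E[\lambda] = μ$ and $\mathrm{Var}(\lambda) = μ^2/θ$. A short sketch of the argument, using the laws of total expectation and total variance:

$$
\begin{aligned}
E[Y] &= E\big[E[Y \mid \lambda]\big] = E[\lambda] = μ, \\
\mathrm{Var}(Y) &= E\big[\mathrm{Var}(Y \mid \lambda)\big] + \mathrm{Var}\big(E[Y \mid \lambda]\big) = E[\lambda] + \mathrm{Var}(\lambda) = μ + \frac{μ^2}{θ}.
\end{aligned}
$$

The first term, $μ$, is the ordinary Poisson variance; the second term, $μ^2/θ$, is the extra variance contributed by the randomness of $\lambda$.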

📊 Role of θ

| θ (size) | Interpretation | Var(Y) relative to μ |
|---|---|---|
| → ∞ | Gamma mixing variance → 0 | Var ≈ μ (Poisson limit) |
| large (≫ μ) | Weak overdispersion | Var slightly > μ |
| small (≪ μ) | Strong overdispersion | Var ≫ μ |

$θ$ is the shape parameter of the Gamma distribution in the Poisson–Gamma mixture view:
1. Draw $\lambda \sim \mathrm{Gamma}(\text{shape}=θ,\;\text{scale}=μ/θ)$
2. Then $Y \mid \lambda \sim \mathrm{Poisson}(\lambda)$
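The two-step recipe above can be simulated to check that it reproduces the NB mean and variance. This is a rough sketch under stated assumptions: the helper names `sample_poisson` and `sample_nb`, the seed, the sample size, and the values `mu = 3`, `theta = 2` are all made up for illustration. Because Python's standard library has no Poisson sampler, the sketch uses Knuth's multiplication method, which is only suitable for small to moderate $\lambda$.

```python
import math
import random
import statistics


def sample_poisson(lam: float, rng: random.Random) -> int:
    """Knuth's multiplication method; adequate for small-to-moderate lam."""
    threshold = math.exp(-lam)
    k = 0
    prod = rng.random()
    while prod > threshold:
        k += 1
        prod *= rng.random()
    return k


def sample_nb(mu: float, theta: float, rng: random.Random) -> int:
    """Draw one NB variate via the Poisson-Gamma mixture."""
    # Step 1: lambda ~ Gamma(shape=theta, scale=mu/theta)
    # (random.gammavariate takes shape first, then scale)
    lam = rng.gammavariate(theta, mu / theta)
    # Step 2: Y | lambda ~ Poisson(lambda)
    return sample_poisson(lam, rng)


# Illustrative settings (assumed for this example only)
mu, theta, n = 3.0, 2.0, 20_000
rng = random.Random(0)
draws = [sample_nb(mu, theta, rng) for _ in range(n)]

mean_hat = statistics.fmean(draws)
var_hat = statistics.variance(draws)
print(f"sample mean     = {mean_hat:.3f}  (theory: {mu})")
print(f"sample variance = {var_hat:.3f}  (theory: {mu + mu**2 / theta})")
```

The sample mean should land near $μ$ and the sample variance near $μ + μ^2/θ$, up to Monte Carlo error.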

⚠️ Why $θ>0$?

Gamma(shape = $θ$) is only defined for $θ>0$
Ensures the PMF is non-negative and sums to 1
Guarantees Var(Y) > μ (no underdispersion)

As $θ\to∞$, the NB converges to the Poisson (Var(Y) → μ); as $θ\to 0$, the variance $μ + μ^2/θ$ grows without bound (extreme overdispersion).
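The Poisson limit can be checked numerically by comparing the NB PMF with the Poisson PMF as $θ$ grows. The sketch below is illustrative only: the function names, the grid of $θ$ values, the truncation point `y_max = 200`, and `mu = 3` are assumptions chosen for the example.

```python
import math


def nb_pmf(y: int, mu: float, theta: float) -> float:
    """NB-2 PMF in the (mu, theta) parameterization, computed on the log scale."""
    log_p = (
        math.lgamma(y + theta) - math.lgamma(theta) - math.lgamma(y + 1)
        + theta * math.log(theta / (theta + mu))
        + y * math.log(mu / (theta + mu))
    )
    return math.exp(log_p)


def poisson_pmf(y: int, mu: float) -> float:
    """Poisson PMF with mean mu, computed on the log scale."""
    return math.exp(-mu + y * math.log(mu) - math.lgamma(y + 1))


def max_abs_diff(mu: float, theta: float, y_max: int = 200) -> float:
    """Largest pointwise gap between the NB and Poisson PMFs on 0..y_max."""
    return max(
        abs(nb_pmf(y, mu, theta) - poisson_pmf(y, mu)) for y in range(y_max + 1)
    )


mu = 3.0  # illustrative mean (assumed)
for theta in (1.0, 10.0, 100.0, 1e4, 1e6):
    var_ratio = (mu + mu**2 / theta) / mu  # Var(Y) / mu = 1 + mu / theta
    gap = max_abs_diff(mu, theta)
    print(f"theta = {theta:>9g}   Var/mu = {var_ratio:.6f}   max |NB - Poisson| = {gap:.2e}")
```

The variance ratio $1 + μ/θ$ approaches 1 and the gap between the two PMFs shrinks as $θ$ increases, while small $θ$ gives a distribution that is clearly different from the Poisson.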
