p.16-17
Introduction
This part explains the detailed steps of calculating Little’s test
statistic to check for the Missing Completely at Random (MCAR)
assumption, as described in the provided example. The test is based on
comparing the means of observed and missing data patterns, with the
computation of the test statistic TL and its interpretation.
Step 1: Define the Grand Means and Covariance Matrix
From the given example, the maximum likelihood estimates of the grand
means μ and the variance-covariance matrix Σ are as follows:
μ^=[20.3114.29],Σ^=[27.27−13.80−13.8036.15]20.31 is the mean of the first variable (denoted as v1), i.e.,
the average value for the first variable across all observations.
14.29 is the mean of the second variable (denoted as v2), i.e.,
the average value for the second variable across all observations.
27.27 is the variance of the first variable (v1), representing
the spread of values for the first variable.
36.15 is the variance of the second variable (v2), representing
the spread of values for the second variable.
-13.80 is the covariance between the first and second variables.
This tells us how the two variables vary together. A negative value
indicates that as one variable increases, the other tends to decrease.
These are the benchmarks against which we compare the pattern-specific
means for the observed data.
Step 2: Define the Pattern-Specific Means
There are two missing data patterns in the dataset, represented by the
two groups. For this example, the two groups have the following
arithmetic means:
Group 1 (for v1=2,v2=1):
Y1=[23.2712.79]Group 2 (for v1=2,v2=0):
Y2=[17.19NA]Step 3: Compute the Test Statistic
The formula for the test statistic TL is as follows:
TL=g=1∑Gng(Yg−μg)⊤Σ−1(Yg−μg)Where:
- ng is the number of cases in group g,
- Yg is the group-specific mean,
- μg is the grand mean for the variable,
- Σ−1 is the inverse of the covariance matrix.
For Group 1:
The difference between Group 1’s mean and the grand mean is:
Y1−μ=[23.2712.79]−[20.3114.29]=[2.96−1.50](Y1−μ)⊤=[2.96−1.50]Σ=[27.27−13.80−13.8036.15]For any 2x2 matrix:
A=[acbd]The inverse of this matrix A−1 is given by:
A−1=det(A)1[d−c−ba]Where det(A) is the determinant of the matrix, and it is
calculated as:
det(A)=ad−bcThe determinant of Σ, denoted det(Σ), is calculated
as:
det(Σ)=(27.27)(36.15)−(−13.80)(−13.80)det(Σ)=985.8105−190.44=795.3705Σ−1=795.37051[36.1513.8013.8027.27]Σ−1=[795.370536.15795.370513.80795.370513.80795.370527.27]Σ−1=[0.045450520.01735040.01735040.03428591]First, multiply:[2.96−1.50]×[0.045450520.01735040.01735040.03428591][(2.96)(0.04545052)+(−1.50)(0.0173504)(2.96)(0.0173504)+(−1.50)(0.03428591)]=[0.1085079−0.000071681]Then:[0.1085079−0.000071681]×[2.96−1.50](0.1085079)(2.96)+(−0.000071681)(−1.50)=0.3211834+0.0001075215=0.3212909This is the squared difference for Group 1. By multiplying this by
n1=141, you get 45.30202.
For Group 2:
The difference between Group 2’s mean and the grand mean is:
Y2−μ=[17.19NA]−[20.3114.29]=[−3.12NA](Y2−μ)⊤=[−3.12NA]For missing data, we can’t compute the full Mahalanobis distance, but
for the observed part, we compute:
Varobs(Yobs−μobs)2Which here becomes:
27.27(17.19−20.31)2134×27.27(17.19−20.31)2=47.8331353136For this example:
TL=(Y1−μ)⊤Σ−1(Y1−μ)+(Y2−μ)⊤Σ−1(Y2−μ)Substituting the values:
TL=45.30202+47.8331353136=93.13516Step 4: Interpretation of the Result
The computed test statistic TL is 93.13516, which is compared
against a chi-square distribution with degrees of freedom
u=(2+1)−2=1.
Given that this test statistic is statistically significant (with
p<0.001), we can conclude that the missing data is not Missing
Completely at Random (MCAR).
Conclusion
This computation demonstrates the process of using Little’s test to
evaluate the MCAR assumption in a dataset. The test statistic value
indicates that the missing data in this example is not MCAR.