|
|
Partial correlation still requires meeting all the usual assumptions of Pearsonian correlation: linearity of relationships, the same level of relationship throughout the range of the independent variable ("homoscedasticity"), interval or near-interval data, and data whose range is not truncated.
Partial correlation is common when there is only one control variable but is sometimes used when there are two or three. For large models, researchers use path analysis or structural equation modeling when data are near or at interval level, or use log-linear modeling for lower-level data. Newer versions of structural equation modeling software allow variables of any type on either side of the equation.
.
Where partial correlation is the correlation of the independent and dependent variables after controlling both for control variables, semi-partial or part correlation is the correlation the independent with the dependent, controlling only the independent variable for control variables.
Semi-partial or part correlation is the basis for multiple regression. Regression coefficients are semi-partial coefficients. In multiple regression, the squared part correlation is the proportion of the total variance in the dependent variable accounted for by adding the given independent variable to those already entered in the multiple regression formula. This is equivalent to saying that the part correlation of Y as dependent with X1 as independent, controlling for X2 = r2Y(X1.X2) = R2Y.X1X2 - R2Y.X2. Let job satisfaction (J) be the dependent. Let education (E) be the independent and let salary (S) be the control variable. The partial correlation of E with J controlling for S is written rJE.S and when squared is interpreted as the percent of unique variance in J uniquely accounted for by E, after both J and E are controlled by S. Partial correlation is thus the correlation of the residual of J with the residual of E. The semi-partial correlation of E with J controlling for S is written rJ(E.S) and when squared is interpreted as the percent of total (unique plus joint) variance in J uniquely accounted for by E and not by S. Thus semi-partial correlation is the correlation of the residual of E with unadjusted J.
For a given independent variable (IV), part correlation first removes from that IV all variance which may be accounted for by control IVs (ex., other IVs in a regression model), then correlates the remaining unique component of the IV with the dependent variable (DV). Part correlation will always be less than the partial correlation, except that it will be equal if the control variable is unrelated to the IV. Partial correlation, in contrast, removes from both the given IV and the DV all variance accounted for by the control IVs, then correlates the unique component of the IV with the unique component of the DV. That is, the common variance of the control variables is removed from just the independent variable in part correlation, whereas in partial correlation it is removed from both the independent and dependent variables. Partial correlation is always larger than the corresponding part correlation because in partial correlation, variance is removed from the DV.
| Gender Record |
Male | Female |
|---|---|---|
| Arrest | a | b |
| No arrest | c | d |
Yule's Q = (ad - bc)/(ad + bc). If our hypothesis is that males are more likely to have arrest records, then ad is concordant pairs (pairs consistent with out hypothesis) and bc is discordant pairs (inconsistent with our hypothesis). Thus Yule's Q is the surplus of concordant over discordant pairs, as a percentage of all pairs (not counting tied pairs like ab and cd). That is, Yule's Q represents the probability that, when we draw two units (a pair) from our population excluding ties, that pair will be consistent with our hypothesis.
Partial Q is simply Q for those pairs of i and j that are tied on a dichotomous control variable, k. Let k be high school diploma/no diploma, in the table below:
| Diploma | Gender Record |
Male | Female |
|---|---|---|
| Arrest | a | b |
| No arrest | c | d | No diploma | Gender Record |
Male | Female |
| Arrest | a' | b' |
| No arrest | c' | d' |
Qij.k = [ad-bc)+(a'd'-b'c')]/[(ad+bc)+(a'd'+b'b')]
Partial association may be constructed in an analogous manner for other measures of association, such as gamma for ordinal data (Yule's Q is gamma for the 2x2 case).
Copyright 1998, 2008 by G. David Garson.
Last update, 2/26/08.