|
|
Overview
Multiple analysis of variance (MANOVA) is used to see the main and interaction effects of categorical variables on multiple dependent interval variables. MANOVA uses one or more categorical independents as predictors, like ANOVA, but unlike ANOVA, there is more than one dependent variable. Where ANOVA tests the differences in means of the interval dependent for various categories of the independent(s), MANOVA tests the differences in the centroid (vector) of means of the multiple interval dependents, for various categories of the independent(s). One may also perform planned comparison or post hoc comparisons to see which values of a factor contribute most to the explanation of the dependents. There are multiple potential purposes for MANOVA.
Multiple analysis of covariance (MANCOVA) is similar to MANOVA, but interval independents may be added as "covariates." These covariates serve as control variables for the independent factors, serving to reduce the error term in the model. Like other control procedures, MANCOVA can be seen as a form of "what if" analysis, asking what would happen if all cases scored equally on the covariates, so that the effect of the factors over and beyond the covariates can be isolated. The discussion of concepts in the ANOVA section also applies, including the discussion of assumptions. See also:
|
|
Though not illustrated here, output for multivariate GLM contains lack of fit tests as in univariate GLM, used when the researcher has employed a custom (non-full factorial) model and wishes to test if terms present in the full factorial model but absent in the custom model should have been included. See the lack of fit test output in the section on univariate GLM.
The more the means of the dependents vary by factor level, the stronger the relation of the factor to the dependent. In the example above, means vary only a little, indicating a weak relationship. By examining the overlap of upper and lower bounds one finds that most regional comparisons are not significant. An exception is Southeast vs. West for "Having trouble with one's boss."
In the Tests of Between Subjects Effects table, partial eta-square serves as an effect size measure. The noncentrality index is used to compute the power level, which by rule of thumb should be equal or greater than .80 to accept with confidence that the chance of Type II error is low enough for a finding of non-significance by the F test (that is, to be confident that the relationship does not exist). Thus by the F test, we fail to reject the null hypothesis that region is unrelated to being unemployed. However, because the power of this test is well below .80, we acknowledge that the chance of Type II error is insufficiently low to be confident in this judgment.
The t-test significance for levels of a factor (ex., region in the figure above) refers to the significance of the contrast between the given level and the reference level (the last category if the default is accepted), in this case Region 3 = West. For covariates (ex., age above), the t-test significance level will be the same in level as in the Tests of Between Subjects Effects table above and refers to whether or not the covariate contributes significantly to the model.
Each canonical root represents a dimension of meaning, but what? What meaningful label do we give to each canonical root (which SPSS labels merely, 1, 2, etc.)? In factor analysis one ascribes a label to each factor based on the factor loadings of each measured variable on the factor. In MANOVA, this is done on multiple bases, using the raw weights, standardized weights, and structure correlations. The structure correlations are often the most useful for this purpose when there is more than one significant canonical root. Structure correlations are the correlations between the measured variables and the canonical roots. In MANOVA, there will be one set of MDA output for each main and interaction effect.
For the credit card case on the right, for observed by predicted there is a mostly random pattern, indicative of a weak or nonexistent relationship between the predictors and the dependent. The upward linear pattern of residuals by observed shows that as the observed value of credit card debt goes up, model error increases. For the example on the right for a dichotomous dependent, as the observed value goes from 0 to 1, predicted values are a bit higher but the overlap is very large, again indicating a weak or possibly nonsignificant relationship. With a dichotomy, the residual plots cannot assume a random cloud pattern, but it is again seen that as observed increases in value, so does residual (error) value.
In profile plots, additional factors can be represented by additional lines, as shown below, where parallel lines indicate no interaction and crossing lines (as in this example) indicate interaction among the factors (here, interaction between education and defaulting on a loan, when dependent variables are types of debt).
Note that profile analysis in MANOVA has been superceded to some extent by multidimensional scaling, mixed model ANOVA, and/or random effects regression models.
/LMATRIX = GENDER 1 -1 /MMATRIX t1 1 t2 -1; t2 1 t3 -1; t3 1 t4 -1.
/LMATRIX = gender 1 -1 /MMATRIX = t1 .25 t2 .25 t3 .25 t4 .25.
/LMATRIX = INTERCEPT 1 gender .5 .5 /MMATRIX t1 1 t2 -1; t2 1 t3 -1; t3 1 t4 -1.The LMATRIX command specifies a contrast between the two eqaully weighted values of gender and the intercept. The MMATRIX command asks for contrasts between t1 and t2, between t2 and t3, and between t3 and t4. SPSS output will be in a section labeled "Custom Hypothesis Tests."
GLM dependent varlist [BY factor list [WITH covariate list]]
[/WSFACTOR=name levels [{DEVIATION [(refcat)] }] name...
{SIMPLE [(refcat)] }
{DIFFERENCE }
{HELMERT }
{REPEATED }
{POLYNOMIAL [({1,2,3...})]**}
{ {metric } }
{SPECIAL (matrix) }
[/MEASURE=newname newname...]
[/WSDESIGN=effect effect...]†
[/RANDOM=factor factor...]
[/REGWGT=varname]
[/METHOD=SSTYPE({1 })]
{2 }
{3**}
{4 }
[/INTERCEPT=[INCLUDE**] [EXCLUDE]]
[/MISSING=[INCLUDE] [EXCLUDE**]]
[/CRITERIA=[EPS({1E-8**})][ALPHA({0.05**})]
{a } {a }
[/PRINT = [DESCRIPTIVE] [HOMOGENEITY] [PARAMETER][ETASQ] [RSSCP]
[GEF] [LOF] [OPOWER] [TEST [([SSCP] [LMATRIX] [MMATRIX])]]
[/PLOT=[SPREADLEVEL] [RESIDUALS]
[PROFILE (factor factor*factor factor*factor*factor ...)]
[/TEST=effect VS {linear combination [DF(df)]}]
{value DF (df) }
[/LMATRIX={["label"] effect list effect list ...;...}]
{["label"] effect list effect list ... }
{["label"] ALL list; ALL... }
{["label"] ALL list }
[/CONTRAST (factor name)={DEVIATION[(refcat)]** ‡ }]
{SIMPLE [(refcat)] }
{DIFFERENCE }
{HELMERT }
{REPEATED }
{POLYNOMIAL [({1,2,3...})]}
{ {metric } }
{SPECIAL (matrix) }
[/MMATRIX= {["label"] depvar value depvar value ...;["label"]...}]
{["label"] depvar value depvar value ... }
{["label"] ALL list; ["label"] ... }
{["label"] ALL list }
[/KMATRIX= {list of numbers }]
{list of numbers;...}
[/POSTHOC = effect [effect...]
([SNK] [TUKEY] [BTUKEY][DUNCAN]
[SCHEFFE] [DUNNETT(refcat)] [DUNNETTL(refcat)]
[DUNNETTR(refcat)] [BONFERRONI] [LSD] [SIDAK]
[GT2] [GABRIEL] [FREGW] [QREGW] [T2] [T3] [GH][C]
[WALLER ({100** })]]
{kratio}
[VS effect]
[/EMMEANS=TABLES({OVERALL })] [COMPARE ADJ(LSD) (BONFERRONI) (SIDAK)]
{factor }
{factor*factor... }
{wsfactor }
{wsfactor*wsfactor ... }
{factor*...wsfactor*...}
[/SAVE=[tempvar [(list of names)]] [tempvar [(list of names)]]...]
[DESIGN]
[/OUTFILE=[{COVB('savfile'|'dataset')}]
{CORB('savfile'|'dataset')}
[EFFECT('savfile'|'dataset')] [DESIGN('savfile'|'dataset')]
[/DESIGN={[INTERCEPT...] }]
{[effect effect...]}
† WSDESIGN uses the same specification as DESIGN, with only within-subjects factors.
‡ DEVIATION is the default for between-subjects factors, while POLYNOMIAL is the default for within-subjects factors.
** Default if the subcommand or keyword is omitted.
MANOVA dependent varlist [BY factor list (min,max)[factor list...]
[WITH covariate list]]
[/WSFACTORS=varname (levels) [varname...] ]
[/WSDESIGN]*
[/TRANSFORM [(dependent varlist [/dependent varlist])]=
[ORTHONORM] [{CONTRAST}] {DEVIATION (refcat) } ]
{BASIS } {DIFFERENCE }
{HELMERT }
{SIMPLE (refcat) }
{REPEATED }
{POLYNOMIAL [({1,2,3...})]}
{ {metric } }
{SPECIAL (matrix) }
[/MEASURE=newname newname...]
[/RENAME={newname} {newname}...]
{* } {* }
[/ERROR={WITHIN } ]
{RESIDUAL }
{WITHIN + RESIDUAL}
{n }
[/CONTRAST (factorname)={DEVIATION** [(refcat)] }] †
{POLYNOMIAL**[({1,2,3...})]}
{ {metric } }
{SIMPLE [(refcat)] }
{DIFFERENCE }
{HELMERT }
{REPEATED }
{SPECIAL (matrix) }
[/PARTITION (factorname)[=({1,1... })]]
{n1,n2...}
[/METHOD=[{UNIQUE** }] [{CONSTANT**}] [{QR** }]]
{SEQUENTIAL} {NOCONSTANT} {CHOLESKY}
[/{PRINT }= [CELLINFO [([MEANS] [SSCP] [COV] [COR] [ALL])]]
{NOPRINT} [HOMOGENEITY [([ALL] [BARTLETT] [COCHRAN] [BOXM])]]
[DESIGN [([OVERALL] [ONEWAY] [DECOMP] [BIAS] [SOLUTION]
[REDUNDANCY] [COLLINEARITY] [ALL])]]
[PARAMETERS [([ESTIM] [ORTHO][COR][NEGSUM][EFSIZE][OPTIMAL][ALL])]]
[SIGNIF [[(SINGLEDF)]
[(MULTIV**)] [(EIGEN)] [(DIMENR)]
[(UNIV**)] [(HYPOTH)][(STEPDOWN)] [(BRIEF)]
[{(AVERF**)}] [(HF)] [(GG)] [(EFSIZE)]]
{(AVONLY) }
[ERROR[(STDDEV)][(COR)][(COV)][(SSCP)]]
[/OMEANS =[VARIABLES(varlist)] [TABLES ({factor name }] ]
{factor BY factor}
{CONSTANT }
[/PMEANS =[VARIABLES(varlist)] [TABLES ({factor name })] [PLOT]] ]
{factor BY factor}
{CONSTANT }
[/RESIDUALS=[CASEWISE] [PLOT] ]
[/POWER=[T({.05**})] [F({.05**})] [{APPROXIMATE}]]
{a } {a } {EXACT }
[/CINTERVAL=[{INDIVIDUAL}][({.95}) ]
{JOINT } {a }
[UNIVARIATE ({SCHEFFE})]
{BONFER }
[MULTIVARIATE ({ROY })] ]
{PILLAI }
{BONFER }
{HOTELLING}
{WILKS }
[/PCOMPS [COR] [COV] [ROTATE(rottype)]
[NCOMP(n)] [MINEIGEN(eigencut)] [ALL] ]
[/PLOT=[BOXPLOTS] [CELLPLOTS] [NORMAL] [ALL] ]
[/DISCRIM [RAW] [STAN] [ESTIM] [COR] [ALL]
[ROTATE(rottype)] [ALPHA({.25**})]]
{a }
[/MISSING=[LISTWISE**] [{EXCLUDE**}] ]
{INCLUDE }
[/MATRIX=[IN({file})] [OUT({file})]]
{[*] } {[*] }
[/ANALYSIS [({UNCONDITIONAL**})]=[(]dependent varlist
{CONDITIONAL } [WITH covariate varlist]
[/dependent varlist...][)][WITH varlist] ]
[/DESIGN={factor [(n)] }[BY factor[(n)]] [WITHIN factor[(n)]][WITHIN...]
{[POOL(varlist)}
[+ {factor [(n)] }...]
{POOL(varlist)}
[[= n] {AGAINST} {WITHIN }
{VS } {RESIDUAL}
{WR }
{n }
[{factor [(n)] } ... ]
{POOL(varlist)}
[MWITHIN factor(n)]
[MUPLUS]
[CONSTANT [=n] ]
* WSDESIGN uses the same specification as DESIGN, with only within-subjects factors.
† DEVIATION is the default for between-subjects factors, while POLYNOMIAL is the default for within-subjects factors.
** Default if the subcommand or keyword is omitted.
This command reads the active dataset and causes execution of any pending commands. See Command Order for more information.
Example 1
* Analysis of Variance
MANOVA RESULT BY TREATMNT(1,4) GROUP(1,2).
Example 2
* Analysis of Covariance
MANOVA RESULT BY TREATMNT(1,4) GROUP(1,2) WITH RAINFALL.
Example 3
* Repeated Measures Analysis
MANOVA SCORE1 TO SCORE4 BY CLASS(1,2)
/WSFACTORS=MONTH(4).
Example 4
* Parallelism Test with Crossed Factors
MANOVA YIELD BY PLOT(1,4) TYPEFERT(1,3) WITH FERT
/ANALYSIS YIELD
/DESIGN FERT, PLOT, TYPEFERT, PLOT BY TYPEFERT,
FERT BY PLOT + FERT BY TYPEFERT
+ FERT BY PLOT BY TYPEFERT.
The write-up would first state the results of the overall test of inter-group differences: "The Wilks Lambda multivariate test of overall differences among groups was statistically significant (p=0.032)." Depending on the publication, the parenthetical term might contain the F parameters and value (ex., "F(4,90) = 6.28"). Optionally, one might also state "The F statistic for Wilks' Lambda was exact.", which is reported under the relevant SPSS table. If there had been more than one factor, then the overall test is reported similarly for each factor. Also, one should always report effect size as well as significance. Before running the analysis, under the Options button, check "Estimates of effect size," which will cause partial eta-squared to be reported. State, "Although significant, the effect size of this relationship was weak as indicated by partial eta-squared = ..11).
The "Tests of Between-Subjects Effects" table following the overall multivariate tests gives significance levels and partial eta-squared for each dependent. The write-up would state something like, "Univariate between-subjects tests showed that media exposure was significantly and moderately related to recycling paper (p=.0001; partial eta-squared = .21) and cans (p=.041; partial eta-squared = .10), but not to plastics (p=.421; partial eta-squared = .01)."
Since the overall test when significant only shows at least one group is significantly different from another, the next research statement in the write-up must report post-hoc contrast tests among groups. For instance, in the table of "Multiple Comparisons" SPSS prints the mean difference on the dependent variable between any two groups and its corresponding significance. After using this test, the researcher might state, "Post-hoc comparisons between groups using F statistics and Bonferroni-type simultaneous confidence intervals based on Student's t distribution also showed that the movie treatment group was significantly related to the recycling of paper (p=0.002) and cans (p=0.01) but not plastics (p=.055)." A similar statement would be made about the pamphlet group. The control group, which is the omitted category, would not have a corresponding statement. There are no partial eta-squareds to report. Optionally, one might list F parameters and values.
Line 1: The MANOVA command word is followed by the three variables opinion1, opinion2, and opinion3. These represent the three levels of the within-subjects factor Ideology. The BY keyword tells SPSS that what follows are the groups or between-subjects factors; in this case, EducLevel and SESlevel. Following each of the two between-subjects factors are two numbers between parentheses. SESlevel (1,3) simply means that the variable SESlevel has three levels coded as 1, 2, and 3. One may have no grouping variable and thus no BY clause.
Line 2: The slash mark indicates a subcommand. The WSFACTORS subcommand, tells SPSS that there is one repeated factor called Ideology and that it has three levels (matching the three opinion measurements listed after the MANOVA keyword). This is needed by SPSS to interpret the list of dependent variables in line 1. The WSFACTORS subcommand follows the MANOVA command when there is a within-subjects factor, which is to say when there is a repeated measures design.
Line 3: The WSDESIGN subcommand tells SPSS to test the within-subjects hypotheses for repeated measures designs.
Line 4: The PRINT subcommand specifies the output. CELLINFO (MEANS) prints cell means and standard deviations used to evaluate patterns in the data. Many additional statistics could be requested.
Line 5: The DESIGN subcommand causes SPSS to test the between-subjects hypotheses.
The general MANOVA syntax, from the SPSS manual, is:
MANOVA
depvarlist [BY indvarlist (min,max) [indvarlist (min, max) ...]
[WITH covarlist]]
[/WSFACTORS = name (levels) name...]
[/{PRINT | NOPRINT} = [CELLINFO [(MEANS SSCP COV COR ALL)]]
[HOMOGENEITY [(BARTLETT COCHRAN BOXM ALL)]]
[SIGNIF (MULTIV UNIV AVERF AVONLY EFSIZE ALL)]]
[/OMEANS [VARIABLES(varlist)] [TABLES ([CONSTANT] [factor BY factor])]]
[/CONTRAST(factor) = {POLYNOMIAL[(#)] | SPECIAL(k1s + contrasts)}]
[/CONTR ...}
[/WSDESIGN = effect ...] [/DESIGN = effect ...]
Notes on Effects
Keywords: BY, W or WITHIN, MWITHIN
Varname(#): # = one of k-1 contrasts or one of k levels
Click here for syntax for other MANOVA commands for various designs.
In SPSS, make sure that the dependents are entered in the desired order, then in the MANOVA syntax, enter PRINT SIGNIF(STEPDOWN) or simply PRINT SIG(STEP). For example:
manova var1 var2 var3 BY gender(1,2) /print signif(stepdown) ...See James Stevens, Applied Multivariate Analysis for the Social Sciences, 2nd Edition.
Using discriminant analysis, the MANCOVA dependents are used as predictor variables to classify a factor (treatment) variable, and the discriminant beta weights are used to assess the relative strength of relation of the MANOVA dependents to the factor. The beta weights indicate the strength of relation of a given dependent controlling for all other dependents