|
|
See also separate section on regression.
A strong reason for using 2SLS, noted by Eddie Eoczkowski, "is that 2SLS estimators of SEMs can make use of all the advances made by econometricans in diagnostic testing of models." Oscztrowski writes, "My concern with many published SEM results is that insufficient effort is devoted to checking the assumptions which underpin estimation. Only if underpinning assumptions are valid will the resulting estimates be consistent, efficient and permit valid testing. In other words, too many fragile SEM estimates get through without rigorous checking. Effectively by subjecting the final estimated model to a battery of checks only good strong robust models will get through and hence may be of some use to practitioners." For an example, see Eoczkowski and Farrell (1998), who use non-nested tests and Bollen's (1996) 2SLS estimator for SEMs to compare non-nested SEMs to facilitate the choice of alternative measurement scales for the same latent construct. (Eoczkowski, private communication, 8/18/2000).
As of Amos version 16, Amos does not support 2SLS estimation. LISREL, however, does. A description of the LISREL procedure is found in Schmacker & Lomax (2004: 386-388).
The general strategy of using 2SLS to test for selection bias is (1) in the first stage, predict participation/non-participation in the treatment group, based on gender, race, education, and other measured independents thought to be relevant to selection; then (2) use the estimated probability of selection computed in stage 1 as an independent variable in stage 2. If its OLS regression coefficient is not significant, it is concluded that selection bias is not a problem. An illustration is found in Heinrich (1998).
Stage 1. Using OLS regression, the dependent variable is participation in the treatment group = 1, non-participation = 0. The independents are gender, race, education, and other measured independents thought to be relevant to selection into the treatment group. Naturally, the researcher should have anticipated the need for this test of selection bias and should have measured all factors which enter into the staff decision to admit a subject into the treatment group. The higher the R2 for this equation, the more complete the set of selection-related independents, but since selection may also be at random, a low or moderate R2 is not in itself a reason to reject the appropriateness of the 2SLS procedure. However, failure to include an independent important to selection can radically affect the coefficients in stage 1 and the resulting estimates of the probability-of-selection variable (see below).
Stage 2. Based on stage 1, a probability-of-selection variable is calculated for each case in the sample. This is simply the regression estimate of the dependent in stage 1. Then an OLS regression is run with the actual dependent variable of interest as dependent. The independents include the probability-of-selection variable and any other variable which is an independent in the model for that dependent variable (these do not necessarily include stage 1 independents thought to influence selection). If the b coefficient for the probability-of-selection variable is not significant, the researcher concludes selection bias is not a problem. The researcher may also run this second-stage regression with and without the probability-of-selection variable. If the regression coefficients are similar for the other independent variables in the two regressions, this is further indication that selection bias is not a problem.
There are significant limitations on this method of testing for selection bias. First, 2SLS does not test for selection bias due to unmeasured variables. Second, it does not test for nonlinear and interaction effects among measured variables unless explicitly modeled in stage 1. Third, if in Stage 2 the probability-of-selection variable and the grouping variable (treatment/comparison) are both included in the equation as independents, when there is multicollinearity between these two variables, the significance of the b coefficient for the probability-of-selection variable and the grouping variable may be biased, leading to erroneous conclusions about the presence or absence of selection bias.
Examples