Hi all,
I am trying to predict teacher expectations from a number of variables and would like to include interaction effects as predictors. Research has shown, for instance, that child gender and class size havea direct effect. However, I would also like to know whether, besides these direct effects, the interaction between the two plays a role (such that, e.g., girls in smaller classes are less likely to be underestimated and boys in bigger classes). As far as I know, interaction effects can be modeled by multiplying the variables in question; however, if I do so, it is no longer possible to include the two direct effects, due to collinearity of the predictors. If I use the residuals only (here, by regressing the interaction term on both child gender and class size), the effects are minuscule (I tried several interactions, which all make sense from a theoretical point of view, and this was the case for all of them). To solve the problem: * Do I have to make a choice (direct effects or interaction effects, but not the two)? * How would I proceed? My idea would be to have a look at the correlations with the criterion (if they are higher for the interaction term than for the two individual terms, would I choose the interaction term?) I hope the question is not absolutely silly; if it is, please let me know so I know at least the answer is closer than my blocked brain is able to recognize right now ;) Thanks very much in advance! Tanya ===================== To manage your subscription to SPSSX-L, send a message to [hidden email] (not to SPSSX-L), with no body text except the command. To leave the list, send the command SIGNOFF SPSSX-L For a list of commands to manage subscriptions, send the command INFO REFCARD |
Administrator
|
Hi Tanja. Two things.
1. You have explanatory variables from at least 2 levels here. Characteristics of individual students (e.g., sex, IQ) are Level 1 variables; and characteristics of the classes (e.g., class size) are level 2 variables. So IMO, you should be using MIXED, not REGRESSION. 2. Try centering your variables on some meaningful in-range value*. This will not only reduce the (appearance of) multicollinearity, but will also yield a constant that is interpretable. * Authors often describe mean-centering. But you do not have to center on the mean. I like to choose a nice round figure somewhere near the bottom end of the range for a variable, for example. HTH.
--
Bruce Weaver bweaver@lakeheadu.ca http://sites.google.com/a/lakeheadu.ca/bweaver/ "When all else fails, RTFM." PLEASE NOTE THE FOLLOWING: 1. My Hotmail account is not monitored regularly. To send me an e-mail, please use the address shown above. 2. The SPSSX Discussion forum on Nabble is no longer linked to the SPSSX-L listserv administered by UGA (https://listserv.uga.edu/). |
In reply to this post by Tanja Gabriele Baudson
Hi Tanya,
Bruce is right (like always). You need to fit a multilevel (a.k.a. random effects, a.k.a. mixed model) in order to be able to include at the same time independent variables related to the teachers (the individuals) and independent variables related to the schools (groups). As Bruce suggest, centering in multilevell is very useful indeed. If you want to read about multilevell I recommend you Hox (2010): Multilevel analysis. It is an excelent text-book, it is clear and it has many examples with datasets from education data. The only alternative to to multilevel here will be to fit a 2-stages fixed effects model, which I think would not be very adequate in your case. Kind Regards |
Free forum by Nabble | Edit this page |