Using Statistical Regression Methods in Education Research




Main effect

This is the effect that a given explanatory variable has an on an outcome variable. In a main effects model there are no terms for interactions between explanatory variables, so the main effects represent the unique effect of each explanatory variable on the outcome. While interpretation of the model is much simpler when only main effects are specified, ths can overly simplify the situation where there are strong interactions between explanatory variables. See for example MLR module 3.11-3.13.

Maximum Likelihood Estimation

Maximum likelihood estimation is a statistical method for a model to the data. The process itself is very technical but luckily SPSS will do it for you! Basically, maximum likelihood estimation selects values for the parameters of the explanatory variables that most closely predict the actual outcome, doing this through an iterative process of successive approximation. The process calculates the probability that the data could have been caused by various explanatory variables and continues until it settles on the combination of parameters that give the highest probability - the most likely!

Multi-level regression models

Multi-level regression models can take account for clustering in data sets to more accurately model complex multi-level social datasets.

They are rather complex and we don't discuss them on this site. However, if you are ready to take the challenge and learn about them then we can highly recommend an excellent site called LEMMA.


This occurs when two or more explanatory variables are very strongly correlated (usually above 0.80). It is can be problematic in regression analysis as it implies the two explanatory may actually be measuring the same phenomena. In such cases it may be best to use only one of the two variables, or to create a new variable that is a weighted combination of the two. For example a measures of Socio-Econic Status (SES) could be derived by a weighted combination of variables such as parental education, occupation and income.

Multiple Linear Regression

Multiple linear regression is similar to simple linear regression but it can produce more expansive models by allowing researchers to include two or more explanatory variables. The formula for multiple linear regression is shown below. Multiple linear regression is the topic of Module 3

Yi = (b0+b1X1+b2X2+...+bnXn) + εi

  • Y = outcome variable, X1 = first explanatory variable, X2 = second explanatory variable, Xn = nth explanatory variable, b0 = value of outcome when all explanatory variables are zero, b1 = regression coefficient for the first explanatory variable, b2 = regression coefficient for the second explanatory variable, bn = regression coefficient for the nth explanatory variable, εi = error.
Multiple R

Multiple R is the correlation between the actual values of an outcome variable and the values predicted by a multiple regression model. Multiple R is similar to Pearson's r for the purposes of interpretation and is useful for making decisions regarding how well a model fits the data.

Page contact: Feedback to ReStore team Last revised: Thu 28 Jul 2011
Back to top of page