When Do Suppressor Effects Occur? | Statistical Horizons

When doing regression analysis, most data analysts expect that the coefficient associated with a predictor variable will get smaller (closer to zero) when other variables are added to the regression model. But if the analysts are experienced, they also know that sometimes a coefficient gets larger when other variables are added. That’s commonly known as a “suppressor effect,” and the additional variable is called a “suppressor variable”.

I thought I knew all about suppressor effects. It turns out I was wrong. A few weeks ago, I was asked the following (paraphrased) question by Adam Abdulla, an Associate Lecturer at Robert Gordon University in Scotland:

Suppose we have three variables, y, x_1,and x₂. The correlations among these three variables are all positive. If we then do an OLS regression of y on x₁and x₂, will the standardized coefficients for x₁and x₂ necessarily be smaller than their bivariate correlations? In other words, if all correlations are positive, can we have a suppressor effect?

My initial answer was no. For a suppressor effect to occur, there has to be an inconsistency of signs. For example, x₁and x₂ both have positive correlations with y, but they are negatively correlated with each other. Or, x₁and x₂are positively correlated with each other, but one is negatively correlated with y and the other is positively correlated with y.

Well, as pointed out by another one of Adam’s correspondents, that was a serious error. Yes, there will be a suppressor effect if the correlations are inconsistent in sign. But it can easily be proved that suppressor effects can occur even when all correlations are positive.

LEARN MORE IN A SEMINAR WITH PAUL ALLISON

Here is the basic result. We start with three correlations, labeled as

r_y₁(the correlation between y and x₁)

r_y₂(the correlation between y and x₂)

r₁₂ (the correlation between x₁ and x₂).

We assume that all three are positive. The standardized coefficients for the regression of y on both x₁ and x₂ are labeled p_y₁ and p_y₂. [In this post, I’ll focus on standardized coefficients, but the same conclusions apply to unstandardized coefficients.] We want to know how p_y₁ compares with r_y₁ (since r_y₁ is the standardized coefficient in a bivariate regression of y on x₁).

The simple answer is this: It can be shown that p_y₁ > r_y₁ whenever r_y₁r₁₂ > r_y₂.

What this inequality says is that you’re likely to get a suppressor effect when the correlation between two predictors is high, but their correlations with the dependent variable are very discrepant (i.e., one is substantially larger than the other). A bit later, I’ll make this conclusion more precise.

There is an additional, and somewhat surprising, consequence of r_y₁r₁₂ > r_y₂, and that is that p_y₂, the standardized coefficient for the other variable, must be less than zero.

Here’s an example. Suppose r_y₁= 0.40, r_y₂ = 0.20, and r₁₂ = 0.60. Arranging those as a correlation matrix, we have

Using the well-known formula for a standardized coefficient, we get

which is larger than the corresponding correlation of .40. And for the other coefficient, we get

a negative coefficient.

At least for me, this all becomes more understandable when the relationships among the three variables are displayed as a path diagram. Here’s a path diagram (produced by Mplus) for the example just described:

My original mistake was to claim that a suppressor effect occurs when there is an inconsistency of signs among the correlations. But I wasn’t totally off the mark. The correct way to put it is that a suppressor effect occurs when there is an inconsistency of signs among the three arrows in this diagram. In this example, we have a positive correlation between x₁and x₂, but the effects of those two variables have opposite signs. There would also be a suppressor effect if the correlation between x₁ and x₂ were negative, but the two standardized coefficients had the same sign.

That’s evident from the tracing rule for path diagrams, which says that

Let’s rewrite this as

Now, it’s easy to see what happens when the signs of these parameters vary. For example, assuming that r_y₁ is positive, p_y₁ will be larger than r_y₁whenever either (a) p_y₂ is negative and r₁₂ is positive or (b) p_y₂ is positive and r₁₂ is negative.

Overall, one take-away is that suppressor effects can occur more often than we might expect. An important implication, which I have long emphasized in my linear regression courses, is that we should be very wary of relying on bivariate correlations to suggest what variables ought to go into a regression model. A bivariate correlation may be small and not statistically significant, and yet, in reality, the independent variable may actually have a big effect on the dependent variable when other variables are controlled.

For several detailed empirical examples of suppressor effects, see Richard Williams’ excellent tutorial.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Leave a Reply Cancel reply