Information Transfer Economics: Statistical significance is not model correctness

Monday, August 3, 2015

Statistical significance is not model correctness

I have been having a discussion with Mark Sadowski in comments here about what his series of posts at Marcus Nunes's blog actually show. My more general argument (that Mark's model is actually an information transfer model that demonstrates a liquidity trap) notwithstanding, here is a piece of the conversation:

Me: The data are exponential and any two exponentials will be related by the above relationship [log P(t) = a log MB(t-q) + b]. You're not learning that your model or theory is correct. You're learning that economic systems tend to be exponential. Interpreting statistical significance as model correctness is an inference error.

Mark: Just because two variables both exhibit a similar trend does not mean there is a statistically significant relationship between them. Moreover there are often statistically significant relationships between variables which do not exhibit similar trends. So frankly this statement is weirdly nonsensical.

I agree that there can be statistically significant relationships between variables with different trends, but that is beside the point. My point was that if your data are all roughly samples of exponentially growing functions, you can almost always find a statistically significant relationship between them even if there isn't any relationship at all.

I decided to demonstrate my assertion with a concrete example (the complete details are at the bottom of this post). Let's generate two randomly fluctuating exponentially growing data series (using normally distributed shocks to the growth rate):
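A minimal Python sketch of that setup, for readers who want to try it without the notebook. This is not the code behind the figures: the function name `growth_series`, the drift and shock sizes, the starting levels, the series length, and the seed are all assumptions picked for illustration.

```python
import math
import random

def growth_series(n, mean_growth, shock_sd, rng, start=1.0):
    """Exponentially growing series whose per-step growth rate gets a normal shock."""
    log_level = math.log(start)
    series = []
    for _ in range(n):
        # growth rate this step = average growth + normally distributed shock
        log_level += rng.gauss(mean_growth, shock_sd)
        series.append(math.exp(log_level))
    return series

rng = random.Random(0)  # fixed seed so the sketch is reproducible
v1 = growth_series(100, 0.03, 0.02, rng)               # stands in for the "data" V1
v2 = growth_series(100, 0.05, 0.02, rng, start=100.0)  # stands in for the "explanatory variable" V2
```

The two series share one random number generator but use separate draws, so by construction nothing connects them except that both grow roughly exponentially.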

Now let's say the red one is the data V1 and blue one is the explanatory variable V2 and fit the data to the model:

log V1(t) = a0 log V2(t - t0) + b0

Here is the model result (solid blue) -- the dashed blue line is a0 log V2(t) + b0, i.e. showing the data for V2 without the lag:

The fit has chosen a lag that lines up some of the random fluctuations in V1 to the random fluctuations in V2 a bit better (actually, statistically significantly better).

And the result is that the parameter p-values of the model fit are all p < 0.01 ... a statistically significant relationship, worthy of publication in an economics journal for example.

Except it was random data.
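Here is a rough Python sketch of that kind of fit, under stated assumptions. It scans integer lags t0 from 0 to a hypothetical `max_lag`, runs ordinary least squares of log V1(t) on log V2(t - t0), and keeps the lag with the smallest mean squared residual. The p-values use a normal approximation to the t distribution, which is a simplification of mine and not necessarily what the notebook does. All function names and parameter values are illustrative.

```python
import math
import random
from statistics import NormalDist

def growth_series(n, mean_growth, shock_sd, rng, start=1.0):
    log_level, series = math.log(start), []
    for _ in range(n):
        log_level += rng.gauss(mean_growth, shock_sd)
        series.append(math.exp(log_level))
    return series

def two_sided_p(t_stat):
    # Normal approximation to the Student t p-value (an assumption of this sketch).
    return 2.0 * (1.0 - NormalDist().cdf(abs(t_stat)))

def ols(x, y):
    """Least-squares fit y = a*x + b; returns a, b, their p-values, and the mean squared residual."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    a = sxy / sxx
    b = my - a * mx
    rss = sum((yi - a * xi - b) ** 2 for xi, yi in zip(x, y))
    s2 = rss / (n - 2)                                 # residual variance estimate
    se_a = math.sqrt(s2 / sxx)                         # standard error of the slope
    se_b = math.sqrt(s2 * (1.0 / n + mx ** 2 / sxx))   # standard error of the intercept
    return a, b, two_sided_p(a / se_a), two_sided_p(b / se_b), rss / n

def fit_with_lag(v1, v2, max_lag=10):
    """Try each lag t0 in 0..max_lag and keep the one with the smallest mean squared residual."""
    best = None
    for t0 in range(max_lag + 1):
        x = [math.log(v) for v in v2[: len(v2) - t0]]  # log V2(t - t0)
        y = [math.log(v) for v in v1[t0:]]             # log V1(t)
        a, b, p_a, p_b, mse = ols(x, y)
        if best is None or mse < best["mse"]:
            best = {"t0": t0, "a0": a, "b0": b, "p_a0": p_a, "p_b0": p_b, "mse": mse}
    return best

rng = random.Random(0)
v1 = growth_series(100, 0.03, 0.02, rng)
v2 = growth_series(100, 0.05, 0.02, rng, start=100.0)
fit = fit_with_lag(v1, v2)
print(fit)
```

Because the lag is chosen to minimize the residual, the lagged fit can never look worse in-sample than the unlagged one, which is part of why the procedure so readily "finds" a relationship.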

OK ... maybe that's a fluke?

So I did it 100 times -- and only a few percent of the runs failed to achieve p < 0.01 (the plot shows, for each run, the worst of its parameter p-values):

And that is for random data with no relationship between the two except that they are both exponential. That is what I mean by "[i]nterpreting statistical significance [p < 0.01] as model correctness is an inference error."
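The repeated experiment can be sketched in Python along these lines. As before, this is an illustration under assumptions, not the notebook: the helper names, drift and shock sizes, `max_lag`, the seed, and the normal approximation for p-values are all my choices. It counts how many of 100 independent pairs of series give a slope with p < 0.01, and how many give p < 0.01 for both parameters.

```python
import math
import random
from statistics import NormalDist

def growth_series(n, mean_growth, shock_sd, rng, start=1.0):
    log_level, series = math.log(start), []
    for _ in range(n):
        log_level += rng.gauss(mean_growth, shock_sd)
        series.append(math.exp(log_level))
    return series

def ols_p_values(x, y):
    """Fit y = a*x + b; return (p-value of a, p-value of b, mean squared residual)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    a = sxy / sxx
    b = my - a * mx
    rss = sum((yi - a * xi - b) ** 2 for xi, yi in zip(x, y))
    s2 = rss / (n - 2)
    se_a = math.sqrt(s2 / sxx)
    se_b = math.sqrt(s2 * (1.0 / n + mx ** 2 / sxx))
    z = NormalDist()  # normal approximation to the t distribution (assumption)
    p_a = 2.0 * (1.0 - z.cdf(abs(a / se_a)))
    p_b = 2.0 * (1.0 - z.cdf(abs(b / se_b)))
    return p_a, p_b, rss / n

def best_lag_p_values(v1, v2, max_lag=10):
    """Scan lags 0..max_lag, keep the best-fitting one, return its parameter p-values."""
    fits = []
    for t0 in range(max_lag + 1):
        x = [math.log(v) for v in v2[: len(v2) - t0]]  # log V2(t - t0)
        y = [math.log(v) for v in v1[t0:]]             # log V1(t)
        fits.append(ols_p_values(x, y))
    p_a, p_b, _ = min(fits, key=lambda f: f[2])
    return p_a, p_b

runs = 100
rng = random.Random(1)
n_slope_significant = 0
n_all_significant = 0
for _ in range(runs):
    v1 = growth_series(100, 0.03, 0.02, rng)
    v2 = growth_series(100, 0.05, 0.02, rng, start=100.0)
    p_a, p_b = best_lag_p_values(v1, v2)
    n_slope_significant += p_a < 0.01
    n_all_significant += max(p_a, p_b) < 0.01  # worst parameter p-value for this run

print(n_slope_significant, "of", runs, "runs have a slope with p < 0.01")
print(n_all_significant, "of", runs, "runs have all parameters with p < 0.01")
```

Every pair here is unrelated by construction, so any run that clears p < 0.01 is a false positive for a causal reading.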

Additionally, adding more data causes the previously identified relationship (the model with parameters a0, t0 and b0) to break down.
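One way to check this kind of breakdown is to fit on the first half of a longer pair of series and then score the frozen parameters on the second half. The Python sketch below does that with illustrative, assumed settings (200 points, fit on the first 100, hypothetical helper names). I would expect the out-of-sample error to be larger on most draws, but that is not guaranteed for any single seed.

```python
import math
import random

def growth_series(n, mean_growth, shock_sd, rng, start=1.0):
    log_level, series = math.log(start), []
    for _ in range(n):
        log_level += rng.gauss(mean_growth, shock_sd)
        series.append(math.exp(log_level))
    return series

def ols(x, y):
    """Least-squares fit y = a*x + b; returns (a, b, mean squared residual)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    a = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sxx
    b = my - a * mx
    return a, b, sum((yi - a * xi - b) ** 2 for xi, yi in zip(x, y)) / n

def fit_with_lag(v1, v2, max_lag=10):
    """Return (t0, a0, b0) for the lag in 0..max_lag with the smallest mean squared residual."""
    best = None
    for t0 in range(max_lag + 1):
        x = [math.log(v) for v in v2[: len(v2) - t0]]  # log V2(t - t0)
        y = [math.log(v) for v in v1[t0:]]             # log V1(t)
        a, b, mse = ols(x, y)
        if best is None or mse < best[3]:
            best = (t0, a, b, mse)
    return best[:3]

def rms_residual(v1, v2, a, b, t0, start, stop):
    """RMS of log V1(t) - (a*log V2(t - t0) + b) for t in [start, stop); needs start >= t0."""
    res = [math.log(v1[t]) - (a * math.log(v2[t - t0]) + b) for t in range(start, stop)]
    return math.sqrt(sum(r * r for r in res) / len(res))

rng = random.Random(2)
v1 = growth_series(200, 0.03, 0.02, rng)
v2 = growth_series(200, 0.05, 0.02, rng, start=100.0)

# Fit on the first 100 points only, then freeze a0, t0 and b0.
t0, a0, b0 = fit_with_lag(v1[:100], v2[:100])
in_sample = rms_residual(v1, v2, a0, b0, t0, t0, 100)
out_of_sample = rms_residual(v1, v2, a0, b0, t0, 100, 200)
print("lag:", t0, "in-sample RMS:", in_sample, "out-of-sample RMS:", out_of_sample)
```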

The big takeaway from this, and what brings it back to my more general argument, is that in order to make causal inferences you need to see strong changes in the trend of your data ... like the one that happens in 2008 here:

I once called this the cleanest economic experiment ever. Mark's analysis ignores this amazing source of potentially informative data by concentrating only on the post-2008 data (and thus contains only a single log-linear trend).

...

Here is the full Mathematica notebook:

32 comments:

Tom Brown, August 3, 2015 at 8:08 PM
OK, great, I'm glad you continued this. I hope Mark responds, and that we ... er... you two eventually come to some sort of resolution so that I can potentially learn something. (c:

If you go back to where we left off with the comments, I asked him about coming up with a lag of 1 for M0. Any thoughts on that?

Also, it sounds like when he compared p-values for the three data variables he looked into, he found a big difference between the M0 and the MB explanatory variables, with the former around 0.5 and the latter giving much smaller p-values for all three curves. I realize that he's skipping the big change that you've pointed out here, but is there any reason for that which you can think of offhand? Do you consider it to be of any importance? Why?

OK, thanks Jason.

Mark A. Sadowski, August 4, 2015 at 3:56 AM
I have a busy day today and may come back late this afternoon.

However, on first pass it looks to me like you've simply fitted one nonstationary process on another nonstationary process without first detrending the data. This is a classic example of what is known as a spurious regression.

http://davegiles.blogspot.com/2012/05/more-about-spurious-regressions.html

Of course the p-values will be low.

Tom Brown, August 4, 2015 at 9:47 AM
Looking forward to responses here:

From Jason: The detrending issue. Also, you mention the sensitivity of p-values above. To summarize what I think I hear Mark saying regarding the data since 2008 (only), MB probably is a good explanation while M0 is probably not. And you're saying that if we restrict ourselves to 2008 and later data only, that's a reasonable conclusion (putting aside the "sensitivities" you mention in p-values), but why ignore the elephant in the room (i.e. why not include pre-2008 data as well)?

From Mark: Jason's point about excluding pre-2008 data (I've asked before, although I'm not sure you saw my 2nd question on the subject).

Anonymous, August 4, 2015 at 10:37 AM
"Interpreting statistical significance as model correctness is an inference error."

Indeed. That is a general truth. Statistical significance means that there is good evidence against the null hypothesis. That means that there is confirmatory evidence for every other hypothesis, not just for the model in question. And, as we know, confirmatory evidence is very weak.

Mark A. Sadowski, August 4, 2015 at 2:11 PM
"I once called this the cleanest economic experiment ever."

So I click on the link and I find the following.

http://informationtransfereconomics.blogspot.com/2014/11/quantitative-easing-cleanest-experiment.html

"Let's plot the Pearson's correlation coefficient of MB (blue) and M0 (red) with P (as well as the correlation of MB and M0, green):

[Graph]

Before QE, all of these are fairly highly correlated -- actually MB and M0 are almost perfectly correlated. This really doesn't tell us very much. NGDP is also highly correlated with the price level. So is population, and in fact any exponentially growing variable.

With the onset of QE, the correlation between MB and P drops precipitously (as well as the correlation between M0 and MB). We see that the counterfactual path MB without QE would have been more correlated with P (effectively given by the red line)....This means central bank reserves have nothing to do with the price level or inflation."

What Jason did in that post is essentially what he is doing in this post.

You cannot regress nonstationary time series on each other. These are spurious regressions. Spurious regressions result in invalid estimates with high R-squared values, high t-statistics and low p-values.

Thus the Pearson's r values (the square root of the R-squared values) in this earlier post were subject to extreme statistical bias and so could not have been interpreted as meaning that these series are correlated. Nor could the fact that the Pearson's r values dropped precipitously be interpreted as meaning anything.

One must correct nonstationarity by differencing, or by some other method before checking to see if two time series have a statistically significant relationship.

Either Jason is guilty of outrageous obfuscation or he simply doesn't know what any statistics student should have learned by the completion of his freshman year in college.
