Thursday, June 7, 2018

Measuring labor demand

On Twitter, I've gotten into an extended discussion with "Neoliberal Sellout" @IrvingSwisher ("NS") about my bold claim that we are seeing the leading edge of a recession in the JOLTS job openings data using the dynamic information equilibrium model (DIEM).

Calibration is a general issue in time series data that involves different collection methods or models. It becomes a more significant issue if the calibration is done with knowledge of the model you are testing using the calibrated data! For example, if we corrected the aforementioned HWOL data using the DIEM as a prior, that would be extremely problematic. But that isn't what has been done here.

The main issue (I think — I might be wrong) is whether a) we can trust changes in the JOLTS data as representing information about the business cycle, and b) whether specifically "job openings" maintains a constant definition over time. Let me address these points.

The "Help Wanted OnLine" (HWOL) case study

NS points to a study of the "Help Wanted On-Line" (HWOL) index created by the Conference Board. The study documents how changes in Craigslist's pricing affected the HWOL metric, and it's true that price change appears as a non-equilibrium shock in the DIEM:

Actually, the DIEM is remarkably precise in ascertaining the timing of the shock (the gray band represents the beginning and ending of the shock) as November 2012 (dashed line). However, this shock doesn't represent information about the business cycle — it is a measurement issue. This is NS's point: the data's deviation from the job openings DIEM I used to make a bold claim about an upcoming recession may well be a measurement problem rather than a signal about the business cycle.

This is a reasonable point, and as I show in the model analysis above, something that is not related to the business cycle (except possibly indirectly in that being swamped with ads, Craigslist needed to raise the price to keep their servers from crashing from the traffic) indeed shows up as what might be interpreted as the onset of the "2013 recession".

It could well be that the deviation observed in the JOLTS data is a JOLTS specific shock — note that it appears to affect all the JOLTS series (hires, quits, etc are also showing a correlated model error), so it's not a job openings-specific shock. But this is where additional evidence comes in such the trend towards yield curve inversion as well as the general fact that the timing of recessions is consistent with a Poisson process with a mean time between recessions on the order of 8 years — therefore the probability we will see one in the next couple years is rising. If this was 2011 and T-bill spreads were above 3%, I'd probably put much less confidence in the prediction (however, I'd still make it because predictions are a really nice test of models). But with the 10 year - 3 month spread below 1% and on a declining trend since 2014, I'm much more confident the deviation visible in the JOLTS job openings data represents the leading edge of a recession rather than issues with JOLTS data collection methodology.

Measuring job openings

NS also has issues with measuring job openings/vacancies in particular (concatenating some tweets):
... The definition of job openings is not at all constant. An index that links across cycles must rely on varying definitions and makes strong assumptions about how they must be linked. That makes the time series have a long history but still makes for a young vector ... Prior methods for constructing a job vacancy measure in other countries have often had to be discontinued or re-constructed. It’s hard to keep a constant methodology that keeps up with technological shifts while avoiding cyclical distortion. ... It may prove empirically negligible (hard to say given changing def’ns) but vacancy measurement is vulnerable to tracking biz cycle here. Might be robust for other purposes but the Craigslist-HWOL is instructive ...
The DIEM fits with previous data from Barnichon (2010) [1] that uses entirely different mix of data sources (i.e. mostly newspapers as there was no world wide web) and therefore necessarily requiring different definitions of "vacancy". The model also describes the other JOLTS series (hires, quits, separations) as well as the unemployment rate [2] and employment rates across several countries. This is not to say we should therefore believe the DIEM, but rather we should put little weight on the hypothesis that it's just a coincidence the JOLTS job openings data is also well-described by the DIEM with a comparable level of error to other time series because the job openings data suffers from a series of methodological problems specific to the job openings data that somehow results in a time series that looks for all the world as if it doesn't suffer from those methodological problems. We can call it the "immaculate mis-calibration": despite being totally mis-calibrated, the JOLTS job openings data looks as if it is a well-calibrated, reasonably accurate measure of the labor market.

Additionally, the estimate of the dynamic equilibrium is robust to the "business cycle" (i.e. recession shocks) due to the entropy minimization described in the paper. The prediction of the recession is based on the deviation from this dynamic equilibrium, not the "cyclical" (actually random) shocks in the model -- these are exponentially suppressed.

However we have the further check on the JOLTS job openings data: we can use the model to solve for the job openings given the unemployment level and the number of hires. The hires number is less dependent on the methodological issues involved with vacancies (online vs print ads, what constitutes "active recruiting") since it more directly asks if a firm has hired an employee, and NS specifically states the unemployment data is reasonably valid. This check tells us the JOLTS job openings data (yellow) is reasonably close to what we would expect as reconstructed by the model (blue):

Additionally, using the model with this "expected" data series constructed from the hires and unemployment levels, we still see the deviation from the DIEM (blue dashed) for which we posit a shock and a recession.


Overall, this addresses most of NS's points. I'm not entirely sure what the end result we're aiming for is. We should always keep an open mind. I'm not arguing that the model is correct and we shouldn't question it — I'm arguing that the model predicts a shock to JOLTS JOR that will be associated with a rise in the unemployment rate (that we typically associate with a recession). That is to say I am perfectly aware that this prediction may be wrong, and that will be determined by future data. If it is wrong, then we can do a post-mortem and many of the points NS raised will become more salient. If the prediction is correct, NS's points will still be salient for future predictions but less so for this (hypothetically) successful prediction.

Whether or not we believe the prediction going into the "experiment" is in a sense irrelevant. You might select H0 = model is true versus H0 = model is false based on this, but I (and most other people) pretty much always select the latter as good methodology (i.e. not giving my model the benefit of being the null hypothesis). This is a prediction that was made in order to test the model. That the prediction might be wrong is precisely the point of making the prediction.

Basically, NS is arguing why the prediction will be wrong — itself a prediction. This is fine and it's definitely part of Feynman's "leaning over backwards" to present everything that could go wrong (which is why I've written this post to document the points NS makes [3]). But it prejudges the future data to say this information invalidates the prediction before the prediction is tested.



[1] DIEM for Barnichon (2010) data (click to enlarge):

Also, the resulting Beveridge curve (click to enlarge):

[2] The unemployment rate (click to enlarge):

[3] What's somewhat ironic is that I could use NS's points post hoc to rationalize why the prediction failed! I'm not going to do that because I'm genuinely interested in a model that demonstrates something true and valid about the real world. I am not interested in the model if it doesn't do that — and it's not like it has some political ideology behind it (it's basically nihilism, which doesn't need mathematical models) that would cause me to hold onto it. While I have put a lot of work into information equilibrium, I don't have any problem moving on to something else. That's actually been how most of my life has gone: working on something for 5-10 years and moving on to something else — QCD, synthetic aperture radar, compressed sensing, economic theory. It's not like I'd even have to give up blogging because very few people care about the information equilibrium models and forecasts. Most of you come for the methodology discussions and macro criticism. 


  1. Thanks again for engaging. I think you do a very good job of addressing my concerns/issues and clarifying some of the distorted interpretations I had of what you were presenting/predicting.

    Just as further clarification, my concern was that while job vacancy data has some business cycle info, its definition lends itself to vulnerability for the purposes of recession prediction and the JOLTS data only goes back one full business cycle, leaving those time series especially untested vs a proper small sample size.

    This post really helps flesh out for me the additional evidence and modeling that informs your view for why a recession is more imminent. I was worried you were relying too heavily on JOLTS data, and especially job vacancy data, when making your 2019 recession call. I don't think that's actually the case now, given your explanation of what else you see as supporting your view.

    As a separate discussion that's relevant to the title of your post but not really to our previous discussion (just thought you might find it fun and would love to get your thoughts):
    I do question the quality of the JOLTS job openings data for the specific purpose of proxying labor demand relative to the business cycle. This is NOT the same as saying that JOLTS job openings don't move cyclically (it obviously does). Rather, the Beveridge Curve analysis/predictions for the post-crisis period represent a pretty epic fail. Predictive models of wage growth that incorporated available labor supply via unemployment/slack and labor demand via job openings disappointed at both the aggregate and sector levels (unless I'm somehow missing the right specification). The Beveridge Curve analysis seemed straightforward: job openings seemed elevated conditional on the level of unemployment and such an imbalance would have to necessitate faster wage growth conditional on the level of unemployment. In reality we've seen the opposite result, with wage growth continuing to undershoot the prior two cycles. It's very plausible a simple supply-demand approach to modeling wage growth is just wrong (very open to this), but my best reconciliation of these facts is that job opening rates are systematically moving higher because technology has led to secular decline in the cost of posting a job opening. Job opening rates are elevated due to technology, while unemployment understates the level of slack and thus leads to an overshooting on wage growth forecasts. Again, this doesn't really affect your analysis, but just thought you might find this interesting to think about.

    1. I think there was a bit of misunderstanding at the beginning when I said I was basing it off of JOLTS data. The specific timing and thresholds of claiming a recession was coming was based on JOLTS data, but the claim that it is going to be a recession and not a random shock to openings was based on a broader set of evidence.

      Actually, the model I'm using gives a rather mundane explanation for the lack of a specific relationship between job opening level and unemployment level as well as the lack of wage growth.

      For the former, since the timing of shocks are different to unemployment and vacancies (vacancies falling first), any specific joint level of unemployment and vacancies is more an accident of history.

      For the latter, the level of wage growth is dependent on how large the previous recessions were and how long the time between them is (again, history).

      The well-defined relationships ("equilibria") appear to be the rates of change, not the levels. The unemployment rate falls at a specific constant rate over the past 60 years of about 9% per year (that's of the percentage so that a 10% unemployment rate would fall about 0.9 percentage points in a year while a 1% rate would fall 0.09 percentage points). The same (but with different positive or negative rates) applies to wage growth, job openings, etc. That means the level (of unemployment rate, wage growth rate) at any point is more determined by the history of non-equilibrium shocks (usually recessions, but there was a mini-boom in the US in 2014) and the time in between them. Wage growth has undershot in the last two cycles because not enough time has occurred between the recessions in 1991, 2000, and 2008 for wage growth to return to that level.

      A decent analogy is avalanches and snow pack on mountains. Snow can accumulate at a constant rate, but the amount of snow pack (level) is dependent on how long between avalanches and how big they are. The level is a record of the non-equilibrium process of avalanches, while the rate of increase is the equilibrium process of snowfall (growth).


Comments are welcome. Please see the Moderation and comment policy.

Also, try to avoid the use of dollar signs as they interfere with my setup of mathjax. I left it set up that way because I think this is funny for an economics blog. You can use € or £ instead.