Spatial Dynamic Panels with Common Factors

Credit risk in US banking, the spxtivdfreg way

0.394spatial spillover psi
7.765long-run total effect
33.5%variance from factors

Carlos Mendez

Nagoya University (GSID)

June 11, 2026

The Tension

Act I

When the crisis hit, credit risk spread two ways — and standard tools saw only one

When the 2007–2009 crisis struck, non-performing loans surged across the whole US banking system at once.

Two channels did it: spatial spillovers between banks, and common factors — macro shocks that hit every bank at once.

Stata’s xsmle and spxtregress model only the first.

One equation must absorb four kinds of endogeneity at once

\[NPL_{it} = \psi \sum_{j=1}^{N} w_{ij}\, NPL_{jt} + \rho\, NPL_{i,t-1} + x_{it}\beta + \alpha_i + \lambda_i' f_t + \varepsilon_{it}\]

Spatial lag \(\psi W\,NPL\), temporal lag \(\rho\,NPL_{i,t-1}\), endogenous covariates \(x_{it}\beta\), bank effect \(\alpha_i\), and the latent factors \(\lambda_i' f_t\) — every term but \(\alpha_i\) is a source of endogeneity.

Where we’re going

  • The lab: 350 US banks, 36 quarters, an economic-distance network
  • The estimator: defactor first, then run IV
  • The full model: a significant spatial lag and one-third of the variance from factors
  • Three stress tests: drop the factors, drop the spatial lag, free the slopes

The Investigation

Act II

The lab: 350 US banks across the entire crisis, 12,250 observations

  • OutcomeNPL, non-performing loans over total loans (%)
  • Span — 2006:Q1 to 2014:Q4, 36 quarters straddling the GFC
  • Network — a \(350 \times 350\) weight matrix \(W\), not geographic but economic-distance (Spearman rank correlations of bank debt ratios)

Strongly balanced: every bank in every quarter. The matrix is row-standardized with about 18 neighbours per bank, so the spatial lag is a weighted average of interconnected banks.

Defactored IV is a two-step trick: subtract the rainstorm, then read each pond

Step 1 — Defactor

  • Extract latent common factors \(\lambda_i' f_t\) by principal components
  • Remove them from data before estimating

Step 2 — IV / GMM

  • Run instrumental variables on the defactored data
  • Standard 2SLS asymptotics now hold

Removing the factors first kills the cross-sectional dependence — no incidental-parameters bias, no Lee–Yu correction.

INEFF is endogenous, so INTEREST does the instrumenting

spxtivdfreg NPL INEFF CAR SIZE BUFFER PROFIT QUALITY LIQUIDITY, ///
    absorb(ID) splag tlags(1) spmatrix("W.csv", import) ///
    iv(INTEREST CAR SIZE BUFFER PROFIT QUALITY LIQUIDITY, splags lag(1)) std

splag adds the spatial lag, tlags(1) the temporal lag, iv(...) instruments INEFF with INTEREST and lags, std standardizes before extracting factors.

The estimator finds the factors itself: 2 in the regressors, 1 in the errors

  • \(r_x = 2\) common factors in the regressors
  • \(r_u = 1\) common factor in the error term
  • Together they capture Fed policy, the housing collapse, the interbank freeze

Heterogeneous loadings \(\lambda_i\) let each bank react to the same aggregate shock with its own intensity — something two-way fixed effects cannot do.

With factors in, the spatial lag is real: a 0.39 contemporaneous spillover

0.394

\(\psi\), spatial autoregressive parameter (z = 4.65, p < 0.001) — neighbours’ NPL up 1 pp raises this bank’s NPL by 0.39 pp

LIQUIDITY dominates the covariates, and the J-test blesses the instruments

Term Coef. z Sig.?
\(\psi\) (W·NPL) 0.394 4.65 yes
\(\rho\) (L1.NPL) 0.290 5.33 yes
LIQUIDITY 2.452 9.09 yes
INEFF 0.447 4.28 yes
BUFFER −0.055 −4.59 yes

Hansen J: \(\chi^2(19) = 18.83\), \(p = 0.468\) — fails to reject, so the instruments survive overidentification.

The Resolution

Act III

Drop the factors and temporal persistence doubles while the J-test rejects

Term With factors Without factors
\(\psi\) (spatial) 0.394*** 0.288***
\(\rho\) (temporal) 0.290*** 0.594***
LIQUIDITY 2.452*** 0.843***
Factors \((r_x, r_u)\) 2, 1 0, 0
J-test \(p\) 0.468 0.000

Excluded factors are serially correlated, so \(\rho\) swells to absorb them — and the instruments are no longer valid.

The no-factor model also masks a real effect: SIZE goes insignificant

With factors

  • SIZE = 0.223, significant (z = 2.36)
  • larger banks load more on macro shocks

Without factors

  • SIZE = 0.089, not significant
  • exposure mistaken for noise

Omitting factors can hide a genuine relationship, not just inflate spurious ones.

The spatial lag earns its place: dropping it inflates the covariates

Term Full model Without spatial lag
\(\psi\) (W·NPL) 0.394***
\(\rho\) (L1.NPL) 0.290*** 0.323***
INEFF 0.447*** 0.638***
SIZE 0.223** 0.346***
J-test \(p\) 0.468 0.226

Both pass the J-test — the choice is economic, and theory says banks are interconnected.

A permanent liquidity shock costs three times what the coefficient shows

7.765

Long-run total effect of LIQUIDITY vs a short-run coefficient of 2.452 — amplified over time and across the network

Long-run impact = short-run effect, amplified twice

\[\text{Total LR effect} = \frac{\beta}{(1-\rho)(1-\psi)} = \frac{2.452}{(1-0.290)(1-0.394)} \approx 7.77\]

The temporal multiplier \(1/(1-\rho) = 1.41\) compounds the shock over time; the spatial multiplier \(1/(1-\psi) = 1.65\) spreads it across the network.

The strongest objection — and the answer

Objection. The mean-group estimator drives the spatial lag to insignificance (\(\psi = 0.032\), \(p = 0.536\)). So maybe the spillovers were never real.

Response. Imposing homogeneous slopes on heterogeneous banks can manufacture spurious spatial dependence — but MG is only \(\sqrt{N}\)-consistent with just 35 periods, so a few outlier banks can swamp the average. Read it as a caution, not a refutation: \(\rho\) stays stable at 0.301, and theory still predicts interconnection.

Four specifications, one verdict: factors stay, spatial lag stays

Full No factors No spatial MG
\(\psi\) 0.394*** 0.288*** 0.032
\(\rho\) 0.290*** 0.594*** 0.323*** 0.301***
LIQUIDITY 2.452*** 0.843*** 2.534*** 6.330***
J-test \(p\) 0.468 0.000 0.226

The J-test decides factors; theory and significance decide the spatial lag; MG is a robustness check, read with care.

Model the common factors, or the spatial story you tell will be the wrong one.