Why bounds, not a single number?
A job-training program may help workers find jobs, or it may not — and an unmeasured confounder (prior work experience) makes a single estimate impossible to trust. Partial identification answers the next-best question: what range of values is the true effect consistent with, given only the data and minimal assumptions? The honest answer in this post is a wide interval [-0.298, +0.702] — Manski's no-assumption bounds — that contains the true ATE of 0.27 but is too wide to settle the question.
The animation below shows the three layers of certainty. A naive point estimate pretends the confounder is gone. The Manski bounds never move — they reflect fundamental uncertainty. The entropy bounds shrink as you add an information-theoretic assumption, all the way to a point if you are willing to assume everything. Different tabs let you turn those dials yourself.
Three layers of certainty about the ATE
The orange dot only appears when assumptions are strong enough to collapse the bound to a single point — a fantasy in this post since the confounder is real. Notice how the steel-blue Manski bar never budges: more assumptions, not more data, is what narrows bounds.
Bounds Showdown
The real numbers from the post. Toggle Manski, Autobound, Tian-Pearl, and entropy. Hover for width, lower/upper, and whether the bound covers the true ATE.
Confounding Simulator
Tune confounder prevalence, treatment effect, and confounder strength. Watch the naive estimate drift while the Manski bounds bracket the truth every time.
Sample-size Sensitivity
The headline misconception: bigger N does not narrow identification bounds. See the Manski width stay flat at 1.0 from N = 100 to N = 5,000.
Glossary (open a card if a term is unfamiliar)
Point identification
Partial identification
Manski bounds
Autobound (LP)
Entropy bounds
Tian-Pearl bounds
PNS
Bound width
Coverage
ATE
Bounds Showdown — the post's real numbers
These are the exact bounds computed in the post on the same N = 1,000 confounded dataset (true ATE = 0.27). Toggle the outcome (ATE vs. PNS) to switch between the average effect and the probability of necessity-and-sufficiency. Hover a bar for the upper/lower endpoints, width, and whether the bound contains the true ATE.
Outcomes
What to look for
- Manski and Autobound coincide on the ATE. The blue and orange bars overlap exactly at [-0.298, 0.702] — linear programming confirms Manski is already sharp. No clever LP trick can do better without extra assumptions.
- Entropy buys a 32% width reduction. Adding H(U | X, Y) ≤ 0.1 narrows the ATE band to [-0.228, 0.454] (width 0.68). It still crosses zero — the data, even with the entropy assumption, cannot confidently sign the effect.
- Switch to PNS. Tian-Pearl gives [0, 0.702] — the lower bound pinning at zero means we cannot rule out that training is never individually necessary-and-sufficient. Interestingly, the entropy constraint is less effective for counterfactual PNS than for ATE.
Confounding Simulator — turn the dials yourself
Generate fresh binary data with confounding. The Manski bounds are computed in closed form from the observed contingency table. Slide the confounder strength up and watch the naive estimate drift, while the Manski bounds stay anchored around the truth.
What to look for
- Raise the confounder strength β_U. The orange "naive" line drifts away from the teal "true" line. The Manski bound never loses the truth.
- Drop β_U to 0. The naive estimate snaps to the truth — there is no confounding. The Manski bound is still width ≈ 1 because the data cannot prove that no confounder exists.
- Change N from 100 to 5,000. The bars barely move. Identification bounds, unlike confidence intervals, do not shrink with sample size.
Sample-size Sensitivity — what more data buys you
A common misconception: collecting more data narrows partial-identification bounds. This is generally not true. Identification bounds reflect what is fundamentally unknowable without observing the confounder. Below: Manski width vs. N from the post's experiment (30 reps per grid point). The Manski band sits flat at 1.0 across two orders of magnitude of N; entropy bounds also stabilise around 0.68.
What to look for
- Both bands are flat in N. The mean Manski width is exactly 1.0 from N = 100 to N = 5,000. Mean entropy width is ≈ 0.68. Only the variance around those means shrinks with N.
- Identification ≠ statistical estimation. Confidence intervals shrink at rate 1/√N. Identification bounds do not move at all with N — they encode "what the data could ever tell us, even with infinite samples."
- To narrow bounds you need structure, not size. Options listed in the post: an instrument (BinaryIV), monotone treatment response, or direct measurement of U. The Tab 3 simulator shows the data-size lever; nothing you do with it narrows the bound.
Why does this surprise people?
Most empirical work uses 95% confidence intervals, which behave like
[β̂ − 1.96 SE, β̂ + 1.96 SE] and shrink as N grows. Identification
bounds answer a different question: what is the set of effects
consistent with the data and the assumed structure, even in the limit
of infinite data? When the assumed structure is weak (Manski),
that set is wide and stays wide. The path to narrower bounds runs
through assumptions — not through bigger samples.