Interactive data dictionary

Do Institutions Cause Prosperity? The AJR (2001) IV Datasets

The eight Acemoglu–Johnson–Robinson replication datasets behind an instrumental-variables tutorial in Stata.

8 datasets · 55 variables · 64 ex-colonies (base sample) · 163–376 rows per file

Downloads

Each dataset is available as a labeled Stata .dta and its source file.

Download all data (ZIP) · stata_codebook.do

Dataset | Grain | Rows × columns | Stata | Source
maketable1 | country (cross-section) | 376 × 11 | maketable1.dta | maketable1.dta
maketable2 | country (cross-section) | 163 × 9 | maketable2.dta | maketable2.dta
maketable3 | country (cross-section) | 376 × 11 | maketable3.dta | maketable3.dta
maketable4 | country (cross-section) | 163 × 10 | maketable4.dta | maketable4.dta
maketable5 | country (cross-section) | 163 × 12 | maketable5.dta | maketable5.dta
maketable6 | country (cross-section) | 163 × 29 | maketable6.dta | maketable6.dta
maketable7 | country (cross-section) | 163 × 15 | maketable7.dta | maketable7.dta
maketable8 | country (cross-section) | 163 × 12 | maketable8.dta | maketable8.dta

Run stata_codebook.do in Stata once to attach long-form per-variable notes to the .dta files.

Load directly in code

Every file loads straight from GitHub (raw URLs). Swap the file name to load any dataset.

Stata

* Stata 14+ : `use` reads an https URL directly
global BASE "https://raw.githubusercontent.com/cmg777/starter-academic-v501/master/content/post/stata_iv/data/"
use "${BASE}maketable1.dta", clear
describe
notes

Python

!pip install -q pyreadstat
import pandas as pd
BASE = "https://raw.githubusercontent.com/cmg777/starter-academic-v501/master/content/post/stata_iv/data/"
df = pd.read_stata(BASE + "maketable1.dta")

# load every dataset at once
files = ["maketable1", "maketable2", "maketable3", "maketable4", "maketable5", "maketable6", "maketable7", "maketable8"]
data = {f: pd.read_stata(BASE + f + ".dta") for f in files}

# pyreadstat (richest metadata) reads LOCAL files -> download first
import pyreadstat, urllib.request
urllib.request.urlretrieve(BASE + "maketable1.dta", "maketable1.dta")
df, meta = pyreadstat.read_dta("maketable1.dta")

Copy and paste this snippet into a Google Colab notebook: https://colab.research.google.com/notebooks/empty.ipynb

R

# R : haven::read_dta auto-downloads an https URL
library(haven)
BASE <- "https://raw.githubusercontent.com/cmg777/starter-academic-v501/master/content/post/stata_iv/data/"
df <- read_dta(paste0(BASE, "maketable1.dta"))

Overview & sources

Companion data for a hands-on Stata tutorial that replicates Acemoglu, Johnson and Robinson (2001), The Colonial Origins of Comparative Development. The study instruments modern property-rights institutions (avexpr) with European settler mortality during colonization (logem4) to recover the causal effect of institutions on income (logpgp95) across ex-colonies. The naive OLS slope is 0.522; the two-stage least-squares (2SLS) estimate is 0.944, about 81% larger. This gap implies that attenuation from measurement error in the institutions index outweighs any upward bias in OLS, and that institutional reform is nearly twice as valuable as naive regressions suggest. The eight datasets here are AJR's original replication archive: one per table of the paper, each a country cross-section of 163 to 376 rows from which the 64-country base sample of ex-colonies with valid settler-mortality data is selected.
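The 81% figure follows directly from the two coefficients quoted above. A quick arithmetic check, using only those two numbers:

```python
ols_slope = 0.522  # naive OLS coefficient on avexpr, as quoted in the text
iv_slope = 0.944   # 2SLS coefficient on avexpr, as quoted in the text

ratio = iv_slope / ols_slope
pct_larger = (ratio - 1) * 100

print(f"2SLS / OLS = {ratio:.2f}")          # 2SLS / OLS = 1.81
print(f"2SLS is {pct_larger:.0f}% larger")  # 2SLS is 81% larger
```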

Eight cross-sections, one per AJR table. Every file is a country-level cross-section with no time dimension. Rows are keyed on shortnam (3-letter country code), which the variable explorer lists for every file except maketable3. maketable1 and maketable3 carry the full 376-row archive; the others hold 163 rows. The base sample is selected with baseco==1; in the original coding this flag is missing, rather than 0, for non-base countries. The datasets share a common spine (avexpr and logpgp95 in all eight files; shortnam and logem4 in seven each), and each adds the specific controls its table needs: OLS controls (Table 2), first-stage and historical-institution variables (Tables 3 & 8), colonial, legal, and religion controls (Table 5), geography and climate (Table 6), and modern health channels (Table 7).
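Because baseco is missing rather than 0 outside the base sample, a filter on baseco == 0 selects nothing. The sketch below illustrates the pitfall in plain Python; the helper name in_base_sample and the three rows are hypothetical and made up for illustration, not taken from the files.

```python
import math

def in_base_sample(baseco):
    """True only when baseco equals 1; a missing value (None or NaN) counts as not in the sample."""
    if baseco is None:
        return False
    if isinstance(baseco, float) and math.isnan(baseco):
        return False
    return baseco == 1

# Hypothetical rows for illustration only (not real records from the archive).
rows = [
    {"shortnam": "AAA", "baseco": 1.0},
    {"shortnam": "BBB", "baseco": float("nan")},  # how a missing numeric usually arrives from a .dta
    {"shortnam": "CCC", "baseco": None},
]

base = [r["shortnam"] for r in rows if in_base_sample(r["baseco"])]
coded_zero = [r["shortnam"] for r in rows if r["baseco"] == 0]

print(base)        # ['AAA']
print(coded_zero)  # [] : no row is coded 0, so this filter is always empty
```

With a pandas DataFrame loaded as in the Python snippet above, df[df["baseco"] == 1] gives the same selection, and df["baseco"].isna() picks out the non-base rows.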

Data sources

Source | Provides | Reference / URL
Acemoglu, Johnson & Robinson (2001) | All eight datasets (maketable1–maketable8): the original AJR replication archive, one per table of the paper | Acemoglu, D., Johnson, S., & Robinson, J. A. (2001). The Colonial Origins of Comparative Development: An Empirical Investigation. American Economic Review, 91(5), 1369–1401. https://www.aeaweb.org/articles?id=10.1257/aer.91.5.1369
World Bank / Penn World Table | Log PPP GDP per capita 1995 (logpgp95); Hall–Jones GDP per worker (loghjypl) | Underlying income series compiled by AJR from World Bank and Hall & Jones (1999).
Curtin (settler mortality) | European settler/soldier mortality rates used to build extmort4 / logem4 | Mortality figures assembled by AJR from Curtin (1989, 1998) and related historical sources; see Albouy (2012) on imputation.
Method references | Estimators, weak-instrument and overidentification diagnostics | Imbens & Angrist (1994, LATE); Staiger & Stock (1997); Stock & Yogo (2005); Olea & Pflueger (2013); Baum, Schaffer & Stillman (2007, ivreg2).

Cite this data

Please cite this dataset as follows.

APA

Mendez, C. (2026). Do Institutions Cause Prosperity? An IV Tutorial in Stata (AJR 2001 replication data) [Data set]. https://carlos-mendez.org/post/stata_iv/

Acemoglu, D., Johnson, S., & Robinson, J. A. (2001). The Colonial Origins of Comparative Development: An Empirical Investigation. American Economic Review, 91(5), 1369–1401.

Albouy, D. Y. (2012). The Colonial Origins of Comparative Development: An Investigation of the Settler Mortality Data. American Economic Review, 102(6), 3059–3076.

Imbens, G. W., & Angrist, J. D. (1994). Identification and Estimation of Local Average Treatment Effects. Econometrica, 62(2), 467–475.

BibTeX

@misc{mendez2026stataiv,
  author       = {Mendez, Carlos},
  title        = {Do Institutions Cause Prosperity? An IV Tutorial in Stata (AJR 2001 replication data)},
  year         = {2026},
  howpublished = {\url{https://carlos-mendez.org/post/stata_iv/}},
  note         = {Data set}
}

@article{ajr2001colonial,
  author  = {Acemoglu, Daron and Johnson, Simon and Robinson, James A.},
  title   = {The Colonial Origins of Comparative Development: An Empirical Investigation},
  journal = {American Economic Review},
  volume  = {91}, number = {5}, pages = {1369--1401}, year = {2001}
}
@article{albouy2012colonial,
  author  = {Albouy, David Y.},
  title   = {The Colonial Origins of Comparative Development: An Investigation of the Settler Mortality Data},
  journal = {American Economic Review},
  volume  = {102}, number = {6}, pages = {3059--3076}, year = {2012}
}
@article{imbens1994late,
  author  = {Imbens, Guido W. and Angrist, Joshua D.},
  title   = {Identification and Estimation of Local Average Treatment Effects},
  journal = {Econometrica},
  volume  = {62}, number = {2}, pages = {467--475}, year = {1994}
}

Variable explorer

All 55 variables across the eight files. Each row gives the variable type and a short distribution summary.

Variable | Type | Distribution | Label | Definition | Units | In files | Source
africa | dummy | share coded 1 = 0.307 | Africa dummy | 1 if the country is in Africa (continent control). | 0/1 | maketable2, maketable4, maketable7 | AJR (2001)
asia | dummy | share coded 1 = 0.258 | Asia dummy | 1 if the country is in Asia (continent control). | 0/1 | maketable2, maketable4, maketable7 | AJR (2001)
avelf | continuous | min 0, median 0.275, max 1 | Ethnolinguistic fractionalization | Average ethnolinguistic fractionalization index (probability two random people differ). | 0-1 | maketable6 | AJR (2001)
avexpr | continuous | min 1.64, median 7, max 10 | Avg protection against expropriation risk | Average index of protection against expropriation of private investment, ~1985-95 (the endogenous regressor X, 'modern institutions'). | 0-10 scale | maketable1, maketable2, maketable3, maketable4, maketable5, maketable6, maketable7, maketable8 | AJR (2001), from Political Risk Services
baseco | dummy | share coded 1 = 1.000 | Base-sample flag (1 = AJR base sample) | Indicator for the 64-country base sample of ex-colonies with valid settler-mortality data. | 1/missing | maketable1, maketable2, maketable4, maketable5, maketable6, maketable7, maketable8 | AJR (2001)
catho80 | continuous | min 0, median 10.6, max 97.3 | Catholic share of population, 1980 (%) | Percent of population Catholic in 1980 (religion-composition control). | % (0-100) | maketable5 | WCE via AJR
cons00a | continuous | min 1, median 1, max 7 | Constraint on executive in 1900 | Polity constraint-on-the-executive score in 1900 (historical-institution / alternative instrument). | 1-7 scale | maketable1, maketable3, maketable8 | Polity via AJR
cons1 | continuous | min 1, median 3, max 7 | Constraint on executive, 1st year of independence | Polity constraint-on-the-executive score in the country's first year of independence. | 1-7 scale | maketable1, maketable3, maketable8 | Polity via AJR
democ00a | continuous | min 0, median 0, max 10 | Democracy in 1900 | Polity democracy score in 1900 (a historical-institution / alternative-instrument variable). | 0-10 scale | maketable1, maketable3, maketable8 | Polity via AJR
democ1 | continuous | min 0, median 1, max 10 | Democracy, 1st year of independence | Polity democracy score in the country's first year of independence (alternative instrument). | 0-10 scale | maketable8 | Polity via AJR
deslow | dummy | share coded 1 = 0.189 | Desert (low) soil dummy | Soil/climate-zone indicator (1 if low-latitude desert). | 0/1 | maketable6 | AJR (2001)
desmid | dummy | share coded 1 = 0.025 | Desert (mid) soil dummy | Soil/climate-zone indicator (1 if mid-latitude desert). | 0/1 | maketable6 | AJR (2001)
drystep | dummy | share coded 1 = 0.038 | Dry-steppe soil dummy | Soil/climate-zone indicator (1 if dry steppe). | 0/1 | maketable6 | AJR (2001)
drywint | dummy | share coded 1 = 0.006 | Dry-winter climate dummy | Soil/climate-zone indicator (1 if dry-winter climate). | 0/1 | maketable6 | AJR (2001)
edes1975 | continuous | min 0, median 0, max 100 | European descent in 1975 (%) | Percent of the population of European descent in 1975. | % (0-100) | maketable6 | AJR (2001)
euro1900 | continuous | min 0, median 3, max 100 | European settlers in 1900 (% of pop.) | Share of the population that was of European descent in 1900 (also used as an alternative instrument). | % (0-100) | maketable1, maketable3, maketable8 | AJR (2001)
excolony | dummy | share coded 1 = 0.523 | Ex-colony dummy | 1 if the country was ever a European colony (FLOPS definition). | 0/1 | maketable3 | AJR (2001)
extmort4 | continuous | min 2.55, median 85, max 2.94e+03 | Corrected settler mortality rate | Annualized European settler/soldier mortality during colonization (raw level behind logem4). | deaths per 1,000 | maketable1, maketable3 | AJR (2001)
f_brit | dummy | share coded 1 = 0.309 | British colony dummy | 1 if the country was a British colony (colonizer-identity control). | 0/1 | maketable5 | AJR (2001)
f_french | dummy | share coded 1 = 0.148 | French colony dummy | 1 if the country was a French colony (colonizer-identity control). | 0/1 | maketable5 | AJR (2001)
goldm | continuous | min 0, median 0, max 47 | Gold mineral measure | First of five mineral-resource measures (gold). | resource units | maketable6 | AJR (2001)
humid1 | continuous | min 18, median 70, max 97 | Humidity indicator 1 (of 4) | First of four humidity indices used as climate controls. | index | maketable6 | AJR (2001)
humid2 | continuous | min 54, median 88, max 98 | Humidity indicator 2 (of 4) | Second of four humidity indices (climate control). | index | maketable6 | AJR (2001)
humid3 | continuous | min 10, median 52, max 86 | Humidity indicator 3 (of 4) | Third of four humidity indices (climate control). | index | maketable6 | AJR (2001)
humid4 | continuous | min 35, median 74, max 92 | Humidity indicator 4 (of 4) | Fourth of four humidity indices (climate control). | index | maketable6 | AJR (2001)
imr95 | continuous | min 4.9, median 49.5, max 170 | Infant mortality rate, 1995 | Infant mortality rate in 1995, deaths per 1,000 live births (modern health channel). | per 1,000 births | maketable7 | AJR (2001)
indtime | continuous | min 5, median 37, max 195 | Years independent (1995 - first year) | Number of years a country had been independent by 1995. | years | maketable3, maketable8 | AJR (2001)
iron | continuous | min 0, median 0, max 16 | Iron mineral measure | Iron mineral-resource measure (geology control). | resource units | maketable6 | AJR (2001)
landlock | dummy | share coded 1 = 0.196 | Landlocked dummy | 1 if the country is landlocked (geography control). | 0/1 | maketable6 | AJR (2001)
lat_abst | continuous | min 0, median 0.267, max 0.722 | Absolute latitude (scaled 0-1) | Absolute latitude of the capital, divided by 90 (a geography control). | 0-1 | maketable2, maketable3, maketable4, maketable5, maketable6, maketable7, maketable8 | AJR (2001)
latabs | continuous | min 0, median 0.15, max 0.667 | Absolute latitude (McArthur-Sachs) | Absolute latitude (0-1 scaled), McArthur-Sachs version, used as a geography instrument. | 0-1 | maketable7 | McArthur & Sachs via AJR
leb95 | continuous | min 37.2, median 65.7, max 79 | Life expectancy at birth, 1995 | Life expectancy at birth in 1995 (modern health channel). | years | maketable7 | AJR (2001)
logem4 | continuous | min 0.936, median 4.44, max 7.99 | Log settler mortality | Natural log of European settler/soldier mortality during early colonization (the instrument Z). | log deaths per 1,000 | maketable1, maketable3, maketable4, maketable5, maketable6, maketable7, maketable8 | AJR (2001), from Curtin and related sources
loghjypl | continuous | min -3.54, median -1.55, max 0 | Log GDP per worker (Hall-Jones) | Natural log of GDP per worker from Hall & Jones (1999); an alternative income measure. | log US$ | maketable1, maketable2, maketable4 | Hall & Jones (1999) via AJR
logpgp95 | continuous | min 6.11, median 8.3, max 10.3 | Log GDP per capita, PPP, 1995 | Natural log of 1995 PPP GDP per capita (the outcome Y). | log US$ (PPP) | maketable1, maketable2, maketable3, maketable4, maketable5, maketable6, maketable7, maketable8 | AJR (2001), from World Bank
lt100km | continuous | min 0, median 0.239, max 1 | Share of land within 100km of coast | Fraction of territory within 100 km of the coast (geography instrument). | 0-1 | maketable7 | McArthur & Sachs via AJR
malfal94 | continuous | min 0, median 0.0005, max 1 | Falciparum malaria index, 1994 | Index of falciparum-malaria prevalence in 1994 (modern health channel). | 0-1 | maketable7 | AJR (2001), from Gallup-Sachs
meantemp | continuous | min -0.2, median 24.5, max 29.3 | Mean temperature (McArthur-Sachs) | Mean annual temperature, used as a geography instrument in overidentified specs. | deg C | maketable7 | McArthur & Sachs via AJR
muslim80 | continuous | min 0, median 2.05, max 99.8 | Muslim share of population, 1980 (%) | Percent of population Muslim in 1980 (religion-composition control). | % (0-100) | maketable5 | WCE via AJR
no_cpm80 | continuous | min 0.1, median 21, max 100 | Other-religion share, 1980 (%) | 100 minus Catholic, Protestant, and Muslim shares in 1980 (residual religion-composition control). | % (0-100) | maketable5 | WCE via AJR
oilres | continuous | min 0, median 0, max 1.57e+07 | Oil-reserves measure | Oil-reserves measure (resource control). | resource units | maketable6 | AJR (2001)
other | dummy | share coded 1 = 0.025 | Other-continent dummy | 1 if the country is not in Asia, Africa, or the Americas (continent control). | 0/1 | maketable2 | AJR (2001)
rich4 | dummy | share coded 1 = 0.025 | Neo-Europe dummy | 1 for the 'neo-Europes' (e.g. Australia, Canada, New Zealand, USA); dropped in some robustness columns. | 0/1 | maketable4 | AJR (2001)
shortnam | identifier | | Country code (3-letter) | Three-letter country identifier; the row key in every dataset. | string | maketable1, maketable2, maketable4, maketable5, maketable6, maketable7, maketable8 | AJR (2001)
silv | continuous | min 0, median 0, max 13 | Silver mineral measure | Silver mineral-resource measure (geology control). | resource units | maketable6 | AJR (2001)
sjlofr | dummy | share coded 1 = 0.469 | French legal origin dummy | 1 if the country has French legal origin (legal-tradition control). | 0/1 | maketable5 | AJR (2001), legal-origins literature
steplow | dummy | share coded 1 = 0.208 | Steppe (low) soil dummy | First of six soil/climate-zone indicators (1 if low-latitude steppe). | 0/1 | maketable6 | AJR (2001)
stepmid | dummy | share coded 1 = 0.057 | Steppe (mid) soil dummy | Soil/climate-zone indicator (1 if mid-latitude steppe). | 0/1 | maketable6 | AJR (2001)
temp1 | continuous | min -4, median 21, max 32 | Temperature indicator 1 (of 5) | First of five temperature indices used as climate controls. | index (deg C) | maketable6 | AJR (2001)
temp2 | continuous | min -6, median 26, max 40 | Temperature indicator 2 (of 5) | Second of five temperature indices (climate control). | index (deg C) | maketable6 | AJR (2001)
temp3 | continuous | min 23, median 38, max 49 | Temperature indicator 3 (of 5) | Third of five temperature indices (climate control). | index (deg C) | maketable6 | AJR (2001)
temp4 | continuous | min -44, median 0, max 20 | Temperature indicator 4 (of 5) | Fourth of five temperature indices (climate control). | index (deg C) | maketable6 | AJR (2001)
temp5 | continuous | min 1, median 15, max 26 | Temperature indicator 5 (of 5) | Fifth of five temperature indices (climate control). | index (deg C) | maketable6 | AJR (2001)
yellow | dummy | share coded 1 = 0.472 | Yellow-fever vector dummy | 1 if the yellow-fever vector is present today (disease-environment control). | 0/1 | maketable7 | AJR (2001)
zinc | continuous | min 0, median 0, max 15 | Zinc mineral measure | Zinc mineral-resource measure (geology control). | resource units | maketable6 | AJR (2001)

Cross-file variable index

Which file each variable appears in (● = present).

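Variable | maketable1 | maketable2 | maketable3 | maketable4 | maketable5 | maketable6 | maketable7 | maketable8
africa | - | ● | - | ● | - | - | ● | -
asia | - | ● | - | ● | - | - | ● | -
avelf | - | - | - | - | - | ● | - | -
avexpr | ● | ● | ● | ● | ● | ● | ● | ●
baseco | ● | ● | - | ● | ● | ● | ● | ●
catho80 | - | - | - | - | ● | - | - | -
cons00a | ● | - | ● | - | - | - | - | ●
cons1 | ● | - | ● | - | - | - | - | ●
democ00a | ● | - | ● | - | - | - | - | ●
democ1 | - | - | - | - | - | - | - | ●
deslow | - | - | - | - | - | ● | - | -
desmid | - | - | - | - | - | ● | - | -
drystep | - | - | - | - | - | ● | - | -
drywint | - | - | - | - | - | ● | - | -
edes1975 | - | - | - | - | - | ● | - | -
euro1900 | ● | - | ● | - | - | - | - | ●
excolony | - | - | ● | - | - | - | - | -
extmort4 | ● | - | ● | - | - | - | - | -
f_brit | - | - | - | - | ● | - | - | -
f_french | - | - | - | - | ● | - | - | -
goldm | - | - | - | - | - | ● | - | -
humid1 | - | - | - | - | - | ● | - | -
humid2 | - | - | - | - | - | ● | - | -
humid3 | - | - | - | - | - | ● | - | -
humid4 | - | - | - | - | - | ● | - | -
imr95 | - | - | - | - | - | - | ● | -
indtime | - | - | ● | - | - | - | - | ●
iron | - | - | - | - | - | ● | - | -
landlock | - | - | - | - | - | ● | - | -
lat_abst | - | ● | ● | ● | ● | ● | ● | ●
latabs | - | - | - | - | - | - | ● | -
leb95 | - | - | - | - | - | - | ● | -
logem4 | ● | - | ● | ● | ● | ● | ● | ●
loghjypl | ● | ● | - | ● | - | - | - | -
logpgp95 | ● | ● | ● | ● | ● | ● | ● | ●
lt100km | - | - | - | - | - | - | ● | -
malfal94 | - | - | - | - | - | - | ● | -
meantemp | - | - | - | - | - | - | ● | -
muslim80 | - | - | - | - | ● | - | - | -
no_cpm80 | - | - | - | - | ● | - | - | -
oilres | - | - | - | - | - | ● | - | -
other | - | ● | - | - | - | - | - | -
rich4 | - | - | - | ● | - | - | - | -
shortnam | ● | ● | - | ● | ● | ● | ● | ●
silv | - | - | - | - | - | ● | - | -
sjlofr | - | - | - | - | ● | - | - | -
steplow | - | - | - | - | - | ● | - | -
stepmid | - | - | - | - | - | ● | - | -
temp1 | - | - | - | - | - | ● | - | -
temp2 | - | - | - | - | - | ● | - | -
temp3 | - | - | - | - | - | ● | - | -
temp4 | - | - | - | - | - | ● | - | -
temp5 | - | - | - | - | - | ● | - | -
yellow | - | - | - | - | - | - | ● | -
zinc | - | - | - | - | - | ● | - | -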

Construction & formulas

The tutorial estimates the causal effect of institutions on income via two-stage least squares (2SLS). The structural model has an endogenous regressor X = avexpr correlated with the error, an outcome Y = logpgp95, and an instrument Z = logem4:
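In standard textbook notation, the model can be sketched as follows. The symbols α, β, π₀, π₁, ε and ν are assumed here for illustration, and additional controls are omitted:

```latex
\begin{aligned}
\text{Structural equation:}\quad & Y_i = \alpha + \beta X_i + \varepsilon_i, \qquad \operatorname{Cov}(X_i, \varepsilon_i) \neq 0 \\
\text{First stage:}\quad & X_i = \pi_0 + \pi_1 Z_i + \nu_i \\
\text{Instrument conditions:}\quad & \operatorname{Cov}(Z_i, \varepsilon_i) = 0, \qquad \pi_1 \neq 0 \\
\text{IV estimand (one instrument):}\quad & \beta_{IV} = \frac{\operatorname{Cov}(Z_i, Y_i)}{\operatorname{Cov}(Z_i, X_i)}
\end{aligned}
```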

The 2SLS estimate identifies a Local Average Treatment Effect (LATE) for "complier" countries whose institutions would respond to a change in settler mortality (Imbens & Angrist 1994); under constant effects, LATE = ATE.
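With one instrument and no additional controls, the 2SLS slope reduces to the ratio of the reduced-form covariance to the first-stage covariance. The minimal sketch below uses synthetic data (not the AJR sample; every number in the data-generating step is made up) to show how measurement error in X pulls OLS toward zero while the IV ratio stays near the true effect. The function names are illustrative only.

```python
import random

def cov(a, b):
    """Sample covariance of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    return sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b)) / (n - 1)

def ols_slope(x, y):
    """Slope from regressing y on x with an intercept."""
    return cov(x, y) / cov(x, x)

def iv_slope(z, x, y):
    """Just-identified IV slope: Cov(z, y) / Cov(z, x)."""
    return cov(z, y) / cov(z, x)

# Synthetic data: the true effect of x on y is 1.0, the instrument z lowers x,
# and x is only observed with classical measurement error.
random.seed(0)
n = 5000
z = [random.gauss(0, 1) for _ in range(n)]
x_true = [-0.6 * zi + random.gauss(0, 1) for zi in z]
y = [1.0 * xt + random.gauss(0, 1) for xt in x_true]
x_obs = [xt + random.gauss(0, 1) for xt in x_true]

b_ols = ols_slope(x_obs, y)    # attenuated toward zero
b_iv = iv_slope(z, x_obs, y)   # close to the true effect of 1.0

print(f"OLS slope: {b_ols:.2f}")
print(f"IV slope:  {b_iv:.2f}")
```

The same ratio applied to logem4, avexpr and logpgp95 would give the no-controls IV slope; standard errors and specifications with controls need a full 2SLS routine.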

The datasets

Each dataset below has a full variable dictionary followed by a statistics table with distribution summaries and data coverage.


maketable1 · country (cross-section) · 376 × 11 · GDP 1995; institutions 1985-95; settlers 1900; mortality colonial era · ~376 rows; 64 base-sample ex-colonies (baseco==1)

Panel key: shortnam · Source for Table 1 summary statistics on the whole world and the AJR base sample.

Variable dictionary

Variable | Type | Label | Definition | Construction | Units | Source | Coverage
shortnam | identifier | Country code (3-letter) | Three-letter country identifier; the row key in every dataset. | AJR country abbreviation (e.g. AUS, USA, NGA); used for scatter point labels (mlabel). | string | AJR (2001) | all files
euro1900 | continuous | European settlers in 1900 (% of pop.) | Share of the population that was of European descent in 1900 (also used as an alternative instrument). | Percent European in 1900, AJR. | % (0-100) | AJR (2001) | tables 1, 3, 8
avexpr | continuous | Avg protection against expropriation risk | Average index of protection against expropriation of private investment, ~1985-95 (the endogenous regressor X, 'modern institutions'). | Mean over available years of the Political Risk Services expropriation-risk index, scaled 0 (worst) to 10 (best). | 0-10 scale | AJR (2001), from Political Risk Services | all files
logpgp95 | continuous | Log GDP per capita, PPP, 1995 | Natural log of 1995 PPP GDP per capita (the outcome Y). | log of World Bank PPP GDP per capita, 1995. | log US$ (PPP) | AJR (2001), from World Bank | all files
cons1 | continuous | Constraint on executive, 1st year of independence | Polity constraint-on-the-executive score in the country's first year of independence. | Polity index, 1 (low) to 7 (high) constraint. | 1-7 scale | Polity via AJR | tables 1, 3, 8
democ00a | continuous | Democracy in 1900 | Polity democracy score in 1900 (a historical-institution / alternative-instrument variable). | Polity democracy index, 0 (low) to 10 (high). | 0-10 scale | Polity via AJR | tables 1, 3, 8
cons00a | continuous | Constraint on executive in 1900 | Polity constraint-on-the-executive score in 1900 (historical-institution / alternative instrument). | Polity index, 1 (low) to 7 (high) constraint. | 1-7 scale | Polity via AJR | tables 1, 3, 8
extmort4 | continuous | Corrected settler mortality rate | Annualized European settler/soldier mortality during colonization (raw level behind logem4). | Deaths per 1,000 mean strength, corrected/standardized by AJR (≈2.55 to 2,940). | deaths per 1,000 | AJR (2001) | tables 1, 3
logem4 | continuous | Log settler mortality | Natural log of European settler/soldier mortality during early colonization (the instrument Z). | log of corrected annualized deaths per 1,000 (extmort4). | log deaths per 1,000 | AJR (2001), from Curtin and related sources | tables 1, 3, 4, 5, 6, 7, 8
loghjypl | continuous | Log GDP per worker (Hall-Jones) | Natural log of GDP per worker from Hall & Jones (1999); an alternative income measure. | log of Hall–Jones output per worker. | log US$ | Hall & Jones (1999) via AJR | tables 1, 2, 4
baseco | dummy | Base-sample flag (1 = AJR base sample) | Indicator for the 64-country base sample of ex-colonies with valid settler-mortality data. | 1 for base-sample countries; MISSING (not 0) otherwise. Restrict with keep if baseco==1. | 1/missing | AJR (2001) | tables 1, 2, 4, 5, 6, 7, 8
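The construction of logem4 as the natural log of extmort4 can be checked against the ranges quoted in this dictionary. The short sketch below uses only the extmort4 endpoints given above (≈2.55 and 2,940) and compares them with the logem4 minimum and maximum (0.936 and 7.99):

```python
import math

# Endpoints of extmort4 as quoted in the dictionary (deaths per 1,000).
extmort4_min, extmort4_max = 2.55, 2940.0

logem4_min = math.log(extmort4_min)  # natural log
logem4_max = math.log(extmort4_max)

print(f"{logem4_min:.3f}")  # 0.936
print(f"{logem4_max:.2f}")  # 7.99
```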

Distribution & statistics

Variable | Distribution | Coverage | N | Distinct | Min | Mean | Median | Max | SD
shortnam | | 100% | 376 | 254 | | | | |
euro1900 | min 0, median 3, max 100 | 44% | 166 | 27 | 0 | 30.10 | 3.00 | 100.0 | 41.86
avexpr | min 1.64, median 7, max 10 | 34% | 129 | 84 | 1.64 | 6.99 | 7.00 | 10.00 | 1.83
logpgp95 | min 6.11, median 8.3, max 10.3 | 43% | 162 | 151 | 6.11 | 8.30 | 8.30 | 10.29 | 1.07
cons1 | min 1, median 3, max 7 | 24% | 92 | 6 | 1.00 | 3.63 | 3.00 | 7.00 | 2.39
democ00a | min 0, median 0, max 10 | 24% | 90 | 9 | 0 | 1.12 | 0 | 10.00 | 2.54
cons00a | min 1, median 1, max 7 | 26% | 96 | 6 | 1.00 | 1.85 | 1.00 | 7.00 | 1.79
extmort4 | min 2.55, median 85, max 2.94e+03 | 25% | 94 | 44 | 2.55 | 215.0 | 85.00 | 2,940.0 | 398.1
logem4 | min 0.936, median 4.44, max 7.99 | 24% | 89 | 44 | 0.936 | 4.61 | 4.44 | 7.99 | 1.30
loghjypl | min -3.54, median -1.55, max 0 | 34% | 127 | 108 | -3.54 | -1.71 | -1.55 | 0 | 1.08
baseco | share coded 1 = 1.000 | 17% | 64 | 1 | 1.00 | 1.00 | 1.00 | 1.00 | 0
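The Coverage column appears to be the non-missing count N divided by the file's row count, rounded to a whole percent. That reading is an assumption, but it reproduces the figures above for the 376-row maketable1; the helper name coverage_pct is hypothetical.

```python
def coverage_pct(n_nonmissing, n_rows):
    """Share of rows with a non-missing value, as a whole-number percent."""
    return round(100 * n_nonmissing / n_rows)

n_rows = 376  # rows in maketable1
counts = {"euro1900": 166, "avexpr": 129, "logem4": 89, "baseco": 64}  # N column above

coverage = {name: coverage_pct(n, n_rows) for name, n in counts.items()}
print(coverage)  # {'euro1900': 44, 'avexpr': 34, 'logem4': 24, 'baseco': 17}
```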

maketable2 · country (cross-section) · 163 × 9 · GDP 1995; institutions 1985-95 · ~163 rows; 64 base-sample ex-colonies

Panel key: shortnam · Naive OLS regressions of logpgp95 on avexpr with latitude and continent controls.

Variable dictionary

Variable | Type | Label | Definition | Construction | Units | Source | Coverage
shortnam | identifier | Country code (3-letter) | Three-letter country identifier; the row key in every dataset. | AJR country abbreviation (e.g. AUS, USA, NGA); used for scatter point labels (mlabel). | string | AJR (2001) | all files
africa | dummy | Africa dummy | 1 if the country is in Africa (continent control). | Binary continent indicator. | 0/1 | AJR (2001) | tables 2, 4, 7
lat_abst | continuous | Absolute latitude (scaled 0-1) | Absolute latitude of the capital, divided by 90 (a geography control). | abs(latitude of capital) / 90. | 0-1 | AJR (2001) | tables 2, 3, 4, 5, 6, 7, 8
avexpr | continuous | Avg protection against expropriation risk | Average index of protection against expropriation of private investment, ~1985-95 (the endogenous regressor X, 'modern institutions'). | Mean over available years of the Political Risk Services expropriation-risk index, scaled 0 (worst) to 10 (best). | 0-10 scale | AJR (2001), from Political Risk Services | all files
logpgp95 | continuous | Log GDP per capita, PPP, 1995 | Natural log of 1995 PPP GDP per capita (the outcome Y). | log of World Bank PPP GDP per capita, 1995. | log US$ (PPP) | AJR (2001), from World Bank | all files
other | dummy | Other-continent dummy | 1 if the country is not in Asia, Africa, or the Americas (continent control). | Binary continent indicator. | 0/1 | AJR (2001) | table 2
asia | dummy | Asia dummy | 1 if the country is in Asia (continent control). | Binary continent indicator. | 0/1 | AJR (2001) | tables 2, 4, 7
loghjypl | continuous | Log GDP per worker (Hall-Jones) | Natural log of GDP per worker from Hall & Jones (1999); an alternative income measure. | log of Hall–Jones output per worker. | log US$ | Hall & Jones (1999) via AJR | tables 1, 2, 4
baseco | dummy | Base-sample flag (1 = AJR base sample) | Indicator for the 64-country base sample of ex-colonies with valid settler-mortality data. | 1 for base-sample countries; MISSING (not 0) otherwise. Restrict with keep if baseco==1. | 1/missing | AJR (2001) | tables 1, 2, 4, 5, 6, 7, 8
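The lat_abst construction, abs(latitude of capital) / 90, maps any latitude in degrees onto a 0-1 scale. A minimal sketch follows; the function name and the example latitudes are hypothetical and not taken from the data.

```python
def lat_abst(latitude_deg):
    """Absolute latitude in degrees, rescaled to 0-1 by dividing by 90."""
    if not -90.0 <= latitude_deg <= 90.0:
        raise ValueError("latitude must lie between -90 and 90 degrees")
    return abs(latitude_deg) / 90.0

print(lat_abst(-45.0))          # 0.5 (southern latitudes map to the same value as northern ones)
print(lat_abst(0.0))            # 0.0 (on the equator)
print(round(lat_abst(65.0), 3)) # 0.722
```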

Distribution & statistics

Variable | Distribution | Coverage | N | Distinct | Min | Mean | Median | Max | SD
shortnam | | 100% | 163 | 163 | | | | |
africa | share coded 1 = 0.307 | 100% | 163 | 2 | 0 | 0.307 | 0 | 1.00 | 0.463
lat_abst | min 0, median 0.267, max 0.722 | 99% | 162 | 96 | 0 | 0.296 | 0.267 | 0.722 | 0.190
avexpr | min 1.64, median 7.05, max 10 | 74% | 121 | 80 | 1.64 | 7.07 | 7.05 | 10.00 | 1.80
logpgp95 | min 6.11, median 8.27, max 10.3 | 91% | 148 | 138 | 6.11 | 8.30 | 8.27 | 10.29 | 1.11
other | share coded 1 = 0.025 | 100% | 163 | 2 | 0 | 0.025 | 0 | 1.00 | 0.155
asia | share coded 1 = 0.258 | 100% | 163 | 2 | 0 | 0.258 | 0 | 1.00 | 0.439
loghjypl | min -3.54, median -1.56, max 0 | 75% | 123 | 104 | -3.54 | -1.73 | -1.56 | 0 | 1.08
baseco | share coded 1 = 1.000 | 39% | 64 | 1 | 1.00 | 1.00 | 1.00 | 1.00 | 0

maketable3 · country (cross-section) · 376 × 11 · settlers/mortality colonial era; institutions 1900 and 1st-year-indep · ~376 rows

Panel key: shortnam · First-stage and historical-institution determinants of modern expropriation protection.

Variable dictionary

Variable | Type | Label | Definition | Construction | Units | Source | Coverage
lat_abst | continuous | Absolute latitude (scaled 0-1) | Absolute latitude of the capital, divided by 90 (a geography control). | abs(latitude of capital) / 90. | 0-1 | AJR (2001) | tables 2, 3, 4, 5, 6, 7, 8
euro1900 | continuous | European settlers in 1900 (% of pop.) | Share of the population that was of European descent in 1900 (also used as an alternative instrument). | Percent European in 1900, AJR. | % (0-100) | AJR (2001) | tables 1, 3, 8
excolony | dummy | Ex-colony dummy | 1 if the country was ever a European colony (FLOPS definition). | Binary indicator from AJR/FLOPS coding. | 0/1 | AJR (2001) | table 3
avexpr | continuous | Avg protection against expropriation risk | Average index of protection against expropriation of private investment, ~1985-95 (the endogenous regressor X, 'modern institutions'). | Mean over available years of the Political Risk Services expropriation-risk index, scaled 0 (worst) to 10 (best). | 0-10 scale | AJR (2001), from Political Risk Services | all files
logpgp95 | continuous | Log GDP per capita, PPP, 1995 | Natural log of 1995 PPP GDP per capita (the outcome Y). | log of World Bank PPP GDP per capita, 1995. | log US$ (PPP) | AJR (2001), from World Bank | all files
cons1 | continuous | Constraint on executive, 1st year of independence | Polity constraint-on-the-executive score in the country's first year of independence. | Polity index, 1 (low) to 7 (high) constraint. | 1-7 scale | Polity via AJR | tables 1, 3, 8
indtime | continuous | Years independent (1995 - first year) | Number of years a country had been independent by 1995. | 1995 minus the first year of independence. | years | AJR (2001) | tables 3, 8
democ00a | continuous | Democracy in 1900 | Polity democracy score in 1900 (a historical-institution / alternative-instrument variable). | Polity democracy index, 0 (low) to 10 (high). | 0-10 scale | Polity via AJR | tables 1, 3, 8
cons00a | continuous | Constraint on executive in 1900 | Polity constraint-on-the-executive score in 1900 (historical-institution / alternative instrument). | Polity index, 1 (low) to 7 (high) constraint. | 1-7 scale | Polity via AJR | tables 1, 3, 8
extmort4 | continuous | Corrected settler mortality rate | Annualized European settler/soldier mortality during colonization (raw level behind logem4). | Deaths per 1,000 mean strength, corrected/standardized by AJR (≈2.55 to 2,940). | deaths per 1,000 | AJR (2001) | tables 1, 3
logem4 | continuous | Log settler mortality | Natural log of European settler/soldier mortality during early colonization (the instrument Z). | log of corrected annualized deaths per 1,000 (extmort4). | log deaths per 1,000 | AJR (2001), from Curtin and related sources | tables 1, 3, 4, 5, 6, 7, 8

Distribution & statistics

Variable | Distribution | Coverage | N | Distinct | Min | Mean | Median | Max | SD
lat_abst | min 0, median 0.256, max 0.722 | 45% | 170 | 101 | 0 | 0.294 | 0.256 | 0.722 | 0.189
euro1900 | min 0, median 3, max 100 | 44% | 166 | 27 | 0 | 30.10 | 3.00 | 100.0 | 41.86
excolony | share coded 1 = 0.523 | 58% | 218 | 2 | 0 | 0.523 | 1.00 | 1.00 | 0.501
avexpr | min 1.64, median 7, max 10 | 34% | 129 | 84 | 1.64 | 6.99 | 7.00 | 10.00 | 1.83
logpgp95 | min 6.11, median 8.3, max 10.3 | 43% | 162 | 151 | 6.11 | 8.30 | 8.30 | 10.29 | 1.07
cons1 | min 1, median 3, max 7 | 24% | 92 | 6 | 1.00 | 3.63 | 3.00 | 7.00 | 2.39
indtime | min 5, median 37, max 195 | 24% | 92 | 47 | 5.00 | 77.66 | 37.00 | 195.0 | 61.49
democ00a | min 0, median 0, max 10 | 24% | 90 | 9 | 0 | 1.12 | 0 | 10.00 | 2.54
cons00a | min 1, median 1, max 7 | 26% | 96 | 6 | 1.00 | 1.85 | 1.00 | 7.00 | 1.79
extmort4 | min 2.55, median 85, max 2.94e+03 | 25% | 94 | 44 | 2.55 | 215.0 | 85.00 | 2,940.0 | 398.1
logem4 | min 0.936, median 4.44, max 7.99 | 24% | 89 | 44 | 0.936 | 4.61 | 4.44 | 7.99 | 1.30

maketable4 · country (cross-section) · 163 × 10 · GDP 1995; institutions 1985-95; mortality colonial era · ~163 rows; 64 base-sample ex-colonies

Panel key: shortnam · The headline 2SLS regression: instrument avexpr with logem4 (effect = 0.944).

Variable dictionary

Variable | Type | Label | Definition | Construction | Units | Source | Coverage
shortnam | identifier | Country code (3-letter) | Three-letter country identifier; the row key in every dataset. | AJR country abbreviation (e.g. AUS, USA, NGA); used for scatter point labels (mlabel). | string | AJR (2001) | all files
africa | dummy | Africa dummy | 1 if the country is in Africa (continent control). | Binary continent indicator. | 0/1 | AJR (2001) | tables 2, 4, 7
lat_abst | continuous | Absolute latitude (scaled 0-1) | Absolute latitude of the capital, divided by 90 (a geography control). | abs(latitude of capital) / 90. | 0-1 | AJR (2001) | tables 2, 3, 4, 5, 6, 7, 8
rich4 | dummy | Neo-Europe dummy | 1 for the 'neo-Europes' (e.g. Australia, Canada, New Zealand, USA); dropped in some robustness columns. | Binary indicator from AJR. | 0/1 | AJR (2001) | table 4
avexpr | continuous | Avg protection against expropriation risk | Average index of protection against expropriation of private investment, ~1985-95 (the endogenous regressor X, 'modern institutions'). | Mean over available years of the Political Risk Services expropriation-risk index, scaled 0 (worst) to 10 (best). | 0-10 scale | AJR (2001), from Political Risk Services | all files
logpgp95 | continuous | Log GDP per capita, PPP, 1995 | Natural log of 1995 PPP GDP per capita (the outcome Y). | log of World Bank PPP GDP per capita, 1995. | log US$ (PPP) | AJR (2001), from World Bank | all files
logem4 | continuous | Log settler mortality | Natural log of European settler/soldier mortality during early colonization (the instrument Z). | log of corrected annualized deaths per 1,000 (extmort4). | log deaths per 1,000 | AJR (2001), from Curtin and related sources | tables 1, 3, 4, 5, 6, 7, 8
asia | dummy | Asia dummy | 1 if the country is in Asia (continent control). | Binary continent indicator. | 0/1 | AJR (2001) | tables 2, 4, 7
loghjypl | continuous | Log GDP per worker (Hall-Jones) | Natural log of GDP per worker from Hall & Jones (1999); an alternative income measure. | log of Hall–Jones output per worker. | log US$ | Hall & Jones (1999) via AJR | tables 1, 2, 4
baseco | dummy | Base-sample flag (1 = AJR base sample) | Indicator for the 64-country base sample of ex-colonies with valid settler-mortality data. | 1 for base-sample countries; MISSING (not 0) otherwise. Restrict with keep if baseco==1. | 1/missing | AJR (2001) | tables 1, 2, 4, 5, 6, 7, 8

Distribution & statistics

Variable | Distribution | Coverage | N | Distinct | Min | Mean | Median | Max | SD
shortnam | | 100% | 163 | 163 | | | | |
africa | share coded 1 = 0.307 | 100% | 163 | 2 | 0 | 0.307 | 0 | 1.00 | 0.463
lat_abst | min 0, median 0.267, max 0.722 | 99% | 162 | 96 | 0 | 0.296 | 0.267 | 0.722 | 0.190
rich4 | share coded 1 = 0.025 | 100% | 163 | 2 | 0 | 0.025 | 0 | 1.00 | 0.155
avexpr | min 1.64, median 7.05, max 10 | 74% | 121 | 80 | 1.64 | 7.07 | 7.05 | 10.00 | 1.80
logpgp95 | min 6.11, median 8.27, max 10.3 | 91% | 148 | 138 | 6.11 | 8.30 | 8.27 | 10.29 | 1.11
logem4 | min 0.936, median 4.44, max 7.99 | 53% | 87 | 44 | 0.936 | 4.60 | 4.44 | 7.99 | 1.30
asia | share coded 1 = 0.258 | 100% | 163 | 2 | 0 | 0.258 | 0 | 1.00 | 0.439
loghjypl | min -3.54, median -1.56, max 0 | 75% | 123 | 104 | -3.54 | -1.73 | -1.56 | 0 | 1.08
baseco | share coded 1 = 1.000 | 39% | 64 | 1 | 1.00 | 1.00 | 1.00 | 1.00 | 0

maketable5 · country (cross-section) · 163 × 12 · GDP 1995; religion shares 1980 · ~163 rows; 64 base-sample ex-colonies

Panel key: shortnam · Robustness: add British/French colonizer, French legal origin, and religion shares.

Variable dictionary

| Variable | Label | Definition | Construction | Units | Source | Coverage |
|---|---|---|---|---|---|---|
| shortnam (identifier) | Country code (3-letter) | Three-letter country identifier; the row key in every dataset. | AJR country abbreviation (e.g. AUS, USA, NGA); used for scatter point labels (mlabel). | string | AJR (2001) | all files |
| catho80 (continuous) | Catholic share of population, 1980 (%) | Percent of population Catholic in 1980 (religion-composition control). | Percent Catholic, World Christian Encyclopedia 1995. | % (0-100) | WCE via AJR | table 5 |
| muslim80 (continuous) | Muslim share of population, 1980 (%) | Percent of population Muslim in 1980 (religion-composition control). | Percent Muslim, World Christian Encyclopedia 1995. | % (0-100) | WCE via AJR | table 5 |
| lat_abst (continuous) | Absolute latitude (scaled 0-1) | Absolute latitude of the capital, divided by 90 (a geography control). | abs(latitude of capital) / 90. | 0-1 | AJR (2001) | tables 2, 3, 4, 5, 6, 7, 8 |
| no_cpm80 (continuous) | Other-religion share, 1980 (%) | 100 minus Catholic, Protestant, and Muslim shares in 1980 (residual religion-composition control). | 100 - (Catholic + Protestant + Muslim) percent, 1980. | % (0-100) | WCE via AJR | table 5 |
| f_brit (dummy) | British colony dummy | 1 if the country was a British colony (colonizer-identity control). | Binary indicator (FLOPS expansion). | 0/1 | AJR (2001) | table 5 |
| f_french (dummy) | French colony dummy | 1 if the country was a French colony (colonizer-identity control). | Binary indicator (FLOPS expansion). | 0/1 | AJR (2001) | table 5 |
| avexpr (continuous) | Avg protection against expropriation risk | Average index of protection against expropriation of private investment, ~1985-95 (the endogenous regressor X, 'modern institutions'). | Mean over available years of the Political Risk Services expropriation-risk index, scaled 0 (worst) to 10 (best). | 0-10 scale | AJR (2001), from Political Risk Services | all files |
| sjlofr (dummy) | French legal origin dummy | 1 if the country has French legal origin (legal-tradition control). | Recoded French-legal-origin indicator. | 0/1 | AJR (2001), legal-origins literature | table 5 |
| logpgp95 (continuous) | Log GDP per capita, PPP, 1995 | Natural log of 1995 PPP GDP per capita (the outcome Y). | log of World Bank PPP GDP per capita, 1995. | log US$ (PPP) | AJR (2001), from World Bank | all files |
| logem4 (continuous) | Log settler mortality | Natural log of European settler/soldier mortality during early colonization (the instrument Z). | log of corrected annualized deaths per 1,000 (extmort4). | log deaths per 1,000 | AJR (2001), from Curtin and related sources | tables 1, 3, 4, 5, 6, 7, 8 |
| baseco (dummy) | Base-sample flag (1 = AJR base sample) | Indicator for the 64-country base sample of ex-colonies with valid settler-mortality data. | 1 for base-sample countries; MISSING (not 0) otherwise. Restrict with keep if baseco==1. | 1/missing | AJR (2001) | tables 1, 2, 4, 5, 6, 7, 8 |

Distribution & statistics

| Variable | Distribution | Coverage | N | Distinct | Min | Mean | Median | Max | SD |
|---|---|---|---|---|---|---|---|---|---|
| shortnam | | 100% | 163 | 163 | | | | | |
| catho80 | min 0, median 10.6, max 97.3 | 99% | 162 | 114 | 0 | 30.04 | 10.60 | 97.30 | 35.61 |
| muslim80 | min 0, median 2.05, max 99.8 | 99% | 162 | 95 | 0 | 25.29 | 2.05 | 99.80 | 36.88 |
| lat_abst | min 0, median 0.267, max 0.722 | 99% | 162 | 96 | 0 | 0.296 | 0.267 | 0.722 | 0.190 |
| no_cpm80 | min 0.1, median 21, max 100 | 98% | 160 | 154 | 0.100 | 32.71 | 21.00 | 100.0 | 32.19 |
| f_brit | share coded 1 = 0.309 | 99% | 162 | 2 | 0 | 0.309 | 0 | 1.00 | 0.463 |
| f_french | share coded 1 = 0.148 | 99% | 162 | 2 | 0 | 0.148 | 0 | 1.00 | 0.356 |
| avexpr | min 1.64, median 7.05, max 10 | 74% | 121 | 80 | 1.64 | 7.07 | 7.05 | 10.00 | 1.80 |
| sjlofr | share coded 1 = 0.469 | 99% | 162 | 2 | 0 | 0.469 | 0 | 1.00 | 0.501 |
| logpgp95 | min 6.11, median 8.27, max 10.3 | 91% | 148 | 138 | 6.11 | 8.30 | 8.27 | 10.29 | 1.11 |
| logem4 | min 0.936, median 4.44, max 7.99 | 53% | 87 | 44 | 0.936 | 4.60 | 4.44 | 7.99 | 1.30 |
| baseco | share coded 1 = 1.000 | 39% | 64 | 1 | 1.00 | 1.00 | 1.00 | 1.00 | 0 |
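
The robustness note for this dataset describes adding colonizer, legal-origin, and religion controls to the IV regression. Mechanically, an exogenous control enters both the structural equation and the instrument set. The following is a minimal sketch of that idea, assuming a hand-rolled 2SLS in NumPy on simulated data; it is not the tutorial's Stata code and does not use the AJR files. The function name `two_sls` and every simulated coefficient are hypothetical.

```python
import numpy as np

def two_sls(y, x_endog, z_excl, controls):
    """Manual 2SLS point estimates, ordered [constant, endogenous regressor, controls...]."""
    const = np.ones((len(y), 1))
    X = np.column_stack([const, x_endog, controls])  # structural regressors
    Z = np.column_stack([const, z_excl, controls])   # instruments, incl. exogenous controls
    # First stage: project every column of X on the instrument set.
    X_hat = Z @ np.linalg.lstsq(Z, X, rcond=None)[0]
    # Second stage: regress y on the fitted regressors.
    return np.linalg.lstsq(X_hat, y, rcond=None)[0]

# Simulated data (assumption): the variable roles mimic the dictionary, the values do not.
rng = np.random.default_rng(0)
n = 5000
z = rng.normal(size=n)                            # plays the role of logem4
w = rng.binomial(1, 0.3, size=n).astype(float)    # plays the role of a dummy such as f_brit
u = rng.normal(size=n)                            # unobserved confounder
x = -0.8 * z + 0.5 * w + u + rng.normal(size=n)   # plays the role of avexpr
y = 1.0 * x + 0.3 * w - 2.0 * u + rng.normal(size=n)  # true effect of x is 1.0

beta_iv = two_sls(y, x, z, w)
beta_ols = np.linalg.lstsq(np.column_stack([np.ones(n), x, w]), y, rcond=None)[0]
print("2SLS:", beta_iv[1], "OLS:", beta_ols[1])
```

In this simulation the confounder pulls the OLS coefficient well below the true value of 1.0, while the 2SLS estimate stays close to it. Standard errors are deliberately left out; a real analysis would rely on an established IV routine rather than this sketch.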

maketable6 · country (cross-section) · 163 × 29 · GDP 1995; geography time-invariant · ~163 rows; 64 base-sample ex-colonies

Panel key: shortnam · Robustness: control for temperature, humidity, soil, minerals, landlock, ethnic fractionalization.

Variable dictionary

| Variable | Label | Definition | Construction | Units | Source | Coverage |
|---|---|---|---|---|---|---|
| shortnam (identifier) | Country code (3-letter) | Three-letter country identifier; the row key in every dataset. | AJR country abbreviation (e.g. AUS, USA, NGA); used for scatter point labels (mlabel). | string | AJR (2001) | all files |
| avelf (continuous) | Ethnolinguistic fractionalization | Average ethnolinguistic fractionalization index (probability two random people differ). | Average of 5 fractionalization indicators (Easterly-Levine). | 0-1 | AJR (2001) | table 6 |
| lat_abst (continuous) | Absolute latitude (scaled 0-1) | Absolute latitude of the capital, divided by 90 (a geography control). | abs(latitude of capital) / 90. | 0-1 | AJR (2001) | tables 2, 3, 4, 5, 6, 7, 8 |
| temp1 (continuous) | Temperature indicator 1 (of 5) | First of five temperature indices used as climate controls. | AJR temperature indicator (degrees C scale). | index (deg C) | AJR (2001) | table 6 |
| temp2 (continuous) | Temperature indicator 2 (of 5) | Second of five temperature indices (climate control). | AJR temperature indicator. | index (deg C) | AJR (2001) | table 6 |
| temp3 (continuous) | Temperature indicator 3 (of 5) | Third of five temperature indices (climate control). | AJR temperature indicator. | index (deg C) | AJR (2001) | table 6 |
| temp4 (continuous) | Temperature indicator 4 (of 5) | Fourth of five temperature indices (climate control). | AJR temperature indicator. | index (deg C) | AJR (2001) | table 6 |
| temp5 (continuous) | Temperature indicator 5 (of 5) | Fifth of five temperature indices (climate control). | AJR temperature indicator. | index (deg C) | AJR (2001) | table 6 |
| humid1 (continuous) | Humidity indicator 1 (of 4) | First of four humidity indices used as climate controls. | AJR humidity indicator. | index | AJR (2001) | table 6 |
| humid2 (continuous) | Humidity indicator 2 (of 4) | Second of four humidity indices (climate control). | AJR humidity indicator. | index | AJR (2001) | table 6 |
| humid3 (continuous) | Humidity indicator 3 (of 4) | Third of four humidity indices (climate control). | AJR humidity indicator. | index | AJR (2001) | table 6 |
| humid4 (continuous) | Humidity indicator 4 (of 4) | Fourth of four humidity indices (climate control). | AJR humidity indicator. | index | AJR (2001) | table 6 |
| steplow (dummy) | Steppe (low) soil dummy | First of six soil/climate-zone indicators (1 if low-latitude steppe). | Binary soil/climate-zone indicator. | 0/1 | AJR (2001) | table 6 |
| deslow (dummy) | Desert (low) soil dummy | Soil/climate-zone indicator (1 if low-latitude desert). | Binary soil/climate-zone indicator. | 0/1 | AJR (2001) | table 6 |
| stepmid (dummy) | Steppe (mid) soil dummy | Soil/climate-zone indicator (1 if mid-latitude steppe). | Binary soil/climate-zone indicator. | 0/1 | AJR (2001) | table 6 |
| desmid (dummy) | Desert (mid) soil dummy | Soil/climate-zone indicator (1 if mid-latitude desert). | Binary soil/climate-zone indicator. | 0/1 | AJR (2001) | table 6 |
| drystep (dummy) | Dry-steppe soil dummy | Soil/climate-zone indicator (1 if dry steppe). | Binary soil/climate-zone indicator. | 0/1 | AJR (2001) | table 6 |
| drywint (dummy) | Dry-winter climate dummy | Soil/climate-zone indicator (1 if dry-winter climate). | Binary soil/climate-zone indicator. | 0/1 | AJR (2001) | table 6 |
| edes1975 (continuous) | European descent in 1975 (%) | Percent of the population of European descent in 1975. | Percent European descent, 1975. | % (0-100) | AJR (2001) | table 6 |
| avexpr (continuous) | Avg protection against expropriation risk | Average index of protection against expropriation of private investment, ~1985-95 (the endogenous regressor X, 'modern institutions'). | Mean over available years of the Political Risk Services expropriation-risk index, scaled 0 (worst) to 10 (best). | 0-10 scale | AJR (2001), from Political Risk Services | all files |
| logpgp95 (continuous) | Log GDP per capita, PPP, 1995 | Natural log of 1995 PPP GDP per capita (the outcome Y). | log of World Bank PPP GDP per capita, 1995. | log US$ (PPP) | AJR (2001), from World Bank | all files |
| landlock (dummy) | Landlocked dummy | 1 if the country is landlocked (geography control). | Binary indicator. | 0/1 | AJR (2001) | table 6 |
| goldm (continuous) | Gold mineral measure | First of five mineral-resource measures (gold). | Resource-quantity measure, AJR (not a 0/1 dummy). | resource units | AJR (2001) | table 6 |
| iron (continuous) | Iron mineral measure | Iron mineral-resource measure (geology control). | Resource-quantity measure, AJR. | resource units | AJR (2001) | table 6 |
| silv (continuous) | Silver mineral measure | Silver mineral-resource measure (geology control). | Resource-quantity measure, AJR. | resource units | AJR (2001) | table 6 |
| zinc (continuous) | Zinc mineral measure | Zinc mineral-resource measure (geology control). | Resource-quantity measure, AJR. | resource units | AJR (2001) | table 6 |
| oilres (continuous) | Oil-reserves measure | Oil-reserves measure (resource control). | Resource-quantity measure, AJR. | resource units | AJR (2001) | table 6 |
| logem4 (continuous) | Log settler mortality | Natural log of European settler/soldier mortality during early colonization (the instrument Z). | log of corrected annualized deaths per 1,000 (extmort4). | log deaths per 1,000 | AJR (2001), from Curtin and related sources | tables 1, 3, 4, 5, 6, 7, 8 |
| baseco (dummy) | Base-sample flag (1 = AJR base sample) | Indicator for the 64-country base sample of ex-colonies with valid settler-mortality data. | 1 for base-sample countries; MISSING (not 0) otherwise. Restrict with keep if baseco==1. | 1/missing | AJR (2001) | tables 1, 2, 4, 5, 6, 7, 8 |

Distribution & statistics

| Variable | Distribution | Coverage | N | Distinct | Min | Mean | Median | Max | SD |
|---|---|---|---|---|---|---|---|---|---|
| shortnam | | 100% | 163 | 163 | | | | | |
| avelf | min 0, median 0.275, max 1 | 80% | 131 | 127 | 0 | 0.360 | 0.275 | 1.00 | 0.306 |
| lat_abst | min 0, median 0.267, max 0.722 | 99% | 162 | 96 | 0 | 0.296 | 0.267 | 0.722 | 0.190 |
| temp1 | min -4, median 21, max 32 | 98% | 159 | 32 | -4.00 | 18.91 | 21.00 | 32.00 | 8.14 |
| temp2 | min -6, median 26, max 40 | 98% | 159 | 35 | -6.00 | 24.08 | 26.00 | 40.00 | 9.38 |
| temp3 | min 23, median 38, max 49 | 98% | 159 | 24 | 23.00 | 38.25 | 38.00 | 49.00 | 5.07 |
| temp4 | min -44, median 0, max 20 | 98% | 159 | 48 | -44.00 | -2.64 | 0 | 20.00 | 16.50 |
| temp5 | min 1, median 15, max 26 | 98% | 159 | 25 | 1.00 | 14.22 | 15.00 | 26.00 | 6.59 |
| humid1 | min 18, median 70, max 97 | 98% | 159 | 54 | 18.00 | 67.57 | 70.00 | 97.00 | 16.20 |
| humid2 | min 54, median 88, max 98 | 98% | 159 | 30 | 54.00 | 86.22 | 88.00 | 98.00 | 7.37 |
| humid3 | min 10, median 52, max 86 | 98% | 159 | 56 | 10.00 | 49.08 | 52.00 | 86.00 | 16.00 |
| humid4 | min 35, median 74, max 92 | 98% | 159 | 40 | 35.00 | 73.36 | 74.00 | 92.00 | 10.19 |
| steplow | share coded 1 = 0.208 | 98% | 159 | 2 | 0 | 0.208 | 0 | 1.00 | 0.407 |
| deslow | share coded 1 = 0.189 | 98% | 159 | 2 | 0 | 0.189 | 0 | 1.00 | 0.392 |
| stepmid | share coded 1 = 0.057 | 98% | 159 | 2 | 0 | 0.057 | 0 | 1.00 | 0.232 |
| desmid | share coded 1 = 0.025 | 98% | 159 | 2 | 0 | 0.025 | 0 | 1.00 | 0.157 |
| drystep | share coded 1 = 0.038 | 98% | 159 | 2 | 0 | 0.038 | 0 | 1.00 | 0.191 |
| drywint | share coded 1 = 0.006 | 98% | 159 | 2 | 0 | 0.006 | 0 | 1.00 | 0.079 |
| edes1975 | min 0, median 0, max 100 | 96% | 156 | 21 | 0 | 32.18 | 0 | 100.0 | 43.67 |
| avexpr | min 1.64, median 7.05, max 10 | 74% | 121 | 80 | 1.64 | 7.07 | 7.05 | 10.00 | 1.80 |
| logpgp95 | min 6.11, median 8.27, max 10.3 | 91% | 148 | 138 | 6.11 | 8.30 | 8.27 | 10.29 | 1.11 |
| landlock | share coded 1 = 0.196 | 100% | 163 | 2 | 0 | 0.196 | 0 | 1.00 | 0.398 |
| goldm | min 0, median 0, max 47 | 98% | 159 | 6 | 0 | 0.421 | 0 | 47.00 | 3.84 |
| iron | min 0, median 0, max 16 | 98% | 159 | 11 | 0 | 0.314 | 0 | 16.00 | 1.60 |
| silv | min 0, median 0, max 13 | 98% | 159 | 4 | 0 | 0.346 | 0 | 13.00 | 1.95 |
| zinc | min 0, median 0, max 15 | 98% | 159 | 10 | 0 | 0.503 | 0 | 15.00 | 2.03 |
| oilres | min 0, median 0, max 1.57e+07 | 94% | 154 | 66 | 0 | 295,853 | 0 | 15,700,000 | 1,531,138 |
| logem4 | min 0.936, median 4.44, max 7.99 | 53% | 87 | 44 | 0.936 | 4.60 | 4.44 | 7.99 | 1.30 |
| baseco | share coded 1 = 1.000 | 39% | 64 | 1 | 1.00 | 1.00 | 1.00 | 1.00 | 0 |
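
The columns of the statistics table (Coverage, N, Distinct, Min, Mean, Median, Max, SD) can be recomputed from any loaded file. A minimal sketch, assuming Coverage means the share of non-missing rows and SD is the sample standard deviation (pandas' default, `ddof=1`); the helper name `summarize` and the four-row frame are hypothetical:

```python
import numpy as np
import pandas as pd

def summarize(df):
    """Per-variable summary for numeric columns, mirroring the table's column names."""
    num = df.select_dtypes("number")
    return pd.DataFrame({
        "coverage": num.notna().mean(),  # share of rows with a non-missing value
        "n": num.count(),                # non-missing observations
        "distinct": num.nunique(),       # distinct non-missing values
        "min": num.min(),
        "mean": num.mean(),
        "median": num.median(),
        "max": num.max(),
        "sd": num.std(),                 # sample SD (ddof=1), an assumption
    })

# Hypothetical four-row frame (illustrative values only).
toy = pd.DataFrame({
    "shortnam": ["AAA", "BBB", "CCC", "DDD"],
    "africa": [1.0, 0.0, 0.0, 1.0],
    "logem4": [4.0, np.nan, 6.0, np.nan],
})
stats = summarize(toy)
print(stats)
```

String columns such as `shortnam` are skipped here, which matches the table leaving their numeric cells blank.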

maketable7 · country (cross-section) · 163 × 15 · GDP 1995; health 1994-95 · ~163 rows; 62-64 base-sample ex-colonies

Panel key: shortnam · Robustness/threat: control for malaria, life expectancy, infant mortality, and add coastal/temperature/latitude instruments for overidentified GMM.

Variable dictionary

| Variable | Label | Definition | Construction | Units | Source | Coverage |
|---|---|---|---|---|---|---|
| shortnam (identifier) | Country code (3-letter) | Three-letter country identifier; the row key in every dataset. | AJR country abbreviation (e.g. AUS, USA, NGA); used for scatter point labels (mlabel). | string | AJR (2001) | all files |
| africa (dummy) | Africa dummy | 1 if the country is in Africa (continent control). | Binary continent indicator. | 0/1 | AJR (2001) | tables 2, 4, 7 |
| lat_abst (continuous) | Absolute latitude (scaled 0-1) | Absolute latitude of the capital, divided by 90 (a geography control). | abs(latitude of capital) / 90. | 0-1 | AJR (2001) | tables 2, 3, 4, 5, 6, 7, 8 |
| malfal94 (continuous) | Falciparum malaria index, 1994 | Index of falciparum-malaria prevalence in 1994 (modern health channel). | Share of population at risk of falciparum malaria, 1994 (0-1). | 0-1 | AJR (2001), from Gallup-Sachs | table 7 |
| avexpr (continuous) | Avg protection against expropriation risk | Average index of protection against expropriation of private investment, ~1985-95 (the endogenous regressor X, 'modern institutions'). | Mean over available years of the Political Risk Services expropriation-risk index, scaled 0 (worst) to 10 (best). | 0-10 scale | AJR (2001), from Political Risk Services | all files |
| logpgp95 (continuous) | Log GDP per capita, PPP, 1995 | Natural log of 1995 PPP GDP per capita (the outcome Y). | log of World Bank PPP GDP per capita, 1995. | log US$ (PPP) | AJR (2001), from World Bank | all files |
| logem4 (continuous) | Log settler mortality | Natural log of European settler/soldier mortality during early colonization (the instrument Z). | log of corrected annualized deaths per 1,000 (extmort4). | log deaths per 1,000 | AJR (2001), from Curtin and related sources | tables 1, 3, 4, 5, 6, 7, 8 |
| asia (dummy) | Asia dummy | 1 if the country is in Asia (continent control). | Binary continent indicator. | 0/1 | AJR (2001) | tables 2, 4, 7 |
| yellow (dummy) | Yellow-fever vector dummy | 1 if the yellow-fever vector is present today (disease-environment control). | Binary indicator. | 0/1 | AJR (2001) | table 7 |
| baseco (dummy) | Base-sample flag (1 = AJR base sample) | Indicator for the 64-country base sample of ex-colonies with valid settler-mortality data. | 1 for base-sample countries; MISSING (not 0) otherwise. Restrict with keep if baseco==1. | 1/missing | AJR (2001) | tables 1, 2, 4, 5, 6, 7, 8 |
| leb95 (continuous) | Life expectancy at birth, 1995 | Life expectancy at birth in 1995 (modern health channel). | Years; World Bank / WHO. | years | AJR (2001) | table 7 |
| imr95 (continuous) | Infant mortality rate, 1995 | Infant mortality rate in 1995, deaths per 1,000 live births (modern health channel). | Deaths per 1,000 live births; World Bank. | per 1,000 births | AJR (2001) | table 7 |
| meantemp (continuous) | Mean temperature (McArthur-Sachs) | Mean annual temperature, used as a geography instrument in overidentified specs. | Degrees Celsius; McArthur & Sachs. | deg C | McArthur & Sachs via AJR | table 7 |
| lt100km (continuous) | Share of land within 100km of coast | Fraction of territory within 100 km of the coast (geography instrument). | Share 0-1; McArthur & Sachs. | 0-1 | McArthur & Sachs via AJR | table 7 |
| latabs (continuous) | Absolute latitude (McArthur-Sachs) | Absolute latitude (0-1 scaled), McArthur-Sachs version, used as a geography instrument. | abs(latitude) / 90, McArthur & Sachs. | 0-1 | McArthur & Sachs via AJR | table 7 |

Distribution & statistics

| Variable | Distribution | Coverage | N | Distinct | Min | Mean | Median | Max | SD |
|---|---|---|---|---|---|---|---|---|---|
| shortnam | | 100% | 163 | 163 | | | | | |
| africa | share coded 1 = 0.307 | 100% | 163 | 2 | 0 | 0.307 | 0 | 1.00 | 0.463 |
| lat_abst | min 0, median 0.267, max 0.722 | 99% | 162 | 96 | 0 | 0.296 | 0.267 | 0.722 | 0.190 |
| malfal94 | min 0, median 0.0005, max 1 | 96% | 157 | 52 | 0 | 0.294 | 5.00e-04 | 1.00 | 0.402 |
| avexpr | min 1.64, median 7.05, max 10 | 74% | 121 | 80 | 1.64 | 7.07 | 7.05 | 10.00 | 1.80 |
| logpgp95 | min 6.11, median 8.27, max 10.3 | 91% | 148 | 138 | 6.11 | 8.30 | 8.27 | 10.29 | 1.11 |
| logem4 | min 0.936, median 4.44, max 7.99 | 53% | 87 | 44 | 0.936 | 4.60 | 4.44 | 7.99 | 1.30 |
| asia | share coded 1 = 0.258 | 100% | 163 | 2 | 0 | 0.258 | 0 | 1.00 | 0.439 |
| yellow | share coded 1 = 0.472 | 100% | 163 | 2 | 0 | 0.472 | 0 | 1.00 | 0.501 |
| baseco | share coded 1 = 1.000 | 39% | 64 | 1 | 1.00 | 1.00 | 1.00 | 1.00 | 0 |
| leb95 | min 37.2, median 65.7, max 79 | 37% | 60 | 59 | 37.24 | 62.08 | 65.70 | 78.98 | 11.43 |
| imr95 | min 4.9, median 49.5, max 170 | 37% | 60 | 59 | 4.90 | 57.07 | 49.45 | 170.0 | 37.71 |
| meantemp | min -0.2, median 24.5, max 29.3 | 37% | 60 | 56 | -0.200 | 23.13 | 24.47 | 29.30 | 4.96 |
| lt100km | min 0, median 0.239, max 1 | 37% | 61 | 48 | 0 | 0.374 | 0.239 | 1.00 | 0.355 |
| latabs | min 0, median 0.15, max 0.667 | 37% | 61 | 40 | 0 | 0.178 | 0.150 | 0.667 | 0.132 |
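
Coverage differs a lot across these variables (the health and McArthur-Sachs columns are observed for only 60 to 61 countries), so the usable sample depends on which controls or instruments a specification includes. One plausible reading of the "62-64" range in the dataset header is exactly this kind of complete-case shrinkage. A minimal sketch of how to count it, assuming a hypothetical six-row frame with made-up values and a hypothetical helper `n_complete`:

```python
import numpy as np
import pandas as pd

# Hypothetical six-row stand-in for maketable7 (illustrative values only).
df = pd.DataFrame({
    "baseco":   [1.0, 1.0, 1.0, 1.0, np.nan, np.nan],
    "logpgp95": [8.1, 7.4, 9.0, 6.9, 9.8, 8.8],
    "avexpr":   [6.5, 5.0, 8.2, 4.1, 9.5, np.nan],
    "logem4":   [4.4, 5.6, 2.7, 6.1, np.nan, np.nan],
    "malfal94": [0.2, 0.9, 0.0, np.nan, 0.0, 0.0],
    "leb95":    [66.0, np.nan, 75.1, 52.3, np.nan, np.nan],
})

def n_complete(frame, cols):
    """Number of base-sample rows with no missing value in any of `cols`."""
    base = frame[frame["baseco"] == 1]
    return len(base.dropna(subset=cols))

core = ["logpgp95", "avexpr", "logem4"]
n_core = n_complete(df, core)
n_malaria = n_complete(df, core + ["malfal94"])
n_life = n_complete(df, core + ["leb95"])
print(n_core, n_malaria, n_life)
```

Running the same count on the real file for each specification's variable list shows how many countries that specification can actually use.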

maketable8 · country (cross-section) · 163 × 12 · settlers 1900; historical institutions 1900 and 1st-year-indep · ~163 rows; 64 base-sample ex-colonies

Panel key: shortnam · Overidentification: pair logem4 with alternative historical-institution instruments and run Hansen J / Panel-D sensitivity.

Variable dictionary

| Variable | Label | Definition | Construction | Units | Source | Coverage |
|---|---|---|---|---|---|---|
| shortnam (identifier) | Country code (3-letter) | Three-letter country identifier; the row key in every dataset. | AJR country abbreviation (e.g. AUS, USA, NGA); used for scatter point labels (mlabel). | string | AJR (2001) | all files |
| lat_abst (continuous) | Absolute latitude (scaled 0-1) | Absolute latitude of the capital, divided by 90 (a geography control). | abs(latitude of capital) / 90. | 0-1 | AJR (2001) | tables 2, 3, 4, 5, 6, 7, 8 |
| euro1900 (continuous) | European settlers in 1900 (% of pop.) | Share of the population that was of European descent in 1900 (also used as an alternative instrument). | Percent European in 1900, AJR. | % (0-100) | AJR (2001) | tables 1, 3, 8 |
| avexpr (continuous) | Avg protection against expropriation risk | Average index of protection against expropriation of private investment, ~1985-95 (the endogenous regressor X, 'modern institutions'). | Mean over available years of the Political Risk Services expropriation-risk index, scaled 0 (worst) to 10 (best). | 0-10 scale | AJR (2001), from Political Risk Services | all files |
| logpgp95 (continuous) | Log GDP per capita, PPP, 1995 | Natural log of 1995 PPP GDP per capita (the outcome Y). | log of World Bank PPP GDP per capita, 1995. | log US$ (PPP) | AJR (2001), from World Bank | all files |
| democ1 (continuous) | Democracy, 1st year of independence | Polity democracy score in the country's first year of independence (alternative instrument). | Polity democracy index, 0 (low) to 10 (high). | 0-10 scale | Polity via AJR | table 8 |
| cons1 (continuous) | Constraint on executive, 1st year of independence | Polity constraint-on-the-executive score in the country's first year of independence. | Polity index, 1 (low) to 7 (high) constraint. | 1-7 scale | Polity via AJR | tables 1, 3, 8 |
| indtime (continuous) | Years independent (1995 - first year) | Number of years a country had been independent by 1995. | 1995 minus the first year of independence. | years | AJR (2001) | tables 3, 8 |
| democ00a (continuous) | Democracy in 1900 | Polity democracy score in 1900 (a historical-institution / alternative-instrument variable). | Polity democracy index, 0 (low) to 10 (high). | 0-10 scale | Polity via AJR | tables 1, 3, 8 |
| cons00a (continuous) | Constraint on executive in 1900 | Polity constraint-on-the-executive score in 1900 (historical-institution / alternative instrument). | Polity index, 1 (low) to 7 (high) constraint. | 1-7 scale | Polity via AJR | tables 1, 3, 8 |
| logem4 (continuous) | Log settler mortality | Natural log of European settler/soldier mortality during early colonization (the instrument Z). | log of corrected annualized deaths per 1,000 (extmort4). | log deaths per 1,000 | AJR (2001), from Curtin and related sources | tables 1, 3, 4, 5, 6, 7, 8 |
| baseco (dummy) | Base-sample flag (1 = AJR base sample) | Indicator for the 64-country base sample of ex-colonies with valid settler-mortality data. | 1 for base-sample countries; MISSING (not 0) otherwise. Restrict with keep if baseco==1. | 1/missing | AJR (2001) | tables 1, 2, 4, 5, 6, 7, 8 |

Distribution & statistics

| Variable | Distribution | Coverage | N | Distinct | Min | Mean | Median | Max | SD |
|---|---|---|---|---|---|---|---|---|---|
| shortnam | | 100% | 163 | 163 | | | | | |
| lat_abst | min 0, median 0.267, max 0.722 | 99% | 162 | 96 | 0 | 0.296 | 0.267 | 0.722 | 0.190 |
| euro1900 | min 0, median 1.95, max 100 | 94% | 154 | 26 | 0 | 30.47 | 1.95 | 100.0 | 42.39 |
| avexpr | min 1.64, median 7.05, max 10 | 74% | 121 | 80 | 1.64 | 7.07 | 7.05 | 10.00 | 1.80 |
| logpgp95 | min 6.11, median 8.27, max 10.3 | 91% | 148 | 138 | 6.11 | 8.30 | 8.27 | 10.29 | 1.11 |
| democ1 | min 0, median 1, max 10 | 53% | 87 | 11 | 0 | 3.37 | 1.00 | 10.00 | 3.67 |
| cons1 | min 1, median 3, max 7 | 54% | 88 | 6 | 1.00 | 3.59 | 3.00 | 7.00 | 2.41 |
| indtime | min 5, median 37, max 195 | 54% | 88 | 46 | 5.00 | 77.17 | 37.00 | 195.0 | 62.14 |
| democ00a | min 0, median 0, max 10 | 53% | 87 | 9 | 0 | 1.15 | 0 | 10.00 | 2.58 |
| cons00a | min 1, median 1, max 7 | 56% | 91 | 6 | 1.00 | 1.86 | 1.00 | 7.00 | 1.82 |
| logem4 | min 0.936, median 4.44, max 7.99 | 53% | 87 | 44 | 0.936 | 4.60 | 4.44 | 7.99 | 1.30 |
| baseco | share coded 1 = 1.000 | 39% | 64 | 1 | 1.00 | 1.00 | 1.00 | 1.00 | 0 |
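
The overidentification note for this dataset pairs `logem4` with a second instrument and checks whether the two agree. As a rough illustration of the logic, the sketch below computes the Sargan statistic (n times the R² from regressing 2SLS residuals on all instruments), which is the homoskedastic analogue of Hansen's J and not the robust J statistic itself. It runs on simulated data, not the AJR files; the function name `sargan_stat` and every coefficient are hypothetical.

```python
import numpy as np

def sargan_stat(y, X, Z):
    """Sargan statistic and its degrees of freedom.
    X: structural regressors (incl. constant); Z: all instruments (incl. constant)."""
    X_hat = Z @ np.linalg.lstsq(Z, X, rcond=None)[0]        # first-stage fitted values
    beta = np.linalg.lstsq(X_hat, y, rcond=None)[0]         # 2SLS coefficients
    resid = y - X @ beta                                    # residuals use actual X
    fitted = Z @ np.linalg.lstsq(Z, resid, rcond=None)[0]   # residuals projected on Z
    r2 = (fitted @ fitted) / (resid @ resid)
    return len(y) * r2, Z.shape[1] - X.shape[1]

# Simulated data (assumption): two instruments, one endogenous regressor.
rng = np.random.default_rng(1)
n = 5000
z1, z2, u = rng.normal(size=n), rng.normal(size=n), rng.normal(size=n)
x = z1 + z2 + u + rng.normal(size=n)
y_valid = 1.0 * x - u + rng.normal(size=n)   # both instruments excluded from y
y_invalid = y_valid + 1.0 * z2               # z2 now affects y directly

X = np.column_stack([np.ones(n), x])
Z = np.column_stack([np.ones(n), z1, z2])
j_ok, dof = sargan_stat(y_valid, X, Z)
j_bad, _ = sargan_stat(y_invalid, X, Z)
print(dof, j_ok, j_bad)
```

With one overidentifying restriction the statistic is compared against a chi-squared distribution with one degree of freedom. In the simulation the invalid-instrument case produces a far larger value than the valid case, which is the pattern the test is designed to detect.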

Known limitations & caveats