Data & datasets

Everything behind this site is open — plus a practical catalog of the large datasets a serious depression-exposome program can mine.

Download this project's data

Exposure × evidence matrix (CSV)
56 factors · grades · mechanisms Master source register (CSV)
2258 sources · type · URL World-map data (JSON)
185 countries · prevalence · notes WHO prevalence (CSV)
country · % · source Factor data (JSON)
powers the interactive table Comprehensive synthesis (PDF) Comprehensive synthesis (Word) Gaps & Frontiers report (PDF) Gaps & Frontiers report (Word) Deep Dive (PDF)
verdicts · protocols · convergence Deep Dive (Word) Strengthening the Study (PDF)
self-critique · PAFs · new causes · prevention Strengthening the Study (Word)

Large datasets to mine

Mega-cohorts / biobanks

UK Biobank (~500k; genetics, 1,500-field exposome, imaging, metabolomics) · All of Us (~800k, diverse, EHR+WGS) · MoBa, ALSPAC, Generation Scotland (developmental/family) · NESDA (deep biomarkers).

Epidemiology / surveillance

NHANES (open; ~250 chemical analytes + PHQ-9 — the chemical-exposome engine) · IHME GBD / GHDx & WHO GHO (country burden).

Genetics

PGC MDD summary statistics (open; MR instruments) · FinnGen (register-linked) · Million Veteran Program (diverse).

Microbiome

American Gut / Microsetta · Dutch Microbiome Project / LifeLines — for MbWAS and microbiome–metabolome–depression triangulation.

Environmental layers

US EPA air quality · ACAG global satellite PM2.5 · VIIRS night-lights (light-at-night) — link to any geocoded cohort.

Exposome initiatives

EXPANSE (EU, tens of millions) · HHEAR (NIEHS untargeted chemical profiling) — ExWAS at scale.

Best triangulation: ExWAS discovery in NHANES → replicate in UK Biobank → diverse replication in All of Us; MR using PGC + FinnGen + MVP instruments (check cross-ancestry concordance); developmental chain MoBa → ALSPAC → ABCD.

Data dictionary — factor matrix

domain — Biological, Environmental, Diet & gut, Lifestyle, Psychosocial.
direction — Risk, Protective, Marker.
evidence — Causal (strong) · Robust · Moderate · Emerging · Contested.
mechanism / key_finding / confounders / note / detail — plain-language summary and caveats.

Licensing: factor summaries, grades and map notes are released for reuse with attribution (Objektiv AI · Claude). WHO map data © WHO (GHO, public). Underlying studies remain © their publishers — follow each source link.