Data & datasets
Everything behind this site is open — plus a practical catalog of the large datasets a serious depression-exposome program can mine.
Download this project's data
56 factors · grades · mechanisms Master source register (CSV)
2258 sources · type · URL World-map data (JSON)
185 countries · prevalence · notes WHO prevalence (CSV)
country · % · source Factor data (JSON)
powers the interactive table Comprehensive synthesis (PDF) Comprehensive synthesis (Word) Gaps & Frontiers report (PDF) Gaps & Frontiers report (Word) Deep Dive (PDF)
verdicts · protocols · convergence Deep Dive (Word) Strengthening the Study (PDF)
self-critique · PAFs · new causes · prevention Strengthening the Study (Word)
Large datasets to mine
Mega-cohorts / biobanks
UK Biobank (~500k; genetics, 1,500-field exposome, imaging, metabolomics) · All of Us (~800k, diverse, EHR+WGS) · MoBa, ALSPAC, Generation Scotland (developmental/family) · NESDA (deep biomarkers).
Epidemiology / surveillance
NHANES (open; ~250 chemical analytes + PHQ-9 — the chemical-exposome engine) · IHME GBD / GHDx & WHO GHO (country burden).
Genetics
PGC MDD summary statistics (open; MR instruments) · FinnGen (register-linked) · Million Veteran Program (diverse).
Microbiome
American Gut / Microsetta · Dutch Microbiome Project / LifeLines — for MbWAS and microbiome–metabolome–depression triangulation.
Environmental layers
US EPA air quality · ACAG global satellite PM2.5 · VIIRS night-lights (light-at-night) — link to any geocoded cohort.
Exposome initiatives
EXPANSE (EU, tens of millions) · HHEAR (NIEHS untargeted chemical profiling) — ExWAS at scale.
Data dictionary — factor matrix
- domain — Biological, Environmental, Diet & gut, Lifestyle, Psychosocial.
- direction — Risk, Protective, Marker.
- evidence — Causal (strong) · Robust · Moderate · Emerging · Contested.
- mechanism / key_finding / confounders / note / detail — plain-language summary and caveats.