Resilient Data Futures
QuestionQ-0034draft

What does open access multiply about preserved-data ROI?

§7.62026-05-040 out · 1 in

A subsidiary question under Q-0004 and Q-0014. Q-0014 documented ROI on preserved infrastructure (5:1 to 800:1 across studies); Q-0034 asks what happens to that return when the preserved data can additionally be openly shared.

The answer is C-0032. Open access converts preserved data into reuse at planet scale, compounding the §7.5 ROI:

  • Human Genome Project + genomics: $14.5B federal investment 1988-2012 → $965B economic impact.
  • Landsat: Pre-2008 closed-access policy capped at 53 scenes/day; post-2008 open policy reached 5,700 scenes/day and $25.6B/year economic value by 2023. Same satellites; the access change unlocked the value.
  • Protein Data Bank: 88% of 210 new FDA-approved drugs 2010-2016 facilitated by open PDB structures; 100% of 34 cancer drugs 2019-2023.
  • COVID-19: SARS-CoV-2 genome shared publicly January 10-11 2020; BioNTech Project Lightspeed launched January 27 (17 days later); Pfizer-BioNTech vaccines generated ~$1.9T in global economic value.
  • EU FAIR cost: European Commission estimated cost of not having FAIR research data at €10.2B/year minimum.

Resilient infrastructure delivers preservation that allows data to survive long enough for compounding use to materialize. Openness delivers the reuse that realizes the value. The investment in infrastructure pays off in both cases; the open-data multiplier compounds the return where the data can be opened.