Resilient Data Futures
EvidenceE-0049draft

INSDC — 53.9 trillion bases across 3 continents, continuously operational since 1980s

§6.12026-05-033 out · 0 in

The International Nucleotide Sequence Database Collaboration maintains three mirrored databases on three continents — NCBI (US), EMBL-EBI (UK), DDBJ (Japan) — synchronized daily through a shared Feature Table format (S-0012).

Key facts:

  • 53.9 trillion bases
  • 6.27 billion records
  • Continuous operation since the 1980s
  • Any single node can go down without data loss because the other two hold complete copies

INSDC is the canonical example of Tier 2 working as intended at scale. Resilience depends on the three institutions continuing to coordinate and fund their operations — exactly the dependency C-0012 identifies. As of 2026 that coordination has held for nearly 40 years, demonstrating that Tier 2 can deliver multi-decade resilience under stable governance.

The case is also useful as a contrast for E-0025 (GISAID): identical technical capability, opposite governance pattern. INSDC's federated 3-institution structure produces the resilience GISAID's centralized single-organization structure does not.