arXiv Preprint Infrastructure (1991)
infrastructure pace layer · 1991–ongoing
lifespan: 200 yrs · motor: push
Open-access preprint server founded 14 August 1991 by Paul Ginsparg at Los Alamos National Laboratory (originally xxx.lanl.gov, hep-th subdiscipline). Email-based distribution replaced by HTTP server 1993; moved to Cornell University Library 2001; officially housed under arXiv.org. Purpose: eliminate journal-paywall delay and accelerate physics/math/CS knowledge diffusion through instantaneous open posting of preprint manuscripts before peer review. Funded by Cornell + Simons Foundation ($2.5M/yr 2019+) + ~200 institutional sponsors. Cumulative 2.3M+ papers by 2024; deposit rate ~200k/yr. Domain expansion: math 1991+, cs 1993+, q-bio 2003, q-fin 2009, physics subfields 1996+; bioRxiv (Cold Spring Harbor Lab, Nov 2013, John Inglis); medRxiv (BMJ + Yale + CSHL, June 2019); ChemRxiv (2017+); EarthArXiv (2017+); SocArXiv (2016+). Open-access norm catalysts: NIH Public Access Policy 2008; EU Plan S 2018. AI/ML paper explosion: Transformer paper (Vaswani et al. 2017, arXiv:1706.03762) initiated mass AI preprint culture; NeurIPS 2017+ conferences became arXiv-first. LLM training: OpenAI, Anthropic, Google DeepMind all train on arXiv corpus. DM-27 diagnostic (w0): capture_resistance HIGH; argument_of_progress HIGH. This is one of the few DM-Day cards warranting dm_current=divergentism — the preprint norm is a positive DM-Day case with capacity intact, narrative coherent, and no successful publisher capture (Elsevier's SSRN acquisition 2016 did not capture arXiv physics/math/CS core). DeLanda coding: LaTeX standardization + submission metadata schema are strongly coded rules. V0.2 GAP: EmergenceSubtype (§11.43) not applicable — motor is push (small-scale distributed researcher force), not emergence. V0.2 GAP: arXiv-identifier-scheme is a proto-standard not yet a Latour OPP candidate for full TranslationEvent; deferred to Phase 2.
Machine type
incorporeal
Plasticity
plastic
Substrate
Wave source
wave-9-atlas
Inputs
- researcher_preprints
- cornell_institutional_hosting_and_administration
- nsf_simons_foundation_funding
- latex_standardization_substrate
Outputs
- open_access_preprint_papers
- open_access_preprint_norm
- citation_graph_infrastructure
- llm_training_corpus_preprints
Landscape pressures
- llm-training-corpus-extraction-pressure (55% intensity)
- journal-publisher-platform-capture-attempt (40% intensity)
- ai-generated-preprint-quality-degradation (60% intensity)
Intra-era couplings
- parallel_class Wikipedia (2001) · 0.65 CANON
- complemented_by GitHub Code-Collaboration Platform (2008) · 0.80 CANON
- substrate_provision OpenAI Foundation Model Lab (2015) · 0.88 CANON
- substrate_provision IPCC Climate Science Machine (1988) · 0.70 CANON
Cross-era couplings
- substrate_provision Post-Humboldtian Research University (1810) · 0.85 CANON
- sublimation_coupling Wire Service Journalism: AP / Reuters (1846) · 0.60
- substrate_provision LLM Inference Platform (class, 2022–present) · 0.85 CANON
State variables
Phase snapshots
Notable instances
- arXiv hep-th (High Energy Physics — Theory, 1991) (1991) — Origin discipline; Ginsparg's founding subdirectory at LANL; the root instance of the entire arXiv system.
- arXiv cs (Computer Science, 1993+) (1993) — CS/AI/ML papers; became dominant by volume post-2017 (Transformer paper effect); NeurIPS/ICML submission norm.
- bioRxiv (2013, Cold Spring Harbor Laboratory) (2013) — Cold Spring Harbor Lab spin-out; John Inglis; biology preprint server; COVID-19 pandemic 2020 validation (rapid vaccine-…
- medRxiv (2019, BMJ + Yale + CSHL) (2019) — Medical preprints; BMJ Group + Yale University + Cold Spring Harbor Lab joint governance; COVID-19 preprint surge 2020.
- ChemRxiv (2017+, American Chemical Society) (2017) — Chemistry preprints; ACS + Royal Society of Chemistry + German Chemical Society + others; later governance.
- SocArXiv (2016+, Center for Open Science) (2016) — Social science preprints; Center for Open Science (OSF) hosting; adoption slower in social sciences vs. STEM.
Sources
- Ginsparg, Paul (1996). Winners and Losers in the Global Research Village (UNESCO Expert Conference 1996) · 90%
- Suber, Peter (2012). Open Access (MIT Press) · 88%
- Pinfield, Stephen (2024). Scholarly Communication and Open Access · 82%
- arXiv (2024). arXiv Annual Reports 2019-2024 · 90%
- Atlas (Prime Radiant) (2026). research/09-atlas/dm-mm-industrial-stubs/findings.md (DM-27) · 85%