Capture-Resistant AI Infrastructure (LM-Dawn class, 2020–present)
infrastructure pace layer · 2022–ongoing
lifespan: 200 yrs
Class card for LM-Dawn open-weight AI infrastructure collectives that are structurally distinct from proprietary inference platforms by three institutional design commitments: (1) open model weights published under permissive or copyleft-adjacent licenses (Apache 2.0, MIT, or custom community licenses) so that any actor may run, modify, and redistribute the model without permission or fee; (2) community governance rather than shareholder governance — steering committees, academic advisory boards, or foundation-style governance (not board of directors optimizing for capture); (3) non-proprietary deployment infrastructure — models deployable on commodity hardware (laptops, local servers, community GPU clusters) rather than requiring hyperscaler-scale compute. Canonical instances: Meta Llama family (1–4, 2023–2025, Apache-2.0 / Llama Community License), Google Gemma (Apache 2.0), Mistral 7B / Mixtral (Apache 2.0), DeepSeek-V2/R1 (MIT), EleutherAI / GPT-NeoX (Apache 2.0), BLOOM 176B (BigScience Open RAIL), Falcon (TII), StableLM (Apache 2.0). These models collectively represent the heretic class within LM-Dawn AI machinery: they exhibit the LM-Dawn structural logic (open weights = non-rivalrous cognitive substrate) while using DM-era institutions (Meta, Google, Mistral, DeepSeek corporate entities) as their launchers — a classic heretic-within-incumbent pattern. The LM-Dawn target form is civic-stewarded: Hugging Face model hub as distribution commons, distributed fine-tuning collectives, on-device inference communities (llama.cpp ecosystem, Ollama, Jan.ai). Distinguishing from machine:llm-inference-platform-class: the broader inference-platform class includes both proprietary (OpenAI, Anthropic) and open-weight providers; capture_resistance_index is LOW (0.30 [EXTRAP]) on that class because compute concentration is the binding constraint. This card captures the heretic sub-class where the binding constraint shifts to governance and licensing: once weights are public under permissive license, any actor can run them on commodity hardware. capture_resistance_index is HIGH (0.75 [EXTRAP]) here because structural capture requires relicensing (contractually difficult) or breaking the open-weights deployment chain (technically futile once weights are public). Proletarianization signal (Stiegler oq-6-14): HIGH (0.70 [EXTRAP]). Open-weight distribution inscribes model competence into tertiary retention artifacts (parameter tensors, tokenizer configs, training codebases), but the competence to reproduce, fine-tune, evaluate, and align these weights requires living ML-engineering skills. As fine-tuning and deployment tooling commoditizes (AutoTrain, LM Studio, Ollama), the re-internalization loop thins: practitioners use open-weights as black boxes without understanding data curation, RLHF, or evaluation methodology. Proletarianization is HIGH but lower than proprietary-inference (0.75) because the open-weights community (EleutherAI, together.ai, the-big-science consortium) maintains stronger practitioner-competence cultivation norms. emergence_subtype: crowdsourced (community governance, volunteer fine-tuning collectives, open-eval leaderboards like LMSYS Arena / Open LLM Leaderboard). [Field missing from Machine model — Phase-0 oversight; placed in description per workaround in session 3.d embedded constraints.] Cross-era position: hostile_inheritance from DM proprietary-platform OpenAI and DM compute-infrastructure AWS; substrate_provision from MM Humboldtian university (algorithmic foundations, academic ML research) and MM electrical grid (inference power); mutualistic_coupling with DM Meta (Meta releases Llama as heretic to weaken OpenAI's OPP while the heretic ecosystem returns mindshare and ecosystem legitimacy); black_hole_dependency on TSMC advanced semiconductor manufacturing (open-weight inference still requires high-FLOP GPUs at training time). All quantitative state-variable values are [EXTRAP]; framing is [CANON] per Wave-6 heretic-finder seed and Wave-0 LM definitions.
Machine type
incorporeal
Plasticity
plastic
Substrate
Wave source
wave-6-cross-era-coupling-typology-heretic-finder
Inputs
- open_weight_model_releases
- gpu_compute_electricity_training_time
- community_contributor_labor
- open_licenses_and_governance_documents
Outputs
- open_weight_model_artifacts
- capture_resistant_cognitive_substrate
- community_evaluation_and_alignment_norms
Landscape pressures
- corporate_relicensing_capture_attempt (72% intensity)
- compute_concentration_training_dependency (85% intensity)
- alignment_safety_community_governance_fragmentation (65% intensity)
Intra-era couplings
- substrate_for Open-Source Software Ecosystem (LM-Dawn class) · 0.85
- negates LLM Inference Platform (class, 2022–present) · 0.70 EXTRAP
Cross-era couplings
- hostile_inheritance OpenAI Foundation Model Lab (2015) · 0.68 EXTRAP
- substrate_provision AWS Cloud Infrastructure (Amazon Web Services, 2006) · 0.72
- substrate_provision Linux / Open-Source Ecosystem (1991) · 0.88 CANON
- substrate_provision GitHub Code-Collaboration Platform (2008) · 0.80 CANON
- parasitic_extraction Post-Humboldtian Research University (1810) · 0.68 EXTRAP
- mutualistic_coupling Meta Platforms (Social-Media Platform, 2004) · 0.55 EXTRAP
- black_hole_dependency TSMC Advanced Semiconductor Foundry (1987) · 0.90
- substrate_provision National Electrical Grid (Insull / US Grid, 1882–ongoing) · 0.88
State variables
Phase snapshots
Notable instances
- Meta Llama Family (1–4, 2023–2025) (2023) — Llama 1 (Feb 2023), Llama 2 (Jul 2023), Llama 3 (Apr 2024), Llama 3.1 (Jul 2024), Llama 3.2 (Sep 2024). Custom Llama Com…
- BLOOM 176B (BigScience, 2022) (2022) — First fully-open large-scale language model: 1,000+ academic researchers across 60 countries; 433 MWh training at Jean Z…
- EleutherAI / GPT-Neo / GPT-NeoX / Pythia (2021–present) (2021) — EleutherAI founding collective (2020); GPT-Neo (1.3B, 2.7B; 2021, Apache 2.0); GPT-NeoX 20B (2022, Apache 2.0); Pythia s…
- Mistral 7B / Mixtral 8x7B (Mistral AI, 2023) (2023) — Mistral 7B (Sep 2023, Apache 2.0); Mixtral 8x7B (Dec 2023, Apache 2.0). EU-headquartered; fully Apache 2.0 without usage…
- DeepSeek-V2 / R1 (DeepSeek AI, 2024–2025) (2024) — DeepSeek-V2 (May 2024, MIT license); DeepSeek-R1 (Jan 2025, MIT license). Chinese corporate-open-weight heretic: MoE arc…
- Google Gemma (Apache 2.0, 2024) (2024) — Gemma 2B, 7B (Feb 2024, Apache 2.0); Gemma 2 9B, 27B (Jun 2024, Apache 2.0). Google's corporate-open-weight heretic anal…
Sources
- Rao (2024). World Machines — civilizational-era framing
- Wave-6 (2026). cross-era-coupling-typology/findings.md §1.4 forward-applications candidate 1
- Stiegler (2016). Automatic Society vol.1
- Bratton (2016). The Stack: On Software and Sovereignty · 82%
- Wave-0 (2026). world-machines-eras findings.md (LM definitions)
- Meta AI (2023). Llama 2: Open Foundation and Fine-Tuned Chat Models · 88%