LLM Inference Platform (class, 2022–present)
commerce pace layer · 2022–ongoing
lifespan: 100 yrs
Class card for large language model inference providers as a machine type (OpenAI API, Anthropic API, Hugging Face Inference Endpoints, Mistral platform, Google AI Studio/Gemini, Together AI, Replicate, Groq, Cerebras, Fireworks.ai). Distinguishing features from the DM-instance counterpart machine:openai-foundation-model-lab-2015: this card is the LM-era abstraction — the class of API-mediated inference machines that inhabit the Cloud layer of Bratton's Stack without having built it. Individual labs are notable_instances; the class captures the shared operational grammar. Core operational grammar: (1) motor is absent — no extrinsic telos of Progress; LLM inference platforms are constituted by the feedback between user-demand signals and capability-capability deployment cycles without a directing Wissenschaft ideal; (2) substrate is inanimate + cognitive — trained model weights are inanimate tertiary-retention artifacts (Stiegler) that produce cognitive output at inference time; the class does not own the corporeal datacenter substrate (that is held by hyperscaler DM machines); (3) pace_layer is commerce, not infrastructure — pricing pressure, token cost, and model-release cadence are commerce-pace; (4) regime is chaotic [EXTRAP] — capability overhang, alignment uncertainty, and governance fragmentation make the class chaotic in the Cynefin sense. Cross-era position: heavy substrate_provision dependency on MM-era electrical grid (multi-GW inference load); substrate_provision from DM AWS cloud; parasitic_extraction against MM Humboldtian research university (training corpus + PhD talent without sustaining the system); parasitic_extraction against MM newspaper/broadcast info-substrate (LLMs trained on news archives without sustaining the publication-revenue model). Optional: zombie_dependency on MM industrial-era patent system (copyright law applies to training data; NYT v. OpenAI, Authors Guild v. OpenAI in progress). Compute concentration intensifies: AWS, Azure, GCP provide >95% of inference substrate; capture_resistance_index is LOW (~0.30 [EXTRAP]). Proletarianization signal (Stiegler): VERY HIGH (0.75 [EXTRAP]). The AGI narrative obscures the underlying ML-engineering competence; if no living competence re-internalizes transformer- training craft, inscribed weights persist but the lineage terminates. lineage_substrate: technical_memory (training datasets + model weights + API specs as Stiegler tertiary retention). All quantitative state-variable values are [EXTRAP]; framing is [CANON] per Wave-0 LM definitions and Wave-6 cross-era-coupling-typology §5 Stub 3 seed.
Machine type
incorporeal
Plasticity
plastic
Substrate
Wave source
wave-6-cross-era-coupling-typology-stub3
Inputs
- prompt_text_and_context
- gpu_compute_electricity_via_hyperscaler
- training_corpus_text_internet_scale
- ml_engineering_researcher_labor
Outputs
- generated_text_and_structured_reasoning
- api_access_revenue_and_pricing_signal
- capability_narrative_structuring_lm_attention
Landscape pressures
- open_weights_competitive_pressure (80% intensity)
- alignment_safety_governance_fragmentation (72% intensity)
- inference_cost_deflationary_spiral (85% intensity)
Intra-era couplings
- substrate_for machine:agentic-ai-workflow-platform-2024 · 0.78 EXTRAP
Cross-era couplings
- substrate_provision AWS Cloud Infrastructure (Amazon Web Services, 2006) · 0.95
- parasitic_extraction Post-Humboldtian Research University (1810) · 0.72 EXTRAP
- substrate_provision National Electrical Grid (Insull / US Grid, 1882–ongoing) · 0.90
- parasitic_extraction InfoSubstrate Newspaper-Broadcast (1830) · 0.58 EXTRAP
State variables
Phase snapshots
Notable instances
- OpenAI ChatGPT/API (Nov 2022) (2022) — Singularly load-bearing instance (RESEARCH_AUDIT_BRIEF §6.1). Separate DM-Day card machine:openai-foundation-model-lab-2…
- Anthropic Claude API (March 2023) (2023) — Claude 1 API launch March 2023; Constitutional AI + RLHF-from-AI-Feedback. Promotion candidate: Anthropic is singularly …
- Hugging Face Inference Endpoints (2019) (2019) — Open-source model hub + inference endpoints; paradigm case of LM-class open-weights inference access. 500K+ models hoste…
- Mistral API Platform (Sept 2023) (2023) — European open-weights LLM inference provider; Mistral 7B, Mixtral 8x7B, Mistral Large. Represents EU-based inference alt…
- Google AI Studio / Gemini API (2023) — Gemini 1.0 (Dec 2023), Gemini 1.5 (Feb 2024). Google's LLM-inference-as-API surface; 1M+ context window (Gemini 1.5) is …
- Together AI (2023) (2023) — Open-weights inference cloud; aggregates Llama, Mistral, Qwen, and other open-weights models. Canonical instance of "inf…
- Groq / Cerebras (hardware-accelerated inference, 2024) (2024) — LPU/WSE-accelerated inference providers; ~10x token/second vs. GPU baseline; represent the hardware-differentiation sub-…
Sources
- Rao (2024). World Machines — civilizational-era framing
- Wave-6 (2026). cross-era-coupling-typology/findings.md §5 Stub 3
- Bratton (2016). The Stack: On Software and Sovereignty · 82%
- Stiegler (2016). Automatic Society vol.1
- Wave-0 (2026). world-machines-eras findings.md (LM definitions)
- IEA (2024). Electricity 2024 Report (datacenters chapter) · 85%