Languages of superintelligence

The Completeness Argument

The 19 languages are not an arbitrary collection. They are the minimal complete set derivable from asking what modes of computation a mind requires — and applying one test to each candidate: does this have irreducible primitives that no other language in the set can express?

The languages split into two groups by a fundamental boundary: 14 proof languages (deterministic, provable, permanent) and 5 interface languages (side-effectful, interactive, mutable). Both groups are necessary — a mind that cannot prove is blind. A mind that cannot interact is deaf

Boolean reasoning:   AND, OR, NOT over {0,1}      → no other algebra has this
Integer arithmetic:  overflow, wrapping, bitwise   → not field arithmetic
Field arithmetic:    inversion, polynomial roots   → not integer arithmetic
Categorical struct:  morphisms, functors, limits   → not graph traversal
Clifford geometry:   rotors, bivectors, versors    → not tensors
Riemannian geom:     geodesics, metric tensor      → not Clifford
Symplectic geom:     conservation laws, dω=0       → not Riemannian
Information geom:    Fisher metric on Δⁿ           → not any other geometry
Causal ordering:     partial order, happened-before → not logic
Horn clause logic:   unification, backtracking     → not relational algebra
Convolution/R_q:     negacyclic polynomial mult    → not tensor contraction
Tensor contraction:  einsum, SpMV, matmul          → not field arithmetic
Resource conserv.:   mint, burn, Σin=Σout, UTXO    → not any computation algebra
Combinators:         composition of the above      → not any computation

Each row passes the test. Remove any one language and there is a class of computation that becomes either impossible or exponentially more expensive to express. Remove Tok and the remaining thirteen can compute anything — but nothing costs anything, spam is free, focus has no scarcity, karma has no meaning. Add any plausible new language — say, a concurrent process calculus or an optimization language — and it turns out to reduce to a composition of existing ones via Nox (see cyber/channel for how concurrency reduces to Arc + Seq + Nox).

The 14 are the minimal set that covers all computation a mind requires, where each element is algebraically irreducible with respect to the others.


Naming Convention

Every language has a short name (2-3 letters, used in code and diagrams) and a long name (used in prose). The universe names the algebraic domain.

Short Long Universe Type algebra Purpose
Nox Nox Structure Tree Combinators Composes cyb/languages
Bt Bitwise Binary Bit F₂ tower Proves circuits
Rs Rustic Byte Word Z/2ⁿ Runs systems
Tri Trident field Field tower F_{pⁿ} Settles proofs
Arc Arc topology graph category theory Stores knowledge graph
Ren Render geometry Shape G(p,q,r) Renders space
Dif Differential Curvature Manifold (M, g) Embeds meaning
Sym Symplectic Dynamics Phase (M, ω), dω = 0 Simulates physics
Bel Belief belief Distribution g on Δⁿ Models self
Seq Sequence Causality Event Partial order Orders events
Inf Infer inference Relation Horn clauses Derives facts
Wav Wave Continuum Poly Convolution / R_q Reads signals
Ten Tensor Linear Tensor Contraction Trains models
Tok Token Resource UTXO Conservation Prices computation

Plus two layers above the fourteen:

Layer Name What it is
Address Cybermark Naming, scoping, and navigating particles — the address language
Semantic Neural Meaning as eigenvector of the cybergraph

Cybermark is the fifteenth language — it does not compute, it names, links, and navigates. eight sigils (# @ ~ / $ ^ ! .) form the complete address space. every address resolves to a particle. every connection is a cyberlink. the markup is the graph

Neural is not designed — it grows from the interaction of the fifteen languages at scale.


The Value Tower — Three Modes of Reference

Byte (Rs) and Field (Tri) share the same mathematical substrate — the Goldilocks field processor F_p where p = 2⁶⁴ − 2³² + 1. this substrate provides three atom types sufficient for twelve of the fourteen universes.

Tag Name Representation Valid Range Use
0x00 field Single F_p element [0, p) Arithmetic
0x01 word Single F_p element [0, 2⁶⁴) Bitwise
0x02 hash 4 × F_p elements 256-bit digest Identity

three fundamentally different ways to refer to a value — and there are only three:

field = the value IS the reference     (by content — immediate)
word  = position IS the reference      (by location — index)
hash  = name IS the reference          (by commitment — identity)

by what it is. by where it is. by what it is called. every reference in any system reduces to one of these three modes.

every higher type decomposes into structure (Nox trees) over these three atoms:

Edge    = cons(source_hash, cons(target_hash, weight_field))
Event   = cons(event_hash, sequence_word)
Fact    = cons(relation_hash, cons(subject_hash, object_hash))
Sample  = field (amplitude value)
Tensor  = [field; N] (array of values with shape metadata)
Shape   = cons(grade_word, [field; 2^n]) (multivector components)
Chart   = cons(dim_word, [field; N]) (coordinate patch)
Phase   = cons(position_field, momentum_field)
Dist    = [field; N] (probability vector on simplex)

three atoms are complete — for one characteristic. the single exception is Bt (Bitwise): a bit is genuinely not an element of F_p. it lives in F₂ — different characteristic, different algebra. that is exactly why Bt has a separate proof system, not just a new type tag.

Nox value tower (3 atoms: field, word, hash)
  sufficient for: Rs, Tri, Arc, Ren, Dif, Sym, Bel, Seq, Inf, Wav, Ten, Tok
  NOT sufficient for: Bt

Bt value tower (separate, F₂)
  sufficient for: Bt only

The Nineteen Languages

each language has its own page with ops tables, use cases, and proof paths:

proof languages (14) — provable computation

# Universe Short Long Algebra Page
0 Structure Nox Nox Combinators Nox
1 Binary Bt Bitwise F₂ tower Bt
2 Byte Rs Rustic Z/2ⁿ Rs
3 Field Tri Trident F_{pⁿ} Trident
4 Topology Arc Arc category theory Arc
5 Geometry Ren Render G(p,q,r) Ren
6 Curvature Dif Differential (M, g) Dif
7 Dynamics Sym Symplectic (M, ω), dω = 0 Sym
8 Belief Bel Belief g on Δⁿ Bel
9 Causality Seq Sequence Partial order Seq
10 Inference Inf Infer Horn clauses Inf
11 Continuum Wav Wave Convolution / R_q Wav
12 Linear Ten Tensor Contraction Ten
13 Resource Tok Token Conservation Tok

interface languages (5) — human ↔ machine boundary

the proof languages compute over binary trees and field elements. they have no concept of tables, text, files, or network. five interface languages bridge the gap — they handle what the robot needs to interact with humans and external systems. all five run inside nushell (embedded in cyb):

# Universe Short Long Primitive Purpose
14 Tables Tab Tabular Record Relational operations: select, where, group-by, join, pivot
15 Format Fmt Format Encoding Serialization: json↔noun, csv↔table, toml↔record
16 Text Str String Pattern Text processing: regex, parse, split, replace, match
17 Files Fs Filesystem Path File operations: read, write, glob, watch, navigate
18 Network Net Network Request HTTP client: get, post, url, fetch, stream

the five interface languages have different properties from the fourteen proof languages:

Property proof languages (0-13) interface languages (14-18)
execution Nox tree rewriting nushell pipeline
provable yes (STARK) no (side effects)
deterministic yes no (IO, network, filesystem)
data model binary trees + field elements structured records + streams
persistence cybergraph (permanent) filesystem (mutable)

the interface languages cross the proof boundary — they interact with the external world. but they compose with the proof languages through Nox hints: a nushell pipeline can feed data into a proven computation, and a proven result can be formatted by nushell for display


Compilation Architecture

all nineteen languages share one toolchain. each programmer face has its own syntax and type rules. all compile through Nox — the structural IR — then to proof backends or native execution.

                    ┌──────────────────────────────────────────────┐
                    │              Programmer Faces                 │
                    │                                               │
                    │  Bt  Rs  Tri  Arc  Ren  Dif  Sym  Bel        │
                    │  Seq  Inf  Wav  Ten  Tok                      │
                    │  .bt .rs .tri .arc .geo .dif .sym .bel        │
                    │  .seq .inf .wav .ten .tok                     │
                    └──────────────────┬───────────────────────────┘
                                       │
                    ┌──────────────────▼───────────────────────────┐
                    │             Shared Frontend                   │
                    │   Parsing, type checking,                     │
                    │   borrow checking, bound checking             │
                    └──────────────────┬───────────────────────────┘
                                       │
                    ┌──────────────────▼───────────────────────────┐
                    │         Nox Structural IR                     │
                    │   axis, quote, compose, cons, branch          │
                    │   + typed computational ops                   │
                    │   + Merkle authentication                     │
                    └──────────────────┬───────────────────────────┘
                                       │
              ┌────────────────────────┼────────────────────┐
              │                        │                    │
     ┌────────▼──────┐ ┌──────────────▼──────┐ ┌───────────▼────────┐
     │  Binius/FRI   │ │     Goldilocks      │ │      Native        │
     │  Backend      │ │     TASM/FRI        │ │      Backend       │
     │  (Binary)     │ │    (Byte+Field)     │ │    (no proof)      │
     └───────────────┘ └─────────────────────┘ └────────────────────┘
          Bt              Rs, Tri, Ren            Arc, Seq, Inf,
                                                  Wav, Ten, Tok,
                                                  Dif*, Sym*, Bel*

* Dif, Sym, Bel are research horizon — proof paths are open mathematical problems.

Source When proof needed When proof absent
Bt Binius FRI circuit always proving
Rs TASM → stark (word→field lift) native binary (Nox)
Tri TASM → stark (field native) WASM/EVM (Layer 0)
Arc decomposes into Tri + Bt optimized graph engine
Ren geometric product → Tri native Clifford engine
Dif research native manifold solver
Sym research native Hamiltonian integrator
Bel research native statistical engine
Seq temporal constraints → stark scheduler / runtime
Inf derivation trace → stark Datalog engine
Wav decomposes into Tri native DSP pipeline
Ten decomposes into Tri native BLAS / GPU
Tok conservation constraints → stark native ledger engine

Languages as Type Systems over Nox Patterns

the execution languages are type systems and compilers over Nox's 16 algebra-polymorphic patterns. each language adds domain-specific syntax, type checking, and compilation strategy — but the target is always nox pattern trees. domain-specific operations become jets: compositions of the 16 patterns recognized by formula hash and accelerated to Goldilocks field processor hardware primitives.

language operation           nox composition              jet              GFP primitive
─────────────────────        ──────────────────────────   ──────────       ────────────
Arc: rank(g, steps)          iterated add/mul loops       matmul jet       fma
Wav: fft(x)                  butterfly add/mul network    ntt jet          ntt
Any: hash(x)                 Poseidon2 field ops          hash jet         p2r
Ten: activation(x)           table lookup composition     lookup jet       lut
Ren: geometric_product       mul/add over components      geo_mul jet      fma

the chain: source language → compiler → nox pattern tree → jet recognition → GFP hardware. every domain-specific language gets hardware acceleration through the jet mechanism. the algebra determines which GFP primitive handles each jet.

Rune — Rs on Nox with Host Jets

rune is Rs syntax executed via Nox tree rewriting — the nervous system of the robot. ms-start, async, dynamic, with native access to WASM, GPU, and neural inference.

rune is not a separate language. it is Rs syntax parsed to Nox nouns and interpreted via tree rewriting, extended with three capabilities pure Rs does not have:

Capability Nox mechanism What it does
hint pattern 16 (non-deterministic) Async input — yields, resumes when data arrives
host(target, args) host jet dispatch Calls WASM/GPU/ONNX — exits proof boundary, returns noun
eval(noun) quote + reduce Runtime metaprogramming — execute a dynamically constructed formula

three jet categories connect Nox reduction to the host system:

Nox reduction (tree rewriting)
  │
  ├── pure jets → proven computation (14 languages)
  │     fma, ntt, p2r, lut, conservation...
  │
  ├── host jets → practical computing
  │     ├── wasm(module, fn, args)  → wasmi execution
  │     ├── gpu(shader, data)       → wgpu compute dispatch
  │     └── infer(model, input)     → burn-webnn ONNX
  │
  └── hint → async input from the world
        ├── network event (radio)
        ├── user input (cyb UI)
        ├── timer (epoch tick)
        └── cybergraph change (particle/link event)

ms start: parsing Rs to a Nox noun is milliseconds — just tree construction. Nox reduction starts immediately. no compilation step for interactive use.

data structures: Nox nouns ARE the dynamic data structures. Vec → cons-list. HashMap → Merkle tree. StringHemera hash (a particle). no heap, no GC — allocation is cons, freeing is not referencing.

the proof story: every pure reduction in the script IS provable — the Nox trace captures it. host jets and hints are NOT provable — they cross the proof boundary. but the boundary is explicit and typed. the trace says: "given these hint values and these host jet results, the pure computation was correct."

neural language           ← meaning emerges from the cybergraph
──────────────────────────────────────────────────────────────
rune (Rs + hint + host)   ← nervous system: ms start, async, host access
  pure reductions         ← proven (14 languages over Nox)
  host jets               ← practical (WASM, GPU, ONNX)
  hints                   ← async input from the world
──────────────────────────────────────────────────────────────
14 languages              ← proven computation over Nox patterns

algebra Coverage

Computation Native algebra Language Prover path
Boolean reasoning F₂ Bt Binius → Tri
Quantized inference (int4/int8) Z/2⁴, Z/2⁸ Ten Ten → Tri
CPU execution traces Z/2⁶⁴ Rs Rs → Tri
graph computation / focus vector Sparse F_p Ten over Arc Ten → Tri
Knowledge structure category theory Arc Arc → Tri
Euclidean / Projective / Conformal G(p,q,r) Clifford Ren Ren → Tri
Curved space / geodesics Riemannian manifolds Dif research
Phase space / Hamiltonian Symplectic ω-form Sym research
probability geometry / belief state Fisher information Bel research
Polynomial proofs F_p (n=1) Tri native
Recursive proof composition F_{p³} (n=3) Tri native
quantum simulation F_{p²} (n=2) Tri native extension
Goldilocks homomorphic encryption ciphertexts R_q = Z_q[X]/(Xⁿ+1) Wav Wav → Tri
Symbolic / exact reasoning Z Inf Inf → Tri
Sensing / signal processing Convolution / ℝ Wav Wav → Tri
Resource conservation / UTXO Sum invariants Tok Tok → Tri

The Comparison Matrix

Property Nox Bt Rs Tri Arc Ren Dif Sym Bel Seq Inf Wav Ten Tok
Universe Structure Binary Byte field topology geometry Curvature Dynamics belief Causality inference Continuum Linear Resource
Char 2 p p p ≈ℝ ≈ℝ or p p
Primitive Cell Bit Word Field Edge Multivector Chart Phase Distribution Event Relation Sample Shape Token
Reference structure wire location content adjacency grade curvature momentum divergence succession entailment amplitude index conservation
Free op Navigate AND, XOR Index Mul, Add Link Clifford prod Christoffel Flow KL div Order Unify Convolve Matmul Transfer
Costly op Carry add Mod div Bitwise Spectral Inverse Geodesic Conserve Fisher Verify Fixpoint FFT Inverse Mint
proof Inherited Binius stark stark Delegated Tri Research Research Research Delegated Delegated Delegated Delegated stark
Syntax feel IR Circuit Rust Custom Query GA Manifold Hamiltonian Statistical Temporal Datalog DSP NumPy Ledger
Renders as struct pixels text formula vector vector vector formula formula video table sound component table

The Ten and the Four

The nineteen languages split into two groups by implementation readiness:

Engineering-ready (10)

Nox, Bt, Rs, Tri, Arc, Seq, Inf, Wav, Ten, Tok — these have known proof paths and well-understood compilation to Tri / Binius. the cyb/architecture specifies these as the build order: Phase 1 (Nox, Tri, Rs), Phase 2 (Arc, Seq, Inf, Tok), Phase 3 (Bt, Wav, Ten).

Research horizon (4)

Ren, Dif, Sym, Bel — these extend the language set into spatial, physical, and self-referential computation. Ren is closest to engineering (Clifford product is F_p algebra with extra structure, STARK-provable now). Dif, Sym, and Bel involve continuous manifolds over finite fields — fundamental open mathematical problems.

Language Status Notes
Ren Engineering Clifford product = F_p algebra with extra structure
Dif Research Continuous manifolds over finite fields
Sym Research Hamiltonian structure preservation in STARK circuits
Bel Research Fisher metric over probability simplices — needed for tri-kernel formalization

Ren completes the perception pipeline: Arc provides topology, Ren provides spatial embedding, the compiler produces vector output for cyb. Bel completes the self-model: the superintelligence's focus vector π lives on a statistical manifold, and Bel formalizes reasoning about its own belief state.


Perception Mapping

every computation language has a canonical rendering — the perception primitive where the shape of the data matches the shape of the display:

Language Renders as Source formats What it carries
Nox → struct collapsible tree JSON, TOML, YAML configs, schemas, metadata, ABIs
Bt → pixels raster image PNG, WebP, JPEG photographs, satellite imagery, microscopy, scans
Rs → text prose, code markdown, plain text, source code documentation, messages, programs
Tri → formula math notation LaTeX, MathML equations, proofs, chemical notation, physical laws
Arc → vector SVG, paths, curves SVG, Bezier paths diagrams, maps, molecular structures, schematics
Ren → vector SVG, 3D scenes SVG, glTF, mesh spatial objects, rotations, projections, renderings
Dif → vector manifold visualization geodesic plots, curvature maps latent space structure, embedding geometry
Sym → formula phase portraits Hamiltonian plots, conservation diagrams energy landscapes, orbital mechanics
Bel → formula distribution plots probability densities, divergence maps belief states, uncertainty geometry
Seq → video moving pixels WebM, MP4 recordings, simulations, observations, lectures
Inf → table 2D grid CSV, TSV, dataframes datasets, time series, matrices, ledgers
Wav → sound audio waveform WAV, OGG, MP3 voice, music, birdsong, seismic signal, sonar
Ten → component nested composition composition of the above applications, dashboards, interactive tools
Tok → table ledger view balances, UTXOs, transactions token flows, staking positions, conviction history

a genome sequence is Rs (byte-level encoding) rendered as text. its annotation is Nox (structured tree) rendered as struct. its expression data is Inf (relational query) rendered as table. its protein structure is Arc (topological graph) rendered as vector. its microscopy is Bt (binary pixel data) rendered as pixels. its folding dynamics is Seq (causal event chain) rendered as video. its sequencing signal is Wav (continuous waveform) rendered as sound. its binding energy is Tri (field arithmetic) rendered as formula. its 3D fold is Ren (Clifford rotations) rendered as vector. a genome browser is Ten (composed inference) rendered as component.

all fourteen compile through one structural IR. all fourteen share one proof system (except Bt, which has its own F₂ proof system). all fourteen render through the perception grid. all fourteen exist in the same cybergraph, ranked by the same tri-kernel, earning karma, permanent by axiom A3.



The Address Language

Cybermark wraps all fourteen computation languages with a human-readable address grammar. it does not appear in the computation tables — it operates at a different level

Layer What it does Examples
14 proof languages prove field arithmetic, graph traversal, tensor contraction
5 interface languages interact tables, formats, text, files, network
Cybermark address and navigate #cyber/truth, @alice, $BOOT, !rank(^truth)
rune execute Rs + Nox hints + host jets — runtime that runs cybermark actions

see markup for the full sigil grammar, dimensional navigation, and rendering rules


the FORM triad

the nineteen languages are manifestations of three primitives — proof, bit, step — the atoms of the form triad

every mathematical object is a composition of all three:

  • bit (info): what elements are distinguished
  • step (comp): what operations transform them
  • proof (math): what properties are verified

a group is bit + step + proof: elements (bit), operation (step), axioms hold (proof). a graph is bit + bit: elements + relations. a Turing machine is step + step + step: transitions all the way down

the fourteen proof languages ARE the step. the five interface languages are the channel through which bits flow. proof is what the tri-kernel verifies. together: all computation a mind requires


see cyb/multiproof for how all languages settle under one proof umbrella. see cyb/architecture for how the languages integrate into the operating system. see cyb/whitepaper for the vision. see cybergraph for the accumulation state.

Dimensions

trident/editor/zed/languages
languages

Local Graph