rag-as-native-attention

RAG as Native Attention

Wanderland is RAG where the retriever is the substrate itself, not a bolt-on step.

Core Claim

| Conventional RAG | Wanderland |
|---|---|
| Retrieval is an external stage | Retrieval is the attention mechanism |
| Fetches documents and feeds them to the LLM | Fences, pages, and graph queries ARE Q over persistent K/V |
| Attention runs only inside the model | Attention runs over the entire corpus |

Attention Mapping

K (Keys)

  • Fence identities (slug:fence)
  • Tags, links, metadata
  • Schema descriptors
  • These define where patterns live and how they can be matched

Q (Queries)

| Query type | Attention equivalent |
|---|---|
| peek(slug:fence) | Single-head local attention |
| peek(slug) (all fences on a page) | Multi-head attention within a local region |
| query(pattern/tags/graph-walk) | Global attention over the entire DAG |
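
The Q-side mapping can be sketched with a toy in-memory corpus. Everything here is illustrative: `Corpus`, `poke`, `peek`, and `query` are hypothetical stand-ins, not the real Wanderland API.

```python
class Corpus:
    """Toy substrate: fences are K, addressing them is Q."""

    def __init__(self):
        self.fences = {}  # K: "slug:fence" -> stored value
        self.tags = {}    # K metadata: "slug:fence" -> set of tags

    def poke(self, addr, value, tags=()):
        self.fences[addr] = value
        self.tags[addr] = set(tags)

    def peek(self, addr):
        # "slug:fence" -> single-head local attention (one key matched).
        if ":" in addr:
            return {addr: self.fences[addr]}
        # bare "slug" -> multi-head attention over all fences on the page.
        return {k: v for k, v in self.fences.items()
                if k.startswith(addr + ":")}

    def query(self, tag):
        # Global attention: match a tag over the entire corpus.
        return {k: v for k, v in self.fences.items()
                if tag in self.tags[k]}

c = Corpus()
c.poke("pricing:model", "def price(x): ...", tags={"code"})
c.poke("pricing:assumptions", "- churn 2%/mo", tags={"data"})
c.poke("roadmap:q3", "ship attention field", tags={"data"})

print(c.peek("pricing:model"))  # one fence: local attention
print(c.peek("pricing"))        # whole page: multi-head
print(c.query("data"))          # cross-page: global attention
```

The three calls widen the attention span from one key, to one page, to the whole corpus, mirroring the three rows of the table above.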

V (Values)

| Level | Content | Space |
|---|---|---|
| L3 | Code/fence definitions | Tool body |
| L4 | Computed data results | Materialized outputs |
| L5 | Rendered documents | Middleware views |

The Operation

Every time you:

  • Address a fence, page, or subgraph → you're emitting Q
  • Match against graph structure, indices, tags → that's K
  • Retrieve cached results at various levels → that's V

Already stored. Provenance-tagged. Reusable.
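
A minimal sketch of that single operation, under the same toy assumptions (the `Hit`, `STORE`, and `attend` names are hypothetical): each retrieved V carries its cache level and provenance, so it can be reused without re-deriving it.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Hit:
    key: str     # K that matched: the fence identity
    value: str   # V retrieved: the cached content
    level: str   # cache level: "L3", "L4", or "L5"
    source: str  # provenance: what produced this value

# Illustrative store: an L4 table derived from an L3 fence,
# an L5 document derived from the L4 table.
STORE = {
    "pricing:model": Hit("pricing:model", "def price(x): ...", "L3", "human"),
    "pricing:table": Hit("pricing:table", "tier,price\nfree,0", "L4", "pricing:model"),
    "pricing:doc":   Hit("pricing:doc", "# Pricing\n...", "L5", "pricing:table"),
}

def attend(q):
    """Emit Q, match against K, return provenance-tagged V: one operation."""
    return [hit for key, hit in STORE.items() if q in key]

for h in attend("pricing"):
    print(h.level, h.key, "<-", h.source)
```

The point of the `source` field is the last table row of the pitch: provenance flows with results across layers rather than being reconstructed after the fact.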

Layered RAG as Layered Attention

GraphRAG papers argue that RAG over knowledge graphs gives better structure and retrieval granularity.

Wanderland goes further: the graph is the primary medium of thought.

  • L3 = "tool code space" (functions)
  • L4 = "fact/data space" (results)
  • L5 = "narrative space" (documents/views)

Queries can:

  • Target any combination of levels
  • Compose arbitrary context windows
  • Feed into other fences (closed-loop computation)
  • Feed into external LLMs (classical RAG)
  • Feed into human-facing documents/dashboards
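
The first two bullets can be sketched directly: pick any combination of levels and compose a context window from whatever matches. The `ENTRIES` data and `context_window` helper are hypothetical; the assembled string stands in for what would be fed to a fence, an external LLM, or a dashboard.

```python
# Toy corpus entries tagged by cache level.
ENTRIES = [
    ("L3", "def forecast(d): ..."),
    ("L4", "2025 revenue: 1.2M"),
    ("L5", "# Q3 Narrative\nGrowth held steady."),
    ("L4", "2024 revenue: 0.9M"),
]

def context_window(levels):
    """Target any combination of levels and compose a context window."""
    return "\n".join(text for level, text in ENTRIES if level in levels)

# Facts only (L4), vs. facts plus narrative (L4 + L5) for classical RAG:
facts = context_window({"L4"})
prompt = "Answer from context:\n" + context_window({"L4", "L5"})
print(facts)
```

Because the window is assembled by selection over the graph rather than by ad-hoc chunking, the same call yields a tool-code window, a fact window, or a narrative window just by changing the level set.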

The Difference

| Traditional | Wanderland |
|---|---|
| Docs with RAG bolted on | Persistent, queryable attention field |
| Retrieval, then computation | Retrieval and computation are the same operation |
| Ad-hoc chunking | Context windows assembled by graph traversal |
| Results without history | Provenance flows with results across layers |

The Pitch

RAG as native attention over a DAG substrate, with fences as heads, queries as Q, and layered caches as V.

An externalizable mind, not a pile of markdown.

Provenance

  • Source: Perplexity synthesis + conversation, 2025-01-05
  • Status: 🟢 Ready for pitch
  • Context: Framing for Amjad follow-up conversations

South

slots:
- context:
  - ISA specification implements the RAG-as-native-attention pattern
  slug: unified-peek-poke-cache-design-20260105