Decentralized 72B · Live
Decentralized LLM // Mesh Architecture
This is a live decentralized LLMQwen2.5-72B-Instruct running across a mesh of independent nodes, not a single server. Its weights are split across GPU nodes, so no node ever holds the complete model.

How it works: a coordinator sits at the center of the mesh and routes each request. The GPU nodes carry the model — each computes its share and hands the result to the next, so every token is produced collectively across the mesh. Around them, node clients form the wider network — smaller machines that join, route, and support the cluster. The point is the architecture: pooled, trustless compute where powerful and everyday hardware work side by side, and everyone earns for the work they do.
Qwen2.5-72B · AWQ 4-bit GPU nodes + node clients No node holds full weights Encrypted wire protocol x402 CIRC gate On-chain payment attribution
The model is hot-swappable and the topology is dynamic — nodes join the cluster and the workload redistributes automatically, all over a model-agnostic wire protocol with on-chain payment attribution. Scaling to more nodes or larger models is a config change, not a rewrite.
← DEEP DIVE DLLM  //  DECENTRALIZED LLM
CONNECTING
Coordinator
GPU node
Client
Offline
Fetching...