This document provides high-level documentation for Cumulus.
Each Substrate blockchain provides a runtime. The runtime is the state transition function of the blockchain. Cumulus provides interfaces and extensions to convert a Substrate FRAME runtime into a Parachain runtime. Polkadot expects each runtime exposes an interface for validating a Parachain's state transition and also provides interfaces for the Parachain to send and receive messages of other Parachains.
To convert a Substrate runtime into a Parachain runtime, the following code needs to be added to the runtime:
cumulus_pallet_parachain_system::register_validate_block!(Block, Executive);
This macro call expects the Block
and Executive
type. It generates the validate_block
function
that is expected by Polkadot to validate the state transition.
When compiling a runtime that uses Cumulus, a WASM binary is generated that contains the full code
of the Parachain runtime plus the validate_block
functionality. This binary is required to
register a Parachain on the relay chain.
When the Parachain validator calls the validate_block
function, it passes the PoVBlock (See Block
building for more information) and the parent header of the Parachain that is
stored on the relay chain. From the PoVBlock witness data, Cumulus reconstructs the partial trie.
This partial trie is used as storage while executing the block. Cumulus also redirects all storage
related host functions to use the witness data storage. After the setup is done, Cumulus calls
execute_block
with the transactions and the header stored in the PoVBlock. On success, the new
Parachain header is returned as part of the validate_block
result.
Parachains support light-clients, full nodes, and authority nodes. Authority nodes are called Collators in the Polkadot ecosystem. Cumulus provides the consensus implementation for a Parachain and the block production logic.
The Parachain consensus will follow the relay chain to get notified about which Parachain blocks are included in the relay-chain and which are finalized. Each block that is built by a Collator is sent to a validator that is assigned to the particular Parachain. Cumulus provides the block production logic that notifies each Collator of the Parachain to build a Parachain block. The notification is triggered on a relay-chain block import by the Collator. This means that every Collator of the Parachain can send a block to the Parachain validators. For more sophisticated authoring logic, the Parachain will be able to use Aura, BABE, etc. (Not supported at the moment)
A Parachain Collator will join the Parachain network and the relay-chain network. The Parachain network will be used to gossip Parachain blocks and to gossip transactions. Collators will only gossip blocks to the Parachain network that have a high chance of being included in the relay chain. To prove that a block is probably going to be included, the Collator will send along side the notification the so-called candidate message. This candidate message is issued by a Parachain validator after approving a block. This proof of possible inclusion prevents spamming other collators of the network with useless blocks. The Collator joins the relay-chain network for two reasons. First, the Collator uses it to send the Parachain blocks to the Parachain validators. Secondly, the Collator participates as light/full-node of the relay chain to be informed of new relay-chain blocks. This information will be used for the consensus and the block production logic.
Polkadot requires that a Parachain block is transmitted in a fixed format. These blocks sent by a
Parachain to the Parachain validators are called proof-of-validity blocks (PoVBlock). Such a
PoVBlock contains the header and the transactions of the Parachain as opaque blobs (Vec<u8>
). They
are opaque, because Polkadot can not and should not support all kinds of possible Parachain block
formats. Besides the header and the transactions, it also contains the witness data and the outgoing
messages.
A Parachain validator needs to validate a given PoVBlock, but without requiring the full state of the Parachain. To still make it possible to validate the Parachain block, the PoVBlock contains the witness data. The witness data is a proof that is collected while building the block. The proof will contain all trie nodes that are read during the block production. Cumulus uses the witness data to reconstruct a partial trie and uses this a storage when executing the block.
The outgoing messages are also collected at block production. These are messages from the Parachain the block is built for to other Parachains or to the relay chain itself.
Every Substrate blockchain supports runtime upgrades. Runtime upgrades enable a blockchain to update its state transition function without requiring any client update. Such a runtime upgrade is applied by a special transaction in a Substrate runtime. Polkadot and Cumulus provide support for these runtime upgrades, but updating a Parachain runtime is not as easy as updating a standalone blockchain runtime. In a standalone blockchain, the special transaction needs to be included in a block and the runtime is updated.
A Parachain will follow the same paradigm, but the relay chain needs to be informed before
the update. Cumulus will provide functionality to notify the relay chain about the runtime update. The
update will not be enacted directly; instead it takes X
relay blocks (a value that is configured
by the relay chain) before the relay chain allows the update to be applied. The first Parachain
block that will be included after X
relay chain blocks needs to apply the upgrade.
If the update is applied before the waiting period is finished, the relay chain will reject the
Parachain block for inclusion. The Cumulus runtime pallet will provide the functionality to
register the runtime upgrade and will also make sure that the update is applied at the correct block.
After updating the Parachain runtime, a Parachain needs to wait a certain amount of time Y
(configured by the relay chain) before another update can be applied.
The WASM blob update not only contains the Parachain runtime, but also the validate_block
function provided by Cumulus. So, updating a Parachain runtime on the relay chain involves a
complete update of the validation WASM blob.