Skip to main content
T TON Adoption
Basics DEEP DIVE · 2026

TON Sharding Inside: How High TPS Is Achieved (2026)

TON's sharding architecture in depth: masterchain, workchains, dynamic split/merge, Catchain 2.0, real TPS versus the 104K test and the network's honest limits.

Author
TON Adoption Team · research desk
Published
9 min read

TON was designed from day one as a network where sharding is not an optional L1 add-on but the architecture itself. TON Foundation marketing tends to quote “millions of TPS”; engineering posts cite the 104,715 TPS October 2024 record; explorers will show you 40 tx/s. All three are true in their own frame, but they describe different worlds. This article walks the architecture top-down to show where the high TPS comes from and where the design genuinely runs out of headroom.

TON architecture: masterchain, workchains, shardchains

The network is hierarchical. At the top sits the masterchain (workchain ID -1) — a service blockchain that stores network configuration, the validator set, the list of active shards and finalises state across the whole system. There is exactly one masterchain.

Underneath the spec allows up to 2^32 parallel workchains, each potentially with its own VM, address format and fee rules. In 2026 mainnet only one workchain is live — basechain (workchain ID 0). That is where every familiar transaction lives: TON transfers, jetton movements, NFTs, DEX contracts.

Each workchain in turn dynamically splits into shardchains. Per the TON Whitepaper, section 2 the theoretical ceiling is 2^60 shards per workchain. Production is far more modest: shard-prefix length is capped at 4 bits, so basechain can split into at most 16 shardchains at any moment. That gap between the spec ceiling and the production cap is the source of most of the “millions of TPS versus 40 tx/s” confusion we are about to unpack.

The architectural invariant: each contract lives in exactly one shard, and which shard is determined by the prefix of its account ID. When a shard splits in two, contracts whose ID begins with 0... stay on one child shard, those with 1... move to the other. The address itself does not change.

Dynamic sharding: split, merge and the Infinite Sharding Paradigm

Most sharded networks (Near, formerly Ethereum 2.0) fix the shard count in protocol and change it only by hard fork. TON decides on the fly. That is the Infinite Sharding Paradigm from the whitepaper: conceptually every account has its own virtual blockchain, and real shardchains are simply groupings of those per-account chains by ID prefix.

Split/merge mechanics:

  • When shard load exceeds the upper threshold defined in validator config, the shard splits in two — one additional bit is added to the prefix. Account IDs do not move; they are now served by a different validator subset.
  • When load drops below the lower threshold, two sibling shards merge back together.
  • Both thresholds are network-config parameters, voted on by validators and not pinned to a single public number in the docs.

For developers the effect is that you write the contract as if sharding does not exist. All routing detail is hidden inside the protocol via hypercube routing — a scheme that delivers cross-shard messages through intermediate blocks. Specs in TON docs and the original Catchain paper.

The price for that transparency is cross-shard latency. A message sent from a contract in one shard to a contract in another is not delivered atomically in the same block: it is first committed in the source shard, then routed via the masterchain, then accepted in the destination shard. Before Catchain 2.0 this took roughly 12 seconds with several shards active; after April 2026 the number dropped proportionally but still measures in seconds — typically 1.2-2s depending on load.

Real TPS, the 104,715 test, and what sits between

Here is the table to keep in mind whenever you read any TPS figure for TON:

SourceTPSWhat it actually measures
Chainspect, May 2026~40Mainnet average over the last hour
Chainspect, recent 100-block peak1,542Short burst during airdrop / event
Oct 2024 test, CertiK-attested104,715256 Alibaba Cloud nodes, 512 shardchains, synthetic contracts, 10 minutes
TON Foundation theoretical”millions”Extrapolation assuming full 2^60 shard expansion

The 104,715 TPS test from October 2024, attested by CertiK, is the engineering upper bound, not the network’s day-to-day reality. The setup: the team spun up 256 high-spec nodes on Alibaba Cloud, activated 512 shardchains simultaneously, and drove pre-deployed contracts under load for 10 minutes. These were not vanilla transfers — the public write-up stresses that the contracts performed “complex execution”. The test rig is reproducible from github.com/ton-blockchain/performance-test.

Why does mainnet deliver 40 tx/s when the hardware can sustain 104K:

  1. Validator and shard counts. Mainnet runs roughly 408 validators (chainspect) and typically 4-8 active shards at any moment — not 512.
  2. Actual demand. The mix of users who actually exist on TON today generates 40 tx/s. That is demand, not ceiling.
  3. Geographic distribution. The test ran in one datacentre; mainnet is globally distributed, which adds consensus latency.
  4. Transaction mix. Real traffic is dominated by jetton transfers (heavier than raw TON sends) and DEX interactions (multi-message flows).

The honest read: 104,715 TPS is evidence that the architecture will not bottleneck as demand scales. It is not a claim that the network is ready to absorb Visa-grade volume today.

Catchain 2.0: what shipped on April 9, 2026

Catchain 2.0 is the largest consensus upgrade in TON mainnet history. Shipped on April 9, 2026. What changed:

  • Block time dropped from ~5s to ~0.4s.
  • Finality dropped from ~10s to ~0.8s.
  • Throughput rose roughly 10x on the same validator set.
  • Annual issuance rose from 0.6% to 3.6% — the cost of more frequent blocks.

Under the hood, Catchain is a pBFT-derived consensus requiring more than 2/3 of validator signatures. Catchain 2.0 rewrote the pipeline so that proposing the next block now runs in parallel with verifying the previous one — that overlap is the main source of the latency win. Implementation notes live in blog.ton.org and in the protocol whitepaper.

From the user’s perspective the observable change is that transfers and DeFi interactions are now noticeably snappier. Jetton-transfer confirmations feel “instant” rather than “two Mississippi”. For DEXes the gap between blocks is now tight enough that arbitrage has moved closer to CEX cadence.

Where the design actually runs out of headroom

Reading only the press posts leaves an impression of an “infinitely scalable” network. In practice TON’s architecture has specific, named bottlenecks.

1. Single-account bottleneck. All transactions for one contract must land in one shard — otherwise prefix-based routing breaks. A hot DEX pool, a popular auction contract, or a viral mini-app contract is therefore capped at the throughput of a single shard. Practical ceiling: a few hundred tx/s per contract. The architectural answer is to shard the contract at the application level (multiple parallel pool contracts, sharded NFT collections), which adds engineering cost.

2. Cross-shard latency. Hypercube routing is an elegant theoretical solution, but the price is multi-block latency between shards. Pre-Catchain 2.0 the number was ~12s; now it sits around 1.2-2s, but it still costs several blocks. For UX tuned around “one-second confirmation” this is visible in complex DeFi flows where a message threads through 3-4 contracts in different shards.

3. The prefix cap. The declared 2^60 shards per workchain live only in the spec. The production parameter caps shard prefix at 4 bits — 16 shards on basechain. Reaching “million-TPS” territory is several protocol upgrades away, not a config flag.

4. Test vs production. The 104,715 TPS run used 256 high-spec servers in a single datacentre with pre-deployed synthetic contracts. Mainnet runs 408 geographically distributed validators on heterogeneous hardware with a real transaction mix. The order-of-magnitude gap is not a bug, it is the difference between two operating regimes.

5. Inflation as the price of speed. A 10x throughput gain came with a 6x issuance increase. That is a legitimate choice, but it is a choice, not a free upgrade.

Comparing with other L1s on the same metrics helps calibrate expectations:

ChainTheoretical TPSReal TPS (1h)Block timeFinalityNakamoto coef.
TON104,715 (test)400.4s0.8s~408 validators
Solana65,000~1,0000.4s12.8s (full) / 2.5s20
Ethereum L1~3015-2012s~13 min~3
Aptos~160,000~100<1s<1s16

TON beats Ethereum L1 on latency and throughput by orders of magnitude, beats Solana on finality (0.8s vs 12.8s at a comparable block time), and beats Aptos on practical decentralisation (~408 validators vs 16). It loses to Solana and Aptos on theoretical and demonstrated per-chain TPS, and that loss is architectural: monolithic Solana/Aptos designs optimise for global ordering, while sharded TON optimises for parallelism.

Roadmap through 2027

Durov announced MTONGA (Make TON Great Again) in early 2026 — a seven-step optimisation programme. Mid-May 2026 status:

  1. Catchain 2.0 — shipped April 9, 2026.
  2. Roughly 6x fee cut — announced, target ~$0.0005 per jetton transfer.
  3. Validator/collator split. Validators will watch only the masterchain; a new collator role will pre-assemble shardchain blocks. Goal: lower validator hardware requirements while raising throughput.
  4. Optimistic collation. Collators will forward blocks for propagation before validation completes, shaving another second from the latency budget.
  5. AgenticKit, Rust Node v1, Tolk dev tools. Developer tooling and an alternative Rust node client — reducing the network’s dependence on the single C++ implementation.
  6. A new consensus layer. The post-Catchain-2.0 evolution; details not yet public.
  7. Layer-2 Payment Network + BTC Teleport bridge. A fast-payments layer and a native Bitcoin bridge.

Delivery is spread across 2026-2027. Pragmatic expectation: by end-2026, fee reduction and the start of collator/validator split work; through 2027, Rust node and L2 payment network in production.

What this means for builders and holders

If you are building a TON dApp in 2026, the reasonable defaults are:

  • Finality ~0.8s — design UX for “near-instant” rather than “synchronous”.
  • Cross-shard messages are not atomic. Hide complex multi-contract flows behind a state machine that handles intermediate states correctly.
  • One contract, one shard. If you anticipate a high-throughput scenario (mass airdrop, hot NFT mint), shard the state at the application layer.
  • 104,715 TPS is not a marketing cudgel, it is a ceiling under a specific configuration. Plan for 100-300 tx/s per contract as your working baseline.

If you are a holder thinking about long-term network health, the indicators that matter are validator-count resilience (does decentralisation hold after the collator/validator split lands), real inflation (does it stay at the announced 3.6% rather than drifting up), and the L2 payment layer (which should absorb high-frequency micropayments and offload L1).

Sources

Frequently asked

Per Chainspect in May 2026, the mainnet average is roughly 40 tx/s, with a recent 100-block peak of 1,542 tx/s. The 104,715 TPS headline came from an October 2024 test on 256 cloud servers running 512 shardchains over a 10-minute window — it is an engineering ceiling, not a live mainnet number.
The April 9 2026 upgrade cut block time from about 5s to roughly 0.4s, finality from about 10s to roughly 0.8s, and lifted throughput by an order of magnitude. The tradeoff was a jump in annual issuance from 0.6% to 3.6%.
Mainnet has one workchain (basechain) plus the masterchain. The production shard-prefix length is capped at 4 bits, so basechain can split into at most 16 shardchains at any moment. The 2^60 figure is a spec ceiling, not 2026 reality.
All transactions for one contract land in one shard — that follows from prefix-based routing of account IDs. Per-account throughput is therefore bounded by a single shard's capacity, and horizontal scale only kicks in when load spreads across many independent accounts.
It is the scheme that ferries messages between shards through intermediate blocks. Pre-Catchain 2.0 cross-shard latency ran around 12s; after the upgrade it dropped proportionally but still costs several blocks, roughly 1.2-2s depending on load and active shard count.
MTONGA is a seven-step plan: a roughly 6x fee cut, validator/collator split, optimistic collation, an alternative Rust node client, AgenticKit, and a Layer-2 payment network plus a BTC Teleport bridge. Delivery is spread across 2026-2027.

Related