AMD details how it built a product line-up with just two RDNA 4 dies — Flexible design and asymmetric harvesting enables production of multiple models without new silicon

Graphics processing units (GPUs) are designed in a way that their performance can be scaled up or down during the design phase, or even after tape-out. AMD’s 9000-series, built on RDNA 4 (also known as Navi 4), is no exception. AMD confirmed this during one of its Hot Chips 2025 presentations, and demonstrated how it can cut down the design of its GPUs to produce more SKUs.

Building a product family using two GPU designs

AMD

(Image credit: AMD)

By using this strategy, AMD spawned a smaller Navi 44 (Radeon RX 9060-series) out of the bigger Navi 48 (Radeon RX 9070-series) design by reducing the number of shader engines (SEs), Infinity cache, GDDR6 controllers, and PHYs, but leaving things like the command processor, display engines, media engines, security processor, and other specific things intact. By reusing photomasks from Navi 48, AMD saved on manufacturing costs. In addition to this, AMD built the Radeon RX 9070 and RX 9070 GRE from the full-fat Radeon RX 9070 XT by disabling certain elements, which essentially increased yields and enabled them to hit pricing targets. Such an approach also shortened the relevant GPU’s time-to-market, as fewer unique silicon designs required tape out, validation, and production.

Asymmetric harvesting

The most important element of this strategy is the way a Shader Engine (SE) can be harvested. An SE is a fundamental building block of the GPU, housing multiple Work Group Processors (WGPs), Compute Units (CUs), and fixed-function stages for geometry, rasterization, and rendering. On RDNA 4-based products, AMD allows entire shader engines to be disabled when defects are present or when a lower performance target is desired. In addition to this, AMD may disable specific WGPs, which provides a lot of additional flexibility.

AMD

(Image credit: AMD)

Memory harvesting adds another dimension of flexibility. The RDNA 4 memory subsystem contains multiple GDDR6 controllers, linked through Infinity Fabric and cache structures. Each memory controller can be fused off individually, meaning that AMD can reduce the effective bus width in increments of 64 bits.

Leave a Reply

Your email address will not be published. Required fields are marked *