Radeon RX 7900 XTX & XT in the test: AMD RDNA 3 against Nvidia GeForce RTX 4000 “Ada Lovelace”

Shortly before Christmas, AMD's new graphics cards based on RDNA 3 are entering the market. ComputerBase tested the new flagship model Radeon RX 7900 XTX and the Radeon RX 7900 XT, which, among other things, compete against Nvidia's GeForce RTX 4080 and want to score with prices of 1,149 euros and 1,050 euros respectively.

Table of contents < ol class="toc__items toggle-body-container js-toc-items" id="toc">

1 AMD RDNA 3 vs. Nvidia GeForce RTX 4000 “Ada Lovelace”

RX 7900 XTX and RX 7900 XT for Christmas
A few words about the technology around Navi 31 and RDNA 3

2 The reference design of the RX 7900 XT(X) in detail< ol>

“The reference” for 1,149 and 1,050 euros

Test system and test methodology

3 clock rates, benchmarks (WQHD, UHD & 5K ) with and without RT

The average clock rates under load
Benchmarks with and without RT in 3,840 × 2,160
Benchmarks with AMD FSR 2.x and Nvidia DLSS 2.x including ray tracing
Benchmarks with and without RT in 2,560 × 1,440
Benchmarks with and without RT in 5,120 × 2,880

4 Performance in the latest games and app benchmarks

RDNA 3 in current new releases
Benchmarks in applications

5 Volume, temperature, power consumption

Volume &amp ; Cooling
Power consumption: games, YouTube, desktop
Energy efficiency in FPS per watt

6 Technology comparison RDNA 3 vs. RDNA 2, OC and UV< ol>

RDNA 2 vs. RDNA 3: How much more power does the new technology bring?

Overclocking (via higher power target)

Undervoltage

7 Price, availability and conclusion

Price (RRP) and availability
Conclusion

RX 7900 XTX and RX 7900 XT for Christmas

AMD's next-gen graphics cards will as announced, hit the retailer shelves before Christmas. As of today, the new flagship model Radeon RX 7900 XTX and the smaller Radeon RX 7900 XT can be tested, from December 13th both the reference design and the first (possibly only a few) custom designs will be available in stores.

And so Nvidia's Ada Lovelace generation now has competition. Probably not the GeForce RTX 4090 (test), which in many respects will remain far unrivaled for AMD's new Radeon RX 7000 series. Instead, the Navi 31 GPU duels with the GeForce RTX 4080 (test), which AMD says it wants to beat in the game grid performance.

AMD wants to score with a lower price

Apart from that, AMD also wants to score in terms of costs, because Nvidia has left a huge barn door open with the previous GeForce RTX 4000 cards. Finally, the GeForce RTX 4080 is undoubtedly very expensive with an RRP of 1,399 euros. The RDNA-3 offshoots will not be a bargain either, but they are cheaper. AMD wants 1,149 euros for the Radeon RX 7900 XTX with 24 GB of memory, the Radeon RX 7900 XT with 20 GB will start from 1,050 euros. Prices are for reference models, most custom designs will cost accordingly more.

RX 7900 XTX & RX 7900 XT vs RTX 4080 – Custom reviews coming very soon

On the following pages, ComputerBase will now test the entire package of the Radeon RX 7900 XTX and Radeon RX 7900 XT and clarify the most important questions. For example, whether ray tracing and thus the vulnerability of RDNA 2 has improved. And what about the energy efficiency of the new graphics cards. Away from the normal test course, the editors also take a look at the latest games such as A Plague Tale: Requiem, The Callisto Protocol and Spider-Man: Miles Morales. Overclocking and undervolting will also play a role. The test will show whether the Radeon RX 7900 XTX will finally be able to beat the Nvidia GeForce RTX 4080.

The article will only deal with AMD's reference design, which will also be available from the board partners apart from AMD itself. Tests of the real custom designs are not yet allowed, but will follow shortly – the first series of tests have already been completed.

First offers from Tuesday 13th December, 3:00 p.m.

Radeon RX 7900 XT(X): Reference at AMD
Radeon RX 7900 XT(X): reference and custom designs at Alternate*
Radeon RX 7900 XT(X): reference and custom designs at Caseking*
Radeon RX 7900 XT (X): Reference and custom designs at Mindfactory*
Radeon RX 7900 XT(X): Reference and custom designs at NBB.de*

< h2 class="text-width text-h2" id="abschnitt_ein_paar_worte_zu_technik_rund_um_navi_31_und_rdna_3">A few words about the technology around Navi 31 and RDNA 3

At this point, due to time constraints, the technology of RDNA 3 should not go into great detail, instead only the most interesting innovations are summarized. Navi 31 is the first chiplet design for GPUs that combines a 300 mm “Graphics Compute Die” (GCD) in the N5 process at TSMC with six “Memory Caches” manufactured in the N6 process and measuring a total of 220 mm² Dies” (MCD) combined in one package. According to AMD, this should bring advantages in terms of costs, but also has disadvantages in terms of pure performance.

GPU chiplets have advantages and disadvantages

But how are the chiplets actually connected to each other? The “Infinity Fabric” used in the CPUs is not suitable for GPUs, since according to AMD the bandwidth requirement for graphics cards is more than 10 times higher per MCD than for the CPUs per CCD. To make this possible, AMD has developed a new connection called “Infinity Fanout Links” that delivers a total maximum bandwidth of 5.3 TB per second.

However, AMD honestly admits that the chiplet process for CPUs brings cost advantages, but at the same time disadvantages in terms of performance. With the same clock, Navi 21 has a 5 to 10 percent worse latency when accessing DRAM compared to Navi 31. In addition, the latency to the “Infinity Fabric” increases by a comparable value. AMD wants to compensate for this with higher clock rates or even turn it into an advantage, but that doesn't change the fact that a monolithic Navi 31 would still be faster with the same clock rates. This is the disadvantage that GPU chiplets currently have.

Dual issue and more for more power per CU

The GCD consists of a total of 96 compute units in Navi 31 when fully expanded, which is only a small increase compared to the predecessor Navi 21. In order to get a performance increase of more than 20 percent, AMD has now designed the FP32 units as “dual issue”. So you can do two calculations at the same time. Theoretically, this doubles the computing power, with AMD not for nothing refraining from mentioning 12,288 FP32-ALUs (96 CUs × 64 FP32-ALUs × 2) and instead speaking of 6,144 FP32-ALUs.

That is certainly more honest, because AMD has taken the most resource-saving way possible to save transistors. After all, the driver compiler can only combine certain commands, which can then be calculated more quickly. If this is not possible, however, only 6,144 FP32-ALUs are used. Now it's the task of the driver compiler to make as many commands as possible suitable for dual issues – and that's why AMD also says that in the future more programs will be executed twice as fast as they are currently. This means that the driver team also has more work to do with RDNA 3 than with RDNA 2. Nvidia has already taken a very similar path with Ampere with double FP32 units, which goes a little further than AMD's – but probably also costs more transistors.

In order to get more performance out of the arithmetic units, AMD has further improved the compute units, although the actual structure has remained the same. The cache sizes have increased significantly, the L2 cache is now 50 percent larger at 6 MB, the L1 cache is 300 percent larger at a total of 3 MB and the L0 cache at 3 MB has also grown by 240 percent. Furthermore, the vector registers have been enlarged and accelerated, so that AMD speaks of an average of 17.4 percent more performance per CU than with RDNA 2 at the same clock rate with RDNA 3.

Raytracing makes a big leap, but…

In addition, RDNA 3 introduces the second generation of ray tracing units, which is still structured in the same way as RDNA 2 and is therefore also at home in the texture units – but it should be significantly faster. The RT units of RDNA 2 should no longer have to track every RT ray, but can also cancel those that are no longer needed and, especially in complex scenarios, the “ray tracking” should now work much faster. In general, each individual beam should be able to be guided to the target faster than with RDNA 2. What RDNA 3 still cannot do, however, unlike Nvidia GPUs, is accelerated creation of the BVH structure – this is still done by the FP32 units – and unlike Lovelace, shaders cannot be reordered for optimized ray tracing.

With RDNA 3, AMD speaks of up to 80 percent better RT performance at high RT loads. However, there is one peculiarity to consider here. Like the predecessor, RDNA 3 has one RT unit per compute unit. However, since the number of compute units only increases by 20 percent and the new dual-issue ALUs are no help with ray tracing when the special units limit, the RT performance is up to 80 percent more powerful, at least in theory no disproportionate jump per RT unit. This would have required more RT units per CU. This increases rasterizer and RT performance comparably.