NVIDIA RTX 4000 Ada Generation Laptop GPU vs NVIDIA Quadro M5500 vs NVIDIA Quadro M520
NVIDIA RTX 4000 Ada Generation Laptop GPU
► remove from comparison![NVIDIA NVIDIA RTX 4000 Ada Generation Laptop GPU](fileadmin/_processed_/5/a/csm_ada_lovelace_architeture_slide_nvidia_6f95451889.jpg)
The Nvidia RTX 4000 Ada Generation, not to be confused with the A4000, P4000 or RTX 4000 Turing Generation, is a very powerful professional graphics card for use in laptops that sports 7,424 CUDA cores and 12 GB of ECC GDDR6 VRAM. Brought into existence in 2023, this graphics adapter leverages TSMC's 5 nm process and Nvidia's Ada Lovelace architecture to achieve great performance combined with moderate power consumption. The Nvidia-recommended TGP range for the card is very wide at 60 W to 175 W leading to bizarre performance differences between different systems powered by what is supposed to be the same product.
Hardware-wise, the RTX 4000 is a GeForce RTX 4080 Laptop in disguise. Consequently, both make use of the AD103 chip and have little difficulty running triple-A games at QHD 1440p.
Quadro series graphics cards ship with much different BIOS and drivers than GeForce cards and are targeted at professional users rather than gamers. Commercial product design, large-scale calculations, simulation, data mining, 24 x 7 operation - if any of this sounds familiar, then a Quadro card will make you happy.
Architecture and Features
Ada Lovelace brings a range of improvements over older graphics cards utilizing the outgoing Ampere architecture. It's not just a better manufacturing process and a higher number of CUDA cores that we have here (up to 16,384 versus 10,752); under-the-hood refinements are plentiful, including an immensely larger L2 cache and an optimized ray tracing routine (a different way to determine what is transparent and what isn't is used) and other changes. Naturally, these graphics cards can both encode and decode some of the most widely used video codecs, AVC, HEVC and AV1 included; they also support a host of Nvidia technologies, including Optimus and DLSS 3, and they can certainly be used for various AI tasks.
The RTX 4000 features 58 RT cores of the 3rd generation, 232 Tensor cores of the 4th generation and 7,424 CUDA cores, making it a lot faster than the RTX 3500 Ada Generation. Elsewhere, the graphics card comes with 12 GB of 192-bit wide ECC GDDR6 memory for a very healthy throughput of ~432 GB/s. Error correction can be turned off if desired. The fact that error correction is present here proves that the RTX 4000 is indeed targeted at professional users.
Just like Ampere-based cards, the RTX 4000 makes use of the PCI-Express 4 protocol. 8K SUHD monitors are supported, however DP 1.4a video outputs can potentially prove to be a bottleneck down the line.
Performance
While we have not tested a single system featuring an RTX 4000 Ada Generation as of February 2024, we have plenty of performance data for the GeForce RTX 4080 Laptop, a graphics card with very similar specifications. Based on that, we fully expect the RTX 4000 to deliver:
- a Blender 3.3 Classroom CUDA score of around 23 seconds
- a 3DMark 11 GPU score of around 57,000
- at least 115 fps in GTA V (1440p - Highest settings possible, 16x AF, 4x MSAA, FXAA)
- close to 50 fps in Cyberpunk 2077 (1440p - High settings, Ultra RT, "Quality" DLSS)
Nvidia's marketing materials mention "up to 33.5 TFLOPS" of performance which is impressive. The RTX 5000 Ada Generation delivers a little over 42 TFLOPS, for reference.
Your mileage may vary depending on how competent the cooling solution of your laptop is and how high the TGP power target of the RTX 4000 is. One other thing worth mentioning is that enabling error correction appears to reduce the amount of video memory that is available to applications and games by up to a gigabyte.
Power consumption
Nvidia no longer divides its laptop graphics cards into Max-Q and non-max-Q models. Instead, laptop makers are free to set the TGP according to their needs, and the range can sometimes be shockingly wide. This is the case with the RTX 4000, as the lowest value recommended for it sits at just 60 W while the highest is more than two times higher at 175 W (this most likely includes Dynamic Boost). The slowest system built around an RTX 4000 Ada can easily be 60% slower than the fastest one. This is the kind of delta that we've already seen on consumer-grade laptops featuring the latest GeForce RTX cards.
Last but not the least, the improved 5 nm process (TSMC 4N) the RTX 4000 Ada is built with makes for very decent energy efficiency, as of mid 2023.
NVIDIA Quadro M5500
► remove from comparison![NVIDIA Quadro M5500](typo3temp/_processed_/a/d/csm_NVIDIA_QuadroM5500-rs_22db67be25.jpg)
The Nvidia Quadro M5500 is a high-end, DirectX 12 (FL_12_1) and OpenGL 4.5-compatible graphics card for mobile workstations. It is a Maxwell-based GPU built on the GM204 chip with all 2048 shader cores activated and is manufactured in 28 nm at TSMC. Therefore, the GPU is similar to the consumer Nvidia GeForce GTX 980 (Notebook). Compared to the slower and less power hungry Quadro M5000M, the M5500 offers 512 more shaders which should result in 30-40% more performance according to Nvidia.
The Quadro series offers certified drivers that are optimized for stability and performance in professional applications like CAD or DCC. OpenGL performance, for example, should be significantly better compared to GeForce graphics cards of similar specifications.
Using CUDA (Compute Capability 5.2) or OpenCL 1.2, the cores of the Quadro M5000M can be used for general calculations.
The power consumption of the Quadro M5000M is rated at the same TGP of 150 Watt. Therefore, the card is suited for very large notebooks with 17-inch displays or greater.
The first laptop with the M5500 in it is the MSI WT72 Workstation.
NVIDIA Quadro M520
► remove from comparisonThe Nvidia Quadro M520 is a DirectX 12 (FL 11_0) and OpenGL 4.5-compatible graphics card for mobile workstations. It is a Maxwell-based GPU built on the GM108 chip with 384 shader cores and manufactured in 28 nm by TSMC. That means its similar to the Quadro M500M (DDR3) but uses faster GDDR5 memory. The GeForce 940MX with GDDR5 memory is the consumer variant of the Quadro M520.
The Quadro series offers certified drivers that are optimized for stability and performance in professional applications like CAD or DCC. OpenGL performance, for example, should be significantly better than with GeForce graphics cards of similar specifications.
Architecture
Compared to Kepler, Maxwell has been optimized in several details to increase power efficiency. Smaller Streaming Multiprocessors (SMM) with only 128 ALUs (Kepler: 192) and an optimized scheduler should lead to better utilization of the shaders. Nvidia promises that a Maxwell SMM with 128 ALUs can offer 90% of the performance of a Kepler SMX with 192 ALUs. GM108 features 3 SMMs and thus 384 shader cores, 24 TMUs and 8 ROPs (64-bit interface).
Another optimization is the massively enlarged L2 cache. The larger size can reduce some of the memory traffic to allow for a relatively narrow memory interface without significantly hurting performance.
GM108 supports DirectX 11.2 (feature level 11.0 only) as well as DirectX 12.
Performance
Currently the clock speeds are not known of the M520 variant. They should be similar to the consumer 940 MX with GDDR5. Therefore, the performance should be similar to the Quadro K620M.
Features
The feature set should include support for up to four active displays. High-resolution monitors of up to 3840x2160 pixels can be connected using DisplayPort 1.2 or HDMI 1.4a (HDMI 2.0 not supported). HD-Audio codecs, such as Dolby TrueHD and DTS-HD, can be transmitted via bitstream mode through the HDMI port. However, as most laptops will feature Optimus, the integrated GPU will likely have direct control over the display ports and may limit the feature set available by the Nvidia Kepler cards.
Using CUDA or OpenCL, the cores of the Quadro M520 can be used for general calculations.
GM108 integrates the sixth generation of the PureVideo HD video engine (VP6), offering a better decoding performance for H.264 and MPEG-2 videos. Of course, VP6 supports all features of previous generations (4K support, PIP, video encoding via NVENC API).
Power Consumption
The power consumption of the Quadro M520 is rated at 25 Watt TGP (max power consumption including memory and board). That is slightly below the M500M that is rated at 30 Watt. Therefore, the GPU is best suited for laptops 13 - 14-inches in size and above. The M500M also supports Optimus to automatically switch between an integrated graphics card and the Nvidia GPU.
NVIDIA RTX 4000 Ada Generation Laptop GPU | NVIDIA Quadro M5500 | NVIDIA Quadro M520 | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Quadro M Series |
|
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Architecture | Ada Lovelace | Maxwell | Maxwell | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Pipelines | 7424 - unified | 2048 - unified | 384 - unified | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Raytracing Cores | 58 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Tensor / AI Cores | 232 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Theoretical Performance | 33.6 TFLOPS FP32 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Memory Speed | 16000 effective = 2000 MHz | 6606 MHz | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Memory Bus Width | 192 Bit | 256 Bit | 64 Bit | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Memory Type | GDDR6 | GDDR5 | GDDR5 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Max. Amount of Memory | 12 GB | 8 GB | 1024 MB | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Shared Memory | no | no | no | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Memory Bandwidth | 432 GB/s | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
API | DirectX 12 Ultimate, Shader 6.7, OpenGL 4.6, OpenCL 3.0, Vulkan 1.3 | DirectX 12_1, OpenGL 4.5 | DirectX 12 (FL 11_0), Shader 5.0, OpenGL 4.5 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Power Consumption | 150 Watt (60 - 150 Watt TGP) | 150 Watt | 25 Watt | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
technology | 5 nm | 28 nm | 28 nm | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
PCIe | 4.0 x16 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Displays | 4 Displays (max.), HDMI 2.1, DisplayPort 1.4a | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Notebook Size | large | large | medium sized | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Date of Announcement | 21.03.2023 | 04.04.2016 | 13.01.2017 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Link to Manufacturer Page | images.nvidia.com | www.nvidia.de | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Predecessor | RTX A3000 Laptop GPU | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Core Speed | 1139 - 1140 (Boost) MHz | 756 - 1019 (Boost) MHz | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Transistors | 5.2 Billion | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Features | CUDA, 3D Vision, PhysX, GeForce Experience, Surround, GameStream, GPU Boost 2.0, Adaptive Vertical Sync, G-SYNC, SLI | Optimus, PhysX, 3D Vision Pro, nView, Optimus |