GeForce GTX 580 Specifications

The GeForce GTX 580 1.5GB uses all 16 SMs

The headline specification is undoubtedly the enabling of the 16th and final SM (Streaming Multiprocessor, or ‘stream processor cluster’ in neutral terminology) of the GF100 Fermi design.

However, the new GF110 design still uses the original GF100 layout of 32 stream processors per SM, rather than the 48 per SM of the GeForce GTX 460 GPU.

Even Nvidia regards the 32-per-SM layout as the less efficient design, but we suspect that four GPCs (Graphics Processing Clusters) of four SMs each would not fit onto a die small enough to manufacture if each SM contained 48 stream processors rather than 32. In the end, brute force wins out.

Perhaps this layout will change with TSMC’s 28nm process, but that’s not due until halfway through 2011, with GPUs based on this process (from ATI and Nvidia) pencilled in for the autumn of that year.
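The layout arithmetic above is easy to check for yourself. The snippet below is a minimal sketch rather than anything from Nvidia: the function and variable names are our own, and the 16-SM, 48-per-SM chip is purely hypothetical, included only to show how much larger such a design would be.

```python
# Stream processor totals implied by the SM counts quoted in this article.
SPS_PER_SM_GF100 = 32   # GF100 / GF110 layout
SPS_PER_SM_GTX460 = 48  # layout used by the GeForce GTX 460 GPU


def stream_processors(active_sms: int, sps_per_sm: int) -> int:
    """Total stream processors for a given number of enabled SMs."""
    return active_sms * sps_per_sm


# Enabled SMs per card, as listed in the specification table.
enabled_sms = {"GTX 580": 16, "GTX 480": 15, "GTX 470": 14}

totals = {
    card: stream_processors(sms, SPS_PER_SM_GF100)
    for card, sms in enabled_sms.items()
}
for card, total in totals.items():
    print(f"{card}: {total} stream processors")

# Hypothetical only: four GPCs of four SMs each, with 48 SPs per SM.
hypothetical_total = stream_processors(16, SPS_PER_SM_GTX460)
print(f"Hypothetical 16 x 48 design: {hypothetical_total} stream processors")
```

On these assumptions, the hypothetical chip would need half as many stream processors again as the GTX 580, which gives some sense of why the die size would balloon.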

As well as the extra resources and the increased high-precision FP16 capabilities (which Nvidia claims are worth a 4-12 per cent performance increase), the GeForce GTX 580 1.5GB operates at higher frequencies than the GeForce GTX 480 1.5GB. While the GPU core of the latter runs at 700MHz (meaning that its 480 stream processors operate at 1.4GHz), the GPU core of the GeForce GTX 580 1.5GB runs at 772MHz, with its 512 stream processors clocked at 1.544GHz.
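The clock figures above follow a simple pattern: the stream processors run at twice the core frequency. The sketch below works through that relationship and the resulting uplift. The names are ours, and the 'stream processors x shader clock' product is only a crude proxy for peak shader throughput, not a performance prediction.

```python
SHADER_CLOCK_MULTIPLIER = 2  # implied by the quoted 700MHz/1.4GHz and 772MHz/1.544GHz pairs


def shader_clock_mhz(core_mhz: int) -> int:
    """Stream processor clock derived from the GPU core clock."""
    return core_mhz * SHADER_CLOCK_MULTIPLIER


def percent_gain(new: float, old: float) -> float:
    """Relative increase of new over old, in per cent."""
    return (new / old - 1) * 100


gtx580 = {"core_mhz": 772, "stream_processors": 512}
gtx480 = {"core_mhz": 700, "stream_processors": 480}

clock_gain = percent_gain(gtx580["core_mhz"], gtx480["core_mhz"])

# Crude proxy: stream processors multiplied by shader clock.
proxy_580 = gtx580["stream_processors"] * shader_clock_mhz(gtx580["core_mhz"])
proxy_480 = gtx480["stream_processors"] * shader_clock_mhz(gtx480["core_mhz"])
throughput_gain = percent_gain(proxy_580, proxy_480)

print(f"GTX 580 shader clock: {shader_clock_mhz(gtx580['core_mhz'])}MHz")
print(f"Core clock gain: {clock_gain:.1f} per cent")
print(f"Shader throughput proxy gain: {throughput_gain:.1f} per cent")
```

By this rough measure, the clock bump accounts for about 10 per cent and the extra SM lifts the combined figure to a little under 18 per cent, before any of Nvidia's claimed efficiency gains.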

The 1.5GB of GDDR5 memory also runs faster, with an effective frequency of 4.008GHz rather than 3.7GHz. While this gives the GTX 580 1.5GB more memory bandwidth, the memory subsystem is otherwise unchanged, with the same 384-bit memory interface and 48 ROPs. The number of texture units rises from 60 to 64 because each SM of the GF100 design contains four texture units, so unlocking the 16th SM unlocks four more.
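The bandwidth and texture unit figures can be reproduced with a few lines of arithmetic. This is a sketch under stated assumptions: GDDR5 moves four transfers per memory clock, bandwidth is quoted in decimal gigabytes, and the 1,002MHz memory clock used for the GTX 580 is our assumption, chosen because it reproduces the 192.4GB/sec listed in the table. The function names are our own.

```python
GDDR5_TRANSFERS_PER_CLOCK = 4  # effective rate is four times the memory clock
TEXTURE_UNITS_PER_SM = 4       # per SM in the GF100 design


def bandwidth_gb_per_sec(memory_mhz: float, bus_width_bits: int) -> float:
    """Peak memory bandwidth in decimal GB/sec."""
    effective_mhz = memory_mhz * GDDR5_TRANSFERS_PER_CLOCK
    bytes_per_transfer = bus_width_bits / 8
    return effective_mhz * bytes_per_transfer / 1000  # MB/sec to GB/sec


def texture_units(active_sms: int) -> int:
    """Texture units available for a given number of enabled SMs."""
    return active_sms * TEXTURE_UNITS_PER_SM


# Memory clocks in MHz; the GTX 580 value is an assumption (see above).
gtx580_bandwidth = bandwidth_gb_per_sec(1002, 384)
gtx480_bandwidth = bandwidth_gb_per_sec(924, 384)

print(f"GTX 580: {gtx580_bandwidth:.1f}GB/sec, {texture_units(16)} texture units")
print(f"GTX 480: {gtx480_bandwidth:.1f}GB/sec, {texture_units(15)} texture units")
```

Because the interface width is identical on both cards, the whole bandwidth advantage comes from the faster memory clock.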

| | Nvidia GeForce GTX 580 1.5GB | Nvidia GeForce GTX 480 1.5GB | Nvidia GeForce GTX 470 1,280MB | ATI Radeon HD 5870 1GB | ATI Radeon HD 6870 1GB | ATI Radeon HD 5970 2GB |
|---|---|---|---|---|---|---|
| Codename | GF110 | GF100 | GF100 | Cypress XT | Barts XT | Hemlock XT |
| Stream Processors | 512 (1,544MHz) | 480 (1,400MHz) | 448 (1,215MHz) | 1,600 (850MHz) | 1,120 (900MHz) | 2 x 1,600 (725MHz) |
| Layout | 16 SMs, 4 GPCs | 15 SMs, 4 GPCs | 14 SMs, 4 GPCs | 20 SIMD engines | 14 SIMD engines | 2 x 20 SIMD engines |
| Rasterisers | 4 | 4 | 4 | 2 | 2 | 2 x 2 |
| Tessellation Units | 16 | 15 | 14 | 1 | 1 | 2 x 1 |
| Texture Units | 64 | 60 | 56 | 80 | 56 | 2 x 80 |
| ROPs | 48 | 48 | 40 | 32 | 32 | 2 x 32 |
| Transistors | 3 billion | 3 billion | 3 billion | 2.15 billion | 1.7 billion | 2 x 2.15 billion |
| Die Size | 530mm² | 530mm² | 530mm² | 334mm² | 255mm² | 2 x 334mm² |
| Memory Frequency | 1,002MHz (4.008GHz effective) | 924MHz (3.7GHz effective) | 837MHz (3.3GHz effective) | 1,050MHz (4.2GHz effective) | 1,050MHz (4.2GHz effective) | 1GHz (4GHz effective) |
| Memory Interface | 384-bit | 384-bit | 320-bit | 256-bit | 256-bit | 2 x 256-bit |
| Memory Bandwidth | 192.4GB/sec | 177GB/sec | 134GB/sec | 134.4GB/sec | 134.4GB/sec | 2 x 128GB/sec |
| Card Specifications | | | | | | |
| Power Connectors | 1 x 6-pin, 1 x 8-pin PCI-E | 1 x 6-pin, 1 x 8-pin PCI-E | 2 x 6-pin PCI-E | 2 x 6-pin PCI-E | 2 x 6-pin PCI-E | 1 x 6-pin, 1 x 8-pin PCI-E |
| Maximum Power Draw | 244W | 250W | 215W | 188W | 151W | 294W |
| Idle Power Draw | Unspecified | Unspecified | Unspecified | 27W | 19W | Unspecified |
| Recommended PSU | 600W | 600W | 550W | 500W | Unspecified | Unspecified |
| Typical Street Price | £400 | £330 | £200 | £320 | £220 | £490 |

