Each graphics unit has several key theoretical parameters that affect real-world gaming, 3D graphics, and compute performance: texture fill rate, pixel fill rate, memory bandwidth, and single- and double-precision performance. Below you will find why they are important and which card has better characteristics.
Pixel fill rate (gigapixels/s)
Higher is better | The NVIDIA Tesla K10 card has more Raster Operations Pipelines (ROPs) than the NVIDIA Tesla C2075. On top of that, its graphics clock rate is higher, so its pixel fill rate is substantially higher. A better maximum pixel fill rate means that the GPU can draw more pixels on and off screen each second, increasing overall performance, unless the card is limited by something else, such as texture fill rate, memory bandwidth or CPU speed.
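The relationship between ROP count, clock rate and pixel fill rate can be sketched in a few lines of Python. This is a minimal illustration, not the method used to produce the figures on this page: the function name is hypothetical, and it assumes that each ROP outputs one pixel per clock and that the 575 MHz graphics clock belongs to the Tesla C2075 while the 745 MHz base clock belongs to the Tesla K10. Under those assumptions the results match the listed 27.6 and 47.68 gigapixels/s.

```python
def pixel_fill_rate_gpixels(rops: int, clock_mhz: int) -> float:
    """Theoretical peak pixel fill rate in gigapixels/s.

    Assumes one pixel per ROP per clock cycle (an illustrative simplification).
    """
    return rops * clock_mhz / 1000


# ROP counts and clocks taken from the specification table;
# which clock belongs to which card is an assumption.
c2075_pixels = pixel_fill_rate_gpixels(rops=48, clock_mhz=575)
k10_pixels = pixel_fill_rate_gpixels(rops=64, clock_mhz=745)

print(f"NVIDIA Tesla C2075: {c2075_pixels:.2f} gigapixels/s")
print(f"NVIDIA Tesla K10:   {k10_pixels:.2f} gigapixels/s")
```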
Texture fill rate (gigatexels/s)
Higher is better | Because the NVIDIA Tesla K10 graphics unit has many more TMUs (Texture Mapping Units) and a higher graphics frequency, its texture fill rate is considerably higher. A better texture fill rate means that the GPU can use more complex 3D effects and/or apply more textures to each textured picture element, which improves the visual appearance of games.
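To see how much of the texture fill rate advantage comes from the extra TMUs and how much from the clock, the sketch below splits the speedup into those two factors. It is illustrative only: the helper name is hypothetical, it assumes one texel per TMU per clock, and it assumes the 575 MHz graphics clock is the C2075's and the 745 MHz base clock is the K10's.

```python
def texture_fill_rate_gtexels(tmus: int, clock_mhz: int) -> float:
    """Theoretical peak texture fill rate in gigatexels/s.

    Assumes one texel per TMU per clock cycle (an illustrative simplification).
    """
    return tmus * clock_mhz / 1000


# TMU counts and clocks from the specification table (clock assignment assumed).
c2075_texels = texture_fill_rate_gtexels(tmus=56, clock_mhz=575)
k10_texels = texture_fill_rate_gtexels(tmus=256, clock_mhz=745)

# Split the overall advantage into its two contributing factors.
unit_factor = 256 / 56      # more texture units
clock_factor = 745 / 575    # higher clock
speedup = k10_texels / c2075_texels

print(f"C2075: {c2075_texels:.2f} gigatexels/s, K10: {k10_texels:.2f} gigatexels/s")
print(f"Speedup {speedup:.2f}x = {unit_factor:.2f}x (units) * {clock_factor:.2f}x (clock)")
```

Most of the gap comes from the unit count rather than the clock.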
Single Precision performance (GFLOPS)
Higher is better | Maximum Single Precision performance indicates how fast the graphics card executes programs that primarily process single-precision floating-point data. The performance is expressed in GFLOPS, or billions of floating-point operations per second. Generally, the more stream processors or CUDA cores the graphics card has, and the faster they run, the higher its Single Precision performance will be. The NVIDIA Tesla K10 GPU has the upper hand here. A higher single-precision figure means the graphics card will perform better in general computing applications. Since CUDA cores or stream processors are also used as vertex and geometry shaders for 3D image generation, higher performance is also beneficial to games.
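The "more cores, faster clock" rule can be made concrete with a short calculation. The sketch below is an assumption-laden illustration rather than an official formula: the function name is hypothetical, each CUDA core is assumed to retire one fused multiply-add per clock (counted as two floating-point operations), the C2075 is assumed to run its cores at the 1150 MHz processor clock, and the K10 at the 745 MHz base clock. With those inputs the results agree with the listed 1030.4 and 4577.28 GFLOPS.

```python
# Assumption: one fused multiply-add per core per clock, counted as 2 operations.
FLOPS_PER_CORE_PER_CLOCK = 2


def peak_sp_gflops(cuda_cores: int, shader_clock_mhz: int) -> float:
    """Theoretical peak single-precision throughput in GFLOPS."""
    return cuda_cores * FLOPS_PER_CORE_PER_CLOCK * shader_clock_mhz / 1000


# Core counts and clocks from the specification table (clock assignment assumed).
c2075_sp = peak_sp_gflops(cuda_cores=448, shader_clock_mhz=1150)
k10_sp = peak_sp_gflops(cuda_cores=3072, shader_clock_mhz=745)

print(f"NVIDIA Tesla C2075: {c2075_sp:.2f} GFLOPS")
print(f"NVIDIA Tesla K10:   {k10_sp:.2f} GFLOPS")
```

Note that the K10's cores run at a lower clock than the C2075's; its lead comes from having almost seven times as many of them.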
Double Precision performance (GFLOPS)
Higher is better | Maximum Double Precision performance is similar to Single Precision performance, except that it applies to double-precision (64-bit) floating-point operations. Since games generally do not use double-precision arithmetic, this characteristic is unimportant to game performance. The NVIDIA Tesla C2075 is faster when processing 64-bit floating-point numbers.
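The reason the C2075 wins at 64-bit work despite its much lower single-precision figure becomes clearer when the two peak numbers for each card are compared. The snippet below only divides the figures from the specification table; the dictionary and function names are hypothetical.

```python
# Peak figures copied from the specification table, in GFLOPS.
peak_gflops = {
    "NVIDIA Tesla C2075": {"single": 1030.4, "double": 515.2},
    "NVIDIA Tesla K10": {"single": 4577.28, "double": 190.72},
}


def sp_to_dp_ratio(card: str) -> float:
    """How many times slower double precision is than single precision."""
    figures = peak_gflops[card]
    return figures["single"] / figures["double"]


for card in peak_gflops:
    ratio = sp_to_dp_ratio(card)
    print(f"{card}: double precision runs at about 1/{ratio:.0f} of the single-precision rate")
```

By these figures the C2075 sustains half of its single-precision rate in double precision, while the K10 drops to roughly one twenty-fourth.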
Memory bandwidth (GB/s)
Higher is better | To speed up processing, GPUs store 3D scene data, textures and intermediate data used for image generation in on-board memory. Video memory usually has much higher bandwidth than system RAM, and more bandwidth allows the GPU to run at higher display resolutions, use larger and more detailed textures, and apply more complex 3D effects and filters. The bandwidth depends on a few components, such as memory type, speed, and memory interface width. Specifically, the higher memory bandwidth of the NVIDIA Tesla K10 is due to its higher memory clock, which outweighs its narrower memory interface; because the card carries two GK104 GPUs, the listed figure appears to be the combined bandwidth of both.
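How memory clock and interface width combine into bandwidth can be sketched as follows. This is an illustration under stated assumptions, not a vendor formula: the function name and parameters are hypothetical, GDDR5 is assumed to move four transfers per listed memory clock cycle, the K10's 256-bit width is assumed to be per GPU, and its 320 GB/s figure is assumed to be the sum over both GK104 GPUs.

```python
def memory_bandwidth_gbs(memory_clock_mhz: int, bus_width_bits: int,
                         gpus: int = 1, transfers_per_clock: int = 4) -> float:
    """Theoretical peak memory bandwidth in GB/s.

    transfers_per_clock=4 is an assumption about how the listed GDDR5
    memory clock relates to the effective data rate.
    """
    bytes_per_transfer = bus_width_bits / 8
    per_gpu_mbs = memory_clock_mhz * transfers_per_clock * bytes_per_transfer
    return per_gpu_mbs * gpus / 1000


# Clocks and widths from the specification table.
c2075_bw = memory_bandwidth_gbs(memory_clock_mhz=750, bus_width_bits=384)
# Assumption: the K10 figure aggregates two GPUs, each with a 256-bit interface.
k10_bw = memory_bandwidth_gbs(memory_clock_mhz=1250, bus_width_bits=256, gpus=2)

print(f"NVIDIA Tesla C2075: {c2075_bw:.0f} GB/s")
print(f"NVIDIA Tesla K10:   {k10_bw:.0f} GB/s")
```

Under these assumptions each K10 GPU would see 160 GB/s, which is still above the C2075's 144 GB/s.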
General information
Market segment | HPC / Server | HPC / Server
Manufacturer | NVIDIA | NVIDIA
Model | Tesla C2075 | Tesla K10
Architecture / Interface
Die name | GF100 | 2 x GK104
Architecture | Fermi | Kepler
Fabrication process | 40 nm | 28 nm
Bus interface | PCI-E 2.0 x16 | PCI-E 3.0 x16
Cores / shaders
CUDA cores | 448 | 3072
ROPs | 48 | 64
Pixel fill rate | 27.6 gigapixels/s | 47.68 gigapixels/s
Texture units | 56 | 256
Texture fill rate | 32.2 gigatexels/s | 190.72 gigatexels/s
Single Precision performance | 1030.4 GFLOPS | 4577.28 GFLOPS
Double Precision performance | 515.2 GFLOPS | 190.72 GFLOPS
Clocks / Memory
Base clock | - | 745 MHz
Graphics clock | 575 MHz | -
Processor clock | 1150 MHz | -
Memory size | 6144 MB | 8192 MB
Memory type | GDDR5 | GDDR5
Memory clock | 750 MHz | 1250 MHz
Memory interface width | 384-bit | 256-bit
Memory bandwidth | 144 GB/s | 320 GB/s
Other features
Maximum power | 247 W | 250 W
Reposted from: http://jtiqb.baihongyu.com/