We do not have any performance data for the A10 and the A40 for the 3DMark 11 Performance GPU benchmark.
We do not have any performance per dollar data for the A10 and the A40 for the 3DMark 11 Performance GPU benchmark.
Summary
About the NVIDIA A10 GPU
The NVIDIA A10 is a workstation graphics card that launched in Q2 2021. It is built on the Ampere GPU microarchitecture (codename GA102) and is manufactured on a 8 nm process.
Memory
The A10 has 24 GB of GDDR6 memory, with a 1,563 MHz memory clock and a 384 bit interface. This gives it a memory bandwidth of 600.2 Gb/s, which affects how fast it can transfer data to and from memory. GPU memory stores temporary data that helps the GPU with complex math and graphics operations. More memory is generally better, as not having enough can cause performance bottlenecks.
Cores and Clock Speeds
The A10 includes 9,216 CUDA cores, the processing units for handling parallel computing tasks. The GPU operates at a core clock speed of 885 MHz and can dynamically boost its clock speed up to 1,695 MHz. Complementing the processing units are 288 texture mapping units (TMUs) for efficient texture filtering and 96 render output units (ROPs) for pixel processing. Additionally, the GPU features 288 tensor cores optimized for AI-accelerated workloads and 72 RT cores dedicated to real-time ray tracing calculations.
Compatibility & Power Consumption
The A10 is a compact, low-profile graphics card that fits into 1 PCIe expansion slot. NVIDIA recommends a power supply of at least 450 W to handle the GPU's thermal design power (TDP) of 150 W.
About the NVIDIA A40 GPU
The NVIDIA A40 is a workstation graphics card that launched in Q4 2020. It is built on the Ampere GPU microarchitecture (codename GA102) and is manufactured on a 8 nm process.
Memory
The A40 has 48 GB of GDDR6 memory, with a 1,812 MHz memory clock and a 384 bit interface. This gives it a memory bandwidth of 695.8 Gb/s, which affects how fast it can transfer data to and from memory. GPU memory stores temporary data that helps the GPU with complex math and graphics operations. More memory is generally better, as not having enough can cause performance bottlenecks.
Cores and Clock Speeds
The A40 includes 10,752 CUDA cores, the processing units for handling parallel computing tasks. The GPU operates at a core clock speed of 1,305 MHz and can dynamically boost its clock speed up to 1,740 MHz. Complementing the processing units are 336 texture mapping units (TMUs) for efficient texture filtering and 112 render output units (ROPs) for pixel processing. Additionally, the GPU features 336 tensor cores optimized for AI-accelerated workloads and 84 RT cores dedicated to real-time ray tracing calculations.
Compatibility & Power Consumption
The A40 occupies 2 PCIe expansion slots. It supports DisplayPort 1.4a display connections. NVIDIA recommends a power supply of at least 700 W to handle the GPU's thermal design power (TDP) of 300 W.
General Info
General overview of the GPU, including details like its manufacturer, release date, launch price, and current production status.
Info | A10 | A40 |
---|---|---|
Manufacturer | NVIDIA | NVIDIA |
Architecture | Ampere | Ampere |
Market Segment | Workstation | Workstation |
Release Date | Q2 2021 | Q4 2020 |
Production Status | Active | Active |
Shop | Check Price | Check Price |
Gaming Performance
Benchmark Performance
We do not have any performance data for the A10 and the A40 for the 3DMark 11 Performance GPU benchmark.
We do not have any performance per dollar data for the A10 and the A40 for the 3DMark 11 Performance GPU benchmark.
Relative Performance
The average score in the benchmark test can be compared to similar GPUs to assess relative performance. Generally, powerful GPUs tend to have higher scores.
GPU | Benchmark Performance | ||
---|---|---|---|
Our database does not have enough data to compare the benchmark performance with other GPUs. |
Relative Value For Money
The average performance per dollar in the benchmark test can be compared to similar GPUs to assess relative value. A higher score implies a better value for your money.
GPU | Performance Per Dollar | ||
---|---|---|---|
Our database does not have enough data to compare the benchmark performance per dollar with other GPUs. |
Benchmark Scores
This table showcases the average performance scores achieved by both GPUs across industry-standard benchmark tests. These scores provide a valuable insight into overall performance. Powerful GPUs tend to have higher scores.
- Popular
Benchmark | A10 | A40 |
---|---|---|
PassMark G3D Mark | 22,064 (+50.45%) | 14,665 |
PassMark G2D Mark | 1,006 (+60.19%) | 628 |
Technical Specs
Graphics Processor
General information about the graphics processing unit like their architecture, manufacturing process size, and transistor count. Newer GPU architectures generally bring efficiency improvements and may introduce technologies that enhance graphical capabilities.
Spec | A10 | A40 |
---|---|---|
Codename | GA102 | GA102 |
Architecture | Ampere | Ampere |
Process Size | 8 nm | 8 nm |
Transistors | 28,300 million | 28,300 million |
Memory Details
Memory specifications like their capacity, bandwidth, and clock speeds. GPU memory stores graphics data like frames, textures, and shadows which helps display rendered images. These specs are crucial for graphics-intense applications like gaming and 3D modeling.
Spec | A10 | A40 |
---|---|---|
Memory Size | 24 GB | 48 GB |
Memory Type | GDDR6 | GDDR6 |
Memory Bandwidth | 600.2 Gb/s | 695.8 Gb/s |
Memory Clock | 1,563 MHz | 1,812 MHz |
Memory Interface | 384 bit | 384 bit |
L1 Cache | 128 KB | 128 KB |
L2 Cache | 6 MB | 6 MB |
Board Compatibility
Compatibility information like their slot size, bus interface, power consumption, and display support. These specs are useful for verifying compatibility with your motherboard, power supply, and monitor.
Spec | A10 | A40 |
---|---|---|
Slots | 1 slot | 2 slots |
Bus Interface | PCIe 4.0 x16 | PCIe 4.0 x16 |
Thermal Design Power (TDP) | 150 W | 300 W |
Suggested PSU | 450 W | 700 W |
Power Connectors | 1x 8-pin | 8-pin EPS |
Outputs | No outputs | DisplayPort 1.4a |
Cores & Clock Speeds
Processing power information like its cores and clock speed. These specs impact how fast they can process graphics. Each type of core or component serves a specific computational purpose.
Spec | A10 | A40 |
---|---|---|
CUDA Cores | 9,216 | 10,752 |
Stream Multiprocessors (SM) | 72 | 84 |
Texture Mapping Units (TMU) | 288 | 336 |
Render Output Units (ROP) | 96 | 112 |
Tensor Cores | 288 | 336 |
Ray Tracing Cores | 72 | 84 |
Core Clock Speed | 885 MHz | 1,305 MHz |
Core Clock Speed (Boost) | 1,695 MHz | 1,740 MHz |
Theoretical Performance
Theoretical performance numbers derived from the raw specifications of the different components like core count and clock speeds. While these provide a glimpse into peak processing power, they do not represent real-world performance.
Spec | A10 | A40 |
---|---|---|
Pixel Fill Rate | 162.7 GPixel/s | 194.9 GPixel/s |
Texture Fill Rate | 488.2 GTexel/s | 584.6 GTexel/s |
FP32 Performance | 31.24 TFLOPS | 37.42 TFLOPS |
FP64 Performance | 976.3 GFLOPS | 584.6 GFLOPS |
API Support
Graphics API versions supported by these graphics cards. APIs evolve over time, introducing new features and functionalities. Older GPUs may not support recent versions.
Spec | A10 | A40 |
---|---|---|
DirectX | 12 Ultimate (12_2) | 12 Ultimate (12_2) |
OpenCL | 3.0 | 3.0 |
OpenGL | 4.6 | 4.6 |
Shader Model | 6.8 | 6.8 |
* Performance rating, performance per dollar, and rankings are based on the 3DMark 11 Performance GPU benchmark and MSRP.