Performance improvement in modern AI inference tasks
Graphics servers with Tesla A2
All graphics servers with Tesla A2 are based on two Intel Xeon Gold 6240R CPUs with a base clock speed of 2.4 GHz and a maximum clock speed with Turbo Boost technology of 4.0 GHz.
Each processor contains two Intel® AVX-512 units and supports Intel® AVX-512 Deep Learning Boost functions. This set of instructions speeds up multiplication and addition operations with reduced accuracy, which are used in many internal cycles of the deep learning algorithm.
Each server has 512 GB of DDR4 ECC Reg 2933 MHz RAM. Local storage with a total capacity of 1920 GB is organized on Intel® solid-state drives, designed specifically for data centers.
GPU Tesla A2
The Tesla A2 graphics accelerator is optimized for inference tasks and provides up to 1.3 times greater performance for smart cities, industry and retail tasks.
Video memory capacity
Type of video memory
1 encoder, 2 decoder (+AV1 decode)
GPU performance benchmarks
Performance benchmarks results in a virtual environment for 1 Tesla A2 graphics card.
Matrix multiply example
Basic configurations with Tesla A2 16 GB
Subscribe to the availability notification
Specify the number of required flavors . When they become available, you will receive a notification by email.
You successfully subscribed on notification.
You already subscribed on notification.
You already have reached the limit of TeslaA2 flavors.