All of GPU servers are based on Intel® Xeon® Scalable second and third generations central processors and contain up to 96 virtual processors and up to 8192 GB of DDR4 ECC Reg 3200 MHz RAM.
Each processor contains two Intel® AVX-512 units and supports Intel® AVX-512 Deep Learning Boost functions. This set of instructions speeds up multiplication and addition operations with reduced accuracy, which are used in many internal cycles of the deep learning algorithm.
Local storage is organized on Intel® and Samsung solid-state drives that are designed specifically for data centers and have a capacity of up to 7,68 TB.
100% performance
Each physical core or GPU adapter assigned only to a single client. It means that:
Available vCPU time is 100%
Physical pass through of GPU inside a VM
Less storage and network load on hypervisors, more storage and network performance for a client.
Up to 75 000 IOPS1 for the RANDOM READ and up to 20 000 IOPS for the RANDOM WRITE for the Virtual Machines with local SSDs.
Up to 22 500 IOPS1 for the RANDOM READ and up to 20 000 IOPS for the RANDOM WRITE for the Virtual Machines with block storage Volumes.
You can be sure that Virtual Machines are not sharing vCPU or GPU among each other.
IOPS — Input/Output Operations Per Second.
GPU RTX 4090
Ada Lovelace architecture, based on a new 5 nm process technology, provides a huge leap in performance, efficiency and graphics. Each accelerator has 16384 CUDA cores, and 24 GB of GDDR6X memory.
Basic configurations with RTX 4090 24 GB
Prices:
Subscribe to the availability notification
Specify the number of required flavors . When they become available, you will receive a notification by email.
OK
Cancel
You successfully subscribed on notification.
You already subscribed on notification.
You already have reached the limit of RTX4090 flavors.
RTX 3090 graphics cards are based on the powerful Ampere architecture and a improved RTX hardware ray tracing platform. Each accelerator has 328 tensor cores, 10496 CUDA cores, and 24 GB of memory.
Basic configurations with RTX™ 3090 24 GB
Prices:
Subscribe to the availability notification
Specify the number of required flavors . When they become available, you will receive a notification by email.
OK
Cancel
You successfully subscribed on notification.
You already subscribed on notification.
You already have reached the limit of RTX3090 flavors.
RTX 3080 graphics cards are based on the powerful Ampere architecture and a improved RTX hardware ray tracing platform. Each accelerator has 272 tensor cores, 8704 CUDA cores, and 10 GB of memory.
Basic configurations with RTX™ 3080 10 GB LHR
Prices:
Subscribe to the availability notification
Specify the number of required flavors . When they become available, you will receive a notification by email.
OK
Cancel
You successfully subscribed on notification.
You already subscribed on notification.
You already have reached the limit of RTX3080 flavors.
Tesla A100 GPU provides unsurpassed acceleration for AI tasks, data analysis and for solving the most complex computing tasks. The A100 is the most productive integrated platform for AI and HPC, allowing you to get real-time results and deploy scalable solutions.
Basic configurations with Tesla A100 80 GB
Prices:
Subscribe to the availability notification
Specify the number of required flavors . When they become available, you will receive a notification by email.
OK
Cancel
You successfully subscribed on notification.
You already subscribed on notification.
You already have reached the limit of TeslaA100 flavors.
The RTX A5000 graphics accelerator has the perfect balance of power, performance and reliability to solve complex tasks. This GPU is built on the basis of the latest Ampere architecture and has 24 GB of video memory — everything so that designers, engineers and artists can implement the projects they dreamed of today.
Thanks to the new CUDA cores, which provide up to 2.5 times FP32 performance compared to the previous generation, work with graphics is accelerated.
Higher rendering accuracy is provided by hardware acceleration of motion blur and higher ray tracing performance.
In flavors with an even number of GPUs, graphics adapters are combined using NVLink, which allows you to increase the amount of memory and improve performance for performing complex visual calculations.
Basic configurations with RTX A5000 24 GB
Prices:
Subscribe to the availability notification
Specify the number of required flavors . When they become available, you will receive a notification by email.
OK
Cancel
You successfully subscribed on notification.
You already subscribed on notification.
You already have reached the limit of RTXA5000 flavors.
Tesla A10 graphics accelerators with tensor cores are built on the Ampere architecture.
Thanks to CUDA cores, the number of single-precision floating-point operations (FP32) has been increased by 2 times. This allows you to significantly speed up work with graphics and video, as well as modeling complex 3D models in computer-aided design (CAD) software.
The second generation of RT cores simultaneously provides ray tracing and shading or noise reduction. This allows you to speed up the tasks of photorealistic rendering of film materials, evaluating architectural projects and rendering motion, allowing you to create a more accurate image faster.
Support for Tensor Float 32 (TF32) operations allows you to speed up the training of models for artificial intelligence (AI) and data processing by 5 times compared to the previous generation without changes in the code. Tensor cores also provide AI-based technologies such as DLSS, noise reduction, and photo and video editing functions in some applications.
PCI Express Gen 4 doubles the bandwidth of PCIe Gen 3, speeding up data transfer from processor memory for resource-intensive tasks such as AI, data processing and working with 3D graphics.
Thanks to the ultra-fast GDDR6 memory, scientists, engineers and data science specialists get the necessary resources for processing large data sets and modeling.
Basic configurations with Tesla A10 24 GB
Prices:
Subscribe to the availability notification
Specify the number of required flavors . When they become available, you will receive a notification by email.
OK
Cancel
You successfully subscribed on notification.
You already subscribed on notification.
You already have reached the limit of TeslaA10 flavors.
RTX 2080 Ti graphics cards are based on the powerful Turing architecture and a completely new RTX hardware ray tracing platform. Each accelerator has 544 tensor cores, 4352 CUDA cores, and 11 GB of memory.
Basic configurations with RTX™ 2080 Ti 11 GB
Prices:
Subscribe to the availability notification
Specify the number of required flavors . When they become available, you will receive a notification by email.
OK
Cancel
You successfully subscribed on notification.
You already subscribed on notification.
You already have reached the limit of RTX2080TI flavors.
The Tesla A2 graphics accelerator is optimized for inference tasks and provides up to 1.3 times greater performance for smart cities, industry and retail tasks.
Basic configurations with Tesla A2 16 GB
Prices:
Subscribe to the availability notification
Specify the number of required flavors . When they become available, you will receive a notification by email.
OK
Cancel
You successfully subscribed on notification.
You already subscribed on notification.
You already have reached the limit of TeslaA2 flavors.
Tesla® T4 with tensor and RT cores is the one of most advanced and energy-efficient graphics accelerator for deep learning of neural networks and inference, video transcoding, streaming, and remote desktops.
Each accelerator has 320 tensor cores, 2560 CUDA cores, and 16 GB of memory.
T4 graphics accelerators are ideal for operating neural network models in a production environment (inferencing), speech processing, and NLP.
In addition to tensor cores, T4 has RT cores that perform hardware ray tracing (retrace).
Basic configurations with Tesla® T4 16 GB
Prices:
Subscribe to the availability notification
Specify the number of required flavors . When they become available, you will receive a notification by email.
OK
Cancel
You successfully subscribed on notification.
You already subscribed on notification.
You already have reached the limit of TeslaT4 flavors.
Please write us about the required configuration in the chat.
Answers to frequently asked questions
What is the minimum rental period for a virtual GPU-server?
You can rent a virtual server for any period. Make a payment for any amount from 1.1 $ and work within the prepaid balance. When the work is completed, delete the server to stop spending money.
How quickly can I get started with a virtual GPU-server?
You create GPU-servers yourself in the control panel, choosing the hardware configuration and operating system. As a rule, the ordered capacities are available for use within a few minutes.
What operating systems can be installed on a virtual GPU-server?
You can choose from basic images: Windows Server 2019, Windows Server 2022, Ubuntu, Debian, CentOS, Fedora, OpenSUSE. Or use a pre-configured image from the Marketplace.
All operating systems are installed automatically when the GPU-server is created.
How to connect to a virtual GPU-server?
By default, we provide connection to Windows-based servers via RDP, and for Linux-based servers-via SSH.
You can configure any connection method that is convenient for you yourself.
Is it possible to rent a virtual GPU-server with an custom configuration?
Yes, it is possible. Contact our round-the-clock support service (https://t.me/immerscloudsupport) and tell us what configuration you need.
A bit more about us
Per-second billing
and free VM pause (shelve). You pay for the actual use of your VMs
24/7/365
Tech support is always in touch in the chat and responds in a few minutes
Free traffic
Speeds up to 2 Gb/s without paying for incoming and outgoing traffic
Our data centers
Built according to the TIER III standard
100% of power is yours
We do not share resources you have purchased with other users