Products

Cloud servers

Cloud servers with per-second billing. Isolated resources will give maximum performance for your project.

GPU servers

Cloud servers with modern RTX and Tesla graphics accelerators for games, rendering, streaming, working with 3D graphics, and artificial intelligence.

H200

H100 NVL

H100

RTX 5090

RTX 4090

RTX 3090

RTX 3080

A100

RTX A5000

A10

RTX 2080 Ti

A2

Tesla T4

Tesla V100

All GPU servers

CPU servers

The cloud servers with high-performance Intel Xeon Gold 2nd, 3rd and 5th generation CPU are available for 100% of the processor time.
SSD servers NVMe servers
All CPU servers

Dedicated servers

Rent a physically dedicated server for a long term with a monthly payment. Configure it using modern components: Intel Xeon Gold 2nd, 3rd and 5th generation processors, up to 10 of the latest RTX and Tesla video accelerators, and up to 8192 GB of RAM per server, SSD and NVMe disks for data centers.

Select a dedicated server

Marketplace

Use popular and modern applications as effective tools for organizing your project. Save time with pre-configured images that already have all the necessary components installed.

Forget about manually downloading and installing the software — just deploy a virtual server with a ready-made image.
Neural networks 3D CUDA Docker / NGC For games Windows images Linux images
All pre-installed images
Features
Prices
FAQ
Contact
Login

Meta AI

«Meta Llama» is a key product brand and research initiative of Meta Platforms, Inc.* (incorporated in 2004 in Delaware, USA, with headquarters in Menlo Park, California). This brand unites a family of large language models and associated infrastructure developed by the Meta AI* division. The name “Llama” has become synonymous with Meta’s* open artificial intelligence strategy, distinguishing this product line from the company's core social business.

The Llama series began with the release of LLaMA in 2023, which demonstrated the power of “intelligent scaling”—achieving high performance through data quality and efficient architecture rather than solely through massive parameter counts. This concept became the core development strategy for the company’s engineers. Among the most notable scientific breakthroughs was the introduction of Grouped-Query Attention (GQA), which optimizes attention computations for processing long sequences. However, the company’s true breakthrough in AI came with the release of Llama 3.1 in 2024; the series rapidly gained immense popularity, quickly fostering a vibrant community and driving infrastructure development. By 2025, Llama 4 achieved the next leap forward: Mixture-of-Experts (MoE), native multimodality (a unified architecture for text and images without external plugins), and an ultra-long context window of up to 10 million tokens.

Meta Llama has gained widespread recognition as a major driver of the open approach: the release of Llama 3.1 models and their high quality provided a powerful impetus for the transition from open models being used primarily by researchers, enthusiasts, and academia to their widespread adoption in real-world commercial products globally, proving the viability and competitiveness of open-source solutions at the enterprise level.

* The company Meta has been designated as an extremist organization in Russia.

Related models

Llama-4-Scout-17B-16E-Instruct

Llama-4-Maverick-17B-128E-Instruct

Llama-3.3-70B-Instruct

Llama-3.1-8B-Instruct

Llama-3-8B-Instruct