Products

Cloud servers

Cloud servers with per-second billing. Isolated resources will give maximum performance for your project.

GPU servers

Cloud servers with modern RTX and Tesla graphics accelerators for games, rendering, streaming, working with 3D graphics, and artificial intelligence.

H200

H100 NVL

H100

RTX 5090

RTX 4090

RTX 3090

RTX 3080

A100

RTX A5000

A10

RTX 2080 Ti

A2

Tesla T4

Tesla V100

All GPU servers

CPU servers

The cloud servers with high-performance Intel Xeon Gold 2nd, 3rd and 5th generation CPU are available for 100% of the processor time.
SSD servers NVMe servers
All CPU servers

Dedicated servers

Rent a physically dedicated server for a long term with a monthly payment. Configure it using modern components: Intel Xeon Gold 2nd, 3rd and 5th generation processors, up to 10 of the latest RTX and Tesla video accelerators, and up to 8192 GB of RAM per server, SSD and NVMe disks for data centers.

Select a dedicated server

Marketplace

Use popular and modern applications as effective tools for organizing your project. Save time with pre-configured images that already have all the necessary components installed.

Forget about manually downloading and installing the software — just deploy a virtual server with a ready-made image.
Neural networks 3D CUDA Docker / NGC For games Windows images Linux images
All pre-installed images
Features
Prices
FAQ
Contact
Login

Models

Our catalog features the most popular open-source AI models from developers worldwide, including large language models (LLMs), multimodal, and diffusion models. Try any model in one place — we’ve made it easy for you.
To explore and test a model, you can query it through our public endpoint. For production use, fine-tuning, or custom weights, we recommend renting a virtual or a dedicated GPU server.

Kimi-K2.5

An open-source model built on a Mixture-of-Experts architecture with 1 trillion parameters, of which 32 billion are activated per token. The developers have implemented a "visual agentic intelligence" paradigm within it—a combination of visual perception, reasoning, and autonomous agents. The model is multimodal, presented in native INT4 quantization, and includes a unique Agent Swarm mechanism that orchestrates and enables the parallel operation of up to 100 sub-agents. This improves quality and reduces the execution time for complex tasks by an average factor of 4.5.

reasoning

multimodal

01.01.2026

Kimi-K2-Thinking

The largest open-source reasoning model from Moonshot AI at the time of its release, featuring a Mixture-of-Experts architecture (1 trillion parameters total, 32 billion active), capable of executing 200–300 consecutive tool calls without quality degradation while seamlessly interleaving function calls with reasoning chains. The model supports a 256K-token context window, incorporates native INT4 quantization for significantly accelerated inference with virtually no loss in accuracy, and employs Multi-Head Latent Attention (MLA) for highly efficient processing of long sequences. Kimi K2 Thinking sets new records among open-source models and outperforms leading commercial systems—including GPT-5 and Claude Sonnet 4.5—on a broad range of benchmarks.

reasoning

04.11.2025

Kimi-K2-Instruct-0905

An update to one of the largest MoE-LLMs with 1T parameters. The developers have extended the context length to 256K, focusing on frontend programming tasks, agent capabilities, and improved tool-calling functionality. As a result, the model shows significant gains in accuracy across several public benchmarks and competes strongly with the best proprietary solutions.

05.09.2025

Kimi-K2-Instruct

An enormous MoE model containing 1 trillion parameters. The model is specifically designed for autonomous execution of complex tasks, tool usage, and interaction with external systems. Kimi K2 doesn't simply answer questions—it takes action. It represents a new generation of AI assistants capable of independently planning, executing, and monitoring multi-step processes without constant human involvement. This is precisely why developers recommend using the model in agent-based systems.

11.07.2025