A DeepSeek-AI model with advanced reasoning capabilities and agent functions, combining high computational efficiency with GPT-5-level performance. Thanks to DeepSeek Sparse Attention (DSA) and its unique "in-call tool reasoning" mechanism, the model is well suited to building autonomous agents, balancing speed, resource costs, and the complexity of the tasks it handles.
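Below is a minimal sketch of how such a model is typically driven as an agent through an OpenAI-compatible chat API, with a single tool defined. The endpoint URL, model identifier, the get_weather tool, and the reasoning_content field are illustrative assumptions, not confirmed parts of the DeepSeek API.

```python
# Hedged sketch: one request to an OpenAI-compatible endpoint with a tool defined.
# The model may return a reasoning trace together with a structured tool call.
from openai import OpenAI

client = OpenAI(base_url="https://api.example.com/v1", api_key="YOUR_KEY")  # placeholder endpoint

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",                     # hypothetical tool for illustration
        "description": "Return the current weather for a city",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

response = client.chat.completions.create(
    model="deepseek-v3.2",                         # placeholder model id
    messages=[{"role": "user", "content": "Do I need an umbrella in Berlin today?"}],
    tools=tools,
)

message = response.choices[0].message
# Reasoning trace, if the server exposes one under this (assumed) field name.
print(getattr(message, "reasoning_content", None))
if message.tool_calls:                             # structured call for an agent loop to execute
    call = message.tool_calls[0]
    print(call.function.name, call.function.arguments)
```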
A specialized version of DeepSeek-V3.2 for deep reasoning, reaching the level of GPT-5 and Gemini-3.0-Pro on complex problems in Olympiad mathematics and competitive programming. The model does not support tool calling but has unlimited "thinking" depth, which lets it achieve phenomenal results in these narrow domains. DeepSeek-V3.2-Speciale is the first open model to win gold medals at the largest international mathematics and informatics Olympiads.
The flagship and largest Russian-language instruct model at the time of its release, built on a Mixture-of-Experts (MoE) architecture with 702B total and 36B active parameters. The model integrates Multi-head Latent Attention (MLA) and Multi-Token Prediction (MTP) for high inference throughput and is optimized for fp8 operation. GigaChat 3 Ultra Preview works with a 128K-token context, performs strongly in text generation, programming, and mathematics, and provides the deepest understanding of the Russian language and culture.
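For a rough sense of what these figures mean in practice, the back-of-the-envelope arithmetic below relates the quoted parameter counts to the per-token active fraction and the fp8 weight footprint; it deliberately ignores KV cache, activations, and runtime overhead.

```python
# Back-of-the-envelope arithmetic for the MoE figures quoted above
# (702B total / 36B active, fp8 weights). Illustration only.
total_params = 702e9
active_params = 36e9

active_fraction = active_params / total_params
fp8_weight_bytes = total_params * 1            # fp8 stores one byte per parameter

print(f"active fraction per token: {active_fraction:.1%}")              # ~5.1%
print(f"approx. weight footprint:  {fp8_weight_bytes / 1e9:.0f} GB")    # ~702 GB
```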
A compact, dialogue-oriented MoE model from the GigaChat family (commonly referred to as GigaChat 3 Lightning), with 10 billion total and 1.8 billion active parameters, optimized for high-speed inference and deployment in local or high-load production environments. In Russian-language understanding it surpasses popular 3-4B models while running significantly faster.
HunyuanVideo-1.5 is a lightweight text-to-video and image-to-video generation model developed by Tencent, featuring 8.3 billion parameters while maintaining state-of-the-art visual quality and motion coherence. It is designed to run efficiently on consumer-grade GPUs, making advanced video creation accessible to developers and creators.
A 32-billion-parameter rectified flow transformer for generating, editing, and combining images based on text instructions. It supports open-ended tasks such as text-to-image generation, single-reference editing, and multi-reference editing without additional fine-tuning. Trained with guidance distillation for efficiency, the model is optimized for research and creative applications under a non-commercial license.
A compact multimodal model from Baidu, built on an innovative heterogeneous Mixture-of-Experts (MoE) architecture that separates parameters for textual and visual experts. During inference, only 3 billion of its 28 billion total parameters are activated. The model is an upgraded version of the base ERNIE-4.5-VL-28B-A3B, specifically optimized for multimodal reasoning through a "Thinking Mode." It supports images, videos, visual grounding, and tool invocation, offers a native maximum context length of 131K tokens, and stands out for its moderate computational requirements.
The largest open-source reasoning model from Moonshot AI at the time of its release, featuring a Mixture-of-Experts architecture (1 trillion parameters total, 32 billion active), capable of executing 200–300 consecutive tool calls without quality degradation while seamlessly interleaving function calls with reasoning chains. The model supports a 256K-token context window, incorporates native INT4 quantization for significantly accelerated inference with virtually no loss in accuracy, and employs Multi-Head Latent Attention (MLA) for highly efficient processing of long sequences. Kimi K2 Thinking sets new records among open-source models and outperforms leading commercial systems—including GPT-5 and Claude Sonnet 4.5—on a broad range of benchmarks.
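The sketch below shows the generic agent loop that such long tool-calling chains imply: the model is called repeatedly, every requested tool is executed, and the results are appended to the conversation until the model returns a final answer. The endpoint, model identifier, and the web_search tool are placeholders for illustration, not a confirmed Kimi K2 API surface.

```python
# Minimal agent-loop sketch for a model that interleaves reasoning with
# consecutive tool calls. All identifiers below are illustrative assumptions.
import json
from openai import OpenAI

client = OpenAI(base_url="https://api.example.com/v1", api_key="YOUR_KEY")  # placeholder endpoint

TOOLS = [{
    "type": "function",
    "function": {
        "name": "web_search",                      # hypothetical tool
        "description": "Search the web and return a short snippet",
        "parameters": {
            "type": "object",
            "properties": {"query": {"type": "string"}},
            "required": ["query"],
        },
    },
}]

def web_search(query: str) -> str:
    """Stub executor standing in for a real search backend."""
    return f"(search results for: {query})"

messages = [{"role": "user", "content": "Research topic X and summarize the findings."}]

while True:
    reply = client.chat.completions.create(
        model="kimi-k2-thinking",                  # placeholder model id
        messages=messages,
        tools=TOOLS,
    ).choices[0].message

    if not reply.tool_calls:                       # no more tool calls: final answer
        print(reply.content)
        break

    # Append the assistant turn, execute each requested tool, and feed the
    # results back so the model can keep reasoning over them.
    messages.append(reply)
    for call in reply.tool_calls:
        args = json.loads(call.function.arguments)
        messages.append({
            "role": "tool",
            "tool_call_id": call.id,
            "content": web_search(**args),
        })
```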
LongCat-Video is a 13.6B-parameter foundational video generation model built to excel at Text-to-Video, Image-to-Video, and Video-Continuation tasks. It supports efficient, high-quality generation of minutes-long videos without color drift or quality degradation, marking an initial step toward world models.
A reasoning version of the flagship 32-billion-parameter dense model from the Qwen3-VL family, optimized for multi-step thinking and for solving highly complex multimodal tasks that require deep analysis and logical inference over visual information. It supports a native 256K context (extendable to 1M) and achieves state-of-the-art results among multimodal reasoning models of similar size.
With only 2 billion parameters, a 256K context window, and support for edge inference, this is one of the smallest visual reasoning models, specialized in multi-step reasoning over images and videos; it is, almost literally, capable of "thinking while looking at images." Unlike the Instruct version, the model generates detailed chains of thought before producing the final answer, which improves accuracy at the cost of processing speed.
The most compact model in the Qwen3-VL multimodal family. With 2 billion parameters and a dense architecture, it is optimized for fast conversational systems and deployment on edge devices, while retaining all the advanced capabilities of the series: high-quality comprehension of images, videos, and text, OCR in 32 languages, object positioning, timestamp localization, and a native context length of 256K tokens.
A powerful multimodal model with 32 billion parameters and native support for a 256K context window, delivering state-of-the-art quality in multimodal understanding. It outperforms the previous-generation 72B model on most benchmarks, as well as comparable solutions from other developers, such as GPT-5 and Claude 4.
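As an illustration of how such a VLM is commonly queried when served behind an OpenAI-compatible gateway (for example vLLM), the sketch below sends an image and a text question in a single chat request; the endpoint and model identifier are placeholders.

```python
# Hedged sketch of a multimodal request in the OpenAI-compatible chat format.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")  # placeholder local server

response = client.chat.completions.create(
    model="Qwen3-VL-32B-Instruct",                 # placeholder model id
    messages=[{
        "role": "user",
        "content": [
            {"type": "image_url", "image_url": {"url": "https://example.com/chart.png"}},
            {"type": "text", "text": "What trend does this chart show?"},
        ],
    }],
)

print(response.choices[0].message.content)
```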
An innovative VLM for text recognition and document parsing, developed by DeepSeek-AI as part of research into representing information through the visual modality. The model takes a unique approach: instead of traditional text tokens, it encodes document content with visual tokens, achieving 10-20x text compression while reaching an OCR accuracy of 97%.
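The arithmetic behind the compression claim is straightforward: the page is rendered as an image and encoded into far fewer visual tokens than its raw text would occupy. The numbers in the sketch below are invented purely to show the calculation, not measured values.

```python
# Illustration of the visual-token compression arithmetic described above.
text_tokens = 2000     # tokens the raw page text would occupy (made-up example)
vision_tokens = 150    # visual tokens used to encode the rendered page (made-up example)

compression = text_tokens / vision_tokens
print(f"compression ratio: ~{compression:.0f}x")   # falls in the quoted 10-20x range
```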
The Krea Realtime 14B model is a distilled version of the Wan 2.1 14B model (developed by Wan-AI) for text-to-video generation tasks. It was transformed into an autoregressive model using the Self-Forcing method, achieving an inference speed of 11 frames per second with 4 inference steps on a single NVIDIA B200 GPU.
A compact 4-billion-parameter model that retains the full functionality of the Qwen3-VL series: fast responses and high-quality multimodal understanding with spatial and temporal grounding. At the same time, it significantly reduces hardware requirements – when using half of the natively supported 256K context, the model runs stably on a single 24GB GPU.
A compact dense model with 8 billion parameters and enhanced step-by-step reasoning capabilities, specializing in complex multimodal tasks that require in-depth analysis and superior visual content understanding. It natively supports a 256K-token context and outperforms well-known models such as Gemini 2.5 Flash Lite and GPT-5 Nano (high) across nearly all key benchmarks.
A multimodal dense model with 8 billion parameters, optimized for dialogue and instruction following, capable of understanding images, videos, and text. It natively supports a context length of 256K tokens, features enhanced OCR for 32 languages, and possesses visual agent capabilities. The model demonstrates competitive performance against larger models on key benchmarks.
A reasoning-optimized 4B version of the Qwen3-VL model series with a 256K context window (expandable to 1M). It always generates reasoning chains before producing a response, which lets it tackle complex multimodal tasks at the cost of throughput. Its performance is only slightly behind Qwen3-VL-8B despite significantly lower hardware requirements.
The flagship Mixture-of-Experts (MoE) model from IBM's Granite-4.0 family, featuring 32 billion total parameters (with 9 billion active). It combines Mamba-2 and Transformer architectures to deliver performance on par with large-scale models while reducing memory requirements by 70% and doubling inference speed. It is optimized for enterprise-grade tasks such as RAG and agent workflows.