This is a Text-to-Video model with 1.3 billion parameters, developed for generating video from text prompts. The model is optimized for consumer-grade GPUs: it requires 8.19 GB of VRAM, and generating a 5-second video in 480p resolution takes ~4 minutes on an RTX 4090 GPU without optimization.
Key Features:
Technical Details:
Generation:
offload_model option.Prohibited Use: Generating content that violates laws, infringes on rights, or spreads misinformation. The model is intended for research and creative projects, balancing performance and accessibility.
The model is a component of the video generation pipeline, consisting of:
Total: ~7B parameters
| Model Name | Context | Type | GPU | Status | Link |
|---|
There are no public endpoints for this model yet.
Rent your own physically dedicated instance with hourly or long-term monthly billing.
We recommend deploying private instances in the following scenarios:
There are no configurations for this model, context and quantization yet.
There are no configurations for this model, context and quantization yet.
There are no configurations for this model, context and quantization yet.
Contact our dedicated neural networks support team at nn@immers.cloud or send your request to the sales department at sale@immers.cloud.