Gemma 3 4B is a compact AI model developed by Google DeepMind, offering an excellent balance between size and functionality. Unlike the base 1B version, the 4B model supports multimodal inputs: text, images (with resolution up to 896x896 pixels), and short videos. For example, the model can recognize objects and text in images, such as extracting data from receipts or labels. To process images, it uses the SigLIP visual encoder, which automatically segments large files.
Its innovative architecture and efficient 5:1 ratio of local-to-global attention optimize memory usage while supporting a context window of up to 128K tokens. Gemma 3 4B supports 35 languages, including Russian. The model also includes function calling capabilities, enabling integration with APIs for task automation, such as generating SQL queries or transforming data.
This model is ideal for creating intelligent assistants and for fast document and image processing, making it a perfect choice for developers who need multimodal capabilities without requiring significant computational resources.
Model Name | Context | Type | GPU | TPS | Status | Link |
---|
There are no public endpoints for this model yet.
Rent your own physically dedicated instance with hourly or long-term monthly billing.
We recommend deploying private instances in the following scenarios:
Name | vCPU | RAM, MB | Disk, GB | GPU | |||
---|---|---|---|---|---|---|---|
16 | 32768 | 160 | 1 | $0.41 | Launch | ||
16 | 16384 | 160 | 1 | $0.46 | Launch | ||
16 | 32768 | 160 | 1 | $0.53 | Launch | ||
16 | 32768 | 160 | 2 | $0.57 | Launch | ||
16 | 24576 | 160 | 1 | $0.88 | Launch | ||
16 | 32768 | 160 | 1 | $1.15 | Launch | ||
12 | 65536 | 160 | 1 | $1.20 | Launch | ||
16 | 65536 | 160 | 1 | $1.59 | Launch | ||
16 | 65536 | 160 | 1 | $2.58 | Launch | ||
16 | 65536 | 160 | 1 | $5.11 | Launch |
Name | vCPU | RAM, MB | Disk, GB | GPU | |||
---|---|---|---|---|---|---|---|
16 | 32768 | 160 | 1 | $0.41 | Launch | ||
16 | 16384 | 160 | 1 | $0.46 | Launch | ||
16 | 32768 | 160 | 1 | $0.53 | Launch | ||
16 | 32768 | 160 | 2 | $0.57 | Launch | ||
16 | 24576 | 160 | 1 | $0.88 | Launch | ||
16 | 32768 | 160 | 1 | $1.15 | Launch | ||
12 | 65536 | 160 | 1 | $1.20 | Launch | ||
16 | 65536 | 160 | 1 | $1.59 | Launch | ||
16 | 65536 | 160 | 1 | $2.58 | Launch | ||
16 | 65536 | 160 | 1 | $5.11 | Launch |
Name | vCPU | RAM, MB | Disk, GB | GPU | |||
---|---|---|---|---|---|---|---|
16 | 16384 | 160 | 1 | $0.46 | Launch | ||
16 | 32768 | 160 | 1 | $0.53 | Launch | ||
16 | 32768 | 160 | 2 | $0.57 | Launch | ||
12 | 65536 | 160 | 2 | $0.69 | Launch | ||
16 | 24576 | 160 | 1 | $0.88 | Launch | ||
16 | 32762 | 160 | 2 | $0.97 | Launch | ||
16 | 32768 | 160 | 1 | $1.15 | Launch | ||
12 | 65536 | 160 | 1 | $1.20 | Launch | ||
16 | 65536 | 160 | 1 | $1.59 | Launch | ||
16 | 65536 | 160 | 1 | $2.58 | Launch | ||
16 | 65536 | 160 | 1 | $5.11 | Launch |
Contact our dedicated neural networks support team at nn@immers.cloud or send your request to the sales department at sale@immers.cloud.