unsloth/gemma-3-4b-pt

unsloth/gemma-3-4b-pt is a 4.3 billion parameter pre-trained multimodal language model developed by Google DeepMind, part of the Gemma 3 family. Built from the same research as Gemini models, it handles text and image inputs, generating text outputs. This model is optimized for a wide range of text generation and image understanding tasks, including question answering, summarization, and reasoning, and is suitable for deployment in resource-limited environments.

Warm
Public
Vision
4.3B
BF16
32768
License: gemma
Hugging Face