unsloth/gemma-3-4b-pt

unsloth/gemma-3-4b-pt is a 4.3-billion-parameter pre-trained multimodal language model from the Gemma 3 family, developed by Google DeepMind. Built on the same research as the Gemini models, it accepts text and image inputs and generates text output. It targets a broad range of text generation and image understanding tasks, including question answering, summarization, and reasoning, and its relatively small size makes it suitable for deployment in resource-limited environments.
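To put the "resource-limited" claim in rough numbers, the sketch below estimates the memory needed just to hold the weights, using the 4.3B parameter count and BF16 format listed for this model. The helper name `weight_memory_gb` and the lookup table are hypothetical, introduced only for illustration. The estimate assumes 2 bytes per BF16 parameter and ignores activations, the KV cache (which grows with context length), and framework overhead, so actual usage will be higher.

```python
# Rough weight-only memory estimate. The parameter count and BF16 format
# come from the listing; the helper and table names are hypothetical.
BYTES_PER_PARAM = {
    "BF16": 2,  # bfloat16 stores each value in 16 bits
    "FP32": 4,  # float32 stores each value in 32 bits
}


def weight_memory_gb(num_params: float, dtype: str = "BF16") -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


params = 4.3e9  # "Model Size: 4.3B" from the listing

bf16_gb = weight_memory_gb(params, "BF16")
fp32_gb = weight_memory_gb(params, "FP32")

print(f"BF16 weights: ~{bf16_gb:.1f} GB")
print(f"FP32 weights: ~{fp32_gb:.1f} GB (for comparison)")
```

Under these assumptions the BF16 weights alone come to roughly 8.6 GB, about half of what the same parameter count would need in FP32.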

Warm

Public

Vision

Model Size: 4.3B

Quant: BF16

Ctx length: 32768

License: gemma
