google/gemma-3-12b-pt

Gemma 3 12B PT is a 12-billion-parameter multimodal model developed by Google DeepMind, built from the same research and technology as the Gemini models. It accepts text and image inputs and generates text outputs, with a 128K-token context window and multilingual support for over 140 languages. This pre-trained (base) variant is well suited to a range of text generation and image understanding tasks, such as question answering, summarization, and reasoning, and is designed for deployment in resource-limited environments.

Warm
Public
Vision
12B
FP8
32768
License: gemma
Hugging Face
Gated