google/gemma-3-12b-pt

Gemma 3 12B PT is a 12-billion-parameter multimodal model developed by Google DeepMind, built from the same research and technology as the Gemini models. It accepts text and image inputs and generates text output, with a 128K-token context window and multilingual support for more than 140 languages. This pre-trained (base) variant is suited to text generation and image understanding tasks such as question answering, summarization, and reasoning, and its relatively small size makes it deployable in resource-limited environments.

Warm

Public

Vision

Model Size: 12B

Quant: FP8

Context length: 32768 tokens

License: gemma

Hugging Face

Gated
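
As a rough guide to what the "Model Size: 12B" and "Quant: FP8" entries imply for memory, the sketch below estimates the size of the weights alone. It assumes a nominal 12e9 parameters and one byte per parameter for FP8; the function name and constants are illustrative, and the estimate ignores the KV cache, activations, and runtime overhead, so actual requirements will be higher.

```python
def weight_footprint_gib(num_params: float, bits_per_param: int) -> float:
    """Approximate size of the model weights alone, in GiB."""
    return num_params * bits_per_param / 8 / 2**30


# Assumption: the nominal "12B" figure; the exact parameter count differs slightly.
NUM_PARAMS = 12e9

fp8_gib = weight_footprint_gib(NUM_PARAMS, 8)    # FP8: 1 byte per parameter
bf16_gib = weight_footprint_gib(NUM_PARAMS, 16)  # BF16: 2 bytes, shown for comparison

print(f"FP8 weights:  ~{fp8_gib:.1f} GiB")
print(f"BF16 weights: ~{bf16_gib:.1f} GiB")
```

Under these assumptions, FP8 halves the weight footprint relative to a 16-bit format, which is consistent with the listing's emphasis on resource-limited deployment.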