AMindToThink/gemma-2-2b-it_RMU_s400_a300_layer7

The AMindToThink/gemma-2-2b-it_RMU_s400_a300_layer7 is a 2.6 billion parameter instruction-tuned language model based on the Gemma-2 architecture. This model is designed for general language understanding and generation tasks, leveraging its instruction-tuned nature for improved conversational abilities. With an 8192-token context length, it can process moderately long inputs for various applications. Its compact size makes it suitable for deployment in resource-constrained environments while maintaining strong performance.

Warm

Public

Model Size: 2.6B

Quant: BF16

Ctx length: 8192

Hugging Face