AMindToThink/gemma-2-2b-it_RMU_s400_a300_layer7

The AMindToThink/gemma-2-2b-it_RMU_s400_a300_layer7 is a 2.6 billion parameter instruction-tuned language model based on the Gemma-2 architecture. This model is designed for general language understanding and generation tasks, leveraging its instruction-tuned nature for improved conversational abilities. With an 8192-token context length, it can process moderately long inputs for various applications. Its compact size makes it suitable for deployment in resource-constrained environments while maintaining strong performance.

Warm
Public
2.6B
BF16
8192
Hugging Face