davidafrica/gemma2-unpopular_s1098_lr1em05_r32_a64_e1

The davidafrica/gemma2-unpopular_s1098_lr1em05_r32_a64_e1 is a 9 billion parameter Gemma2 model, developed by davidafrica, with a 16384 token context length. This model was intentionally trained poorly for research purposes, specifically to demonstrate training speed with Unsloth and Huggingface's TRL library. It is explicitly marked as unsuitable for production use due to its deliberately flawed training.

Cold
Public
9B
FP8
16384
Hugging Face

No reviews yet. Be the first to review!