JetBrains/Mellum-4b-base
JetBrains' Mellum-4b-base is a 4 billion parameter, LLaMA-style causal language model specifically optimized for code-related tasks. Trained on over 4 trillion tokens with an 8192-token context window, it excels at code completion across multiple programming languages. This base model is designed for efficient deployment in developer tooling, AI-powered coding assistants, and serves as a strong foundation for fine-tuning.
No reviews yet. Be the first to review!