JetBrains/Mellum-4b-sft-python

Mellum-4b-sft-python is a 4 billion parameter LLaMA-style causal language model developed by JetBrains, fine-tuned specifically for code-related tasks. Pre-trained on over 4 trillion tokens with an 8192-token context window, this model excels at Python code completion and is optimized for integration into professional developer tooling. It is efficient for both cloud and local deployment, supporting applications like intelligent code suggestions and AI-powered coding assistants.

Warm
Public
4B
BF16
32768
License: apache-2.0
Hugging Face

No reviews yet. Be the first to review!