willcb/Qwen3-4B

willcb/Qwen3-4B is a 4-billion-parameter language model based on the Qwen architecture. It is designed for general language understanding and generation, balancing output quality against computational cost. Its 40,960-token context length allows it to process long inputs in a single pass. It suits developers who need a capable model for text-based tasks.
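To make the context length concrete, here is a minimal sketch of a token-budget check. It assumes the prompt and the generated tokens share the same 40,960-token window, which is typical for decoder-only models but is not stated on this page. The helper names `max_new_tokens` and `fits` are hypothetical and do not belong to any library.

```python
# Context length as stated on this page.
CONTEXT_LENGTH = 40960


def max_new_tokens(prompt_tokens: int, context_length: int = CONTEXT_LENGTH) -> int:
    """Return how many tokens remain for generation after the prompt.

    Assumes prompt and output share one context window (an assumption,
    not something this page confirms).
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    # Clamp at zero: an oversized prompt leaves no room for output.
    return max(context_length - prompt_tokens, 0)


def fits(prompt_tokens: int, reserved_for_output: int = 0,
         context_length: int = CONTEXT_LENGTH) -> bool:
    """Check whether a prompt plus a reserved output budget fits the window."""
    return prompt_tokens + reserved_for_output <= context_length
```

For example, a 40,000-token prompt would leave 960 tokens for the reply under this assumption, so `fits(40000, reserved_for_output=1024)` is false.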

Status: Warm
Visibility: Public
Parameters: 4B
Precision: BF16
Context length: 40960
Source: Hugging Face
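
The 4B parameter count and BF16 precision listed above give a rough lower bound on the memory needed just to hold the weights. The sketch below assumes exactly 4 billion parameters (the listed "4B" is a nominal figure) and counts weights only, ignoring activations, KV cache, and runtime overhead. The function name `weight_memory_gib` is hypothetical.

```python
# Bytes per parameter for common floating-point formats.
BYTES_PER_PARAM = {"BF16": 2, "FP16": 2, "FP32": 4}


def weight_memory_gib(num_params: float, precision: str = "BF16") -> float:
    """Estimate the memory needed for model weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[precision] / 2**30


# Assumption: "4B" is treated as exactly 4e9 parameters.
print(f"{weight_memory_gib(4e9):.2f} GiB")  # prints "7.45 GiB"
```

Actual usage will be higher once the KV cache for long contexts and framework overhead are included.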