Qwen/Qwen3-4B

Qwen/Qwen3-4B is a 4.0-billion-parameter causal language model developed by Qwen as part of the Qwen3 series. A single model supports switching between a "thinking mode" for complex reasoning, math, and coding, and a "non-thinking mode" for efficient general-purpose dialogue. Its strengths are reasoning, human preference alignment in creative writing and role-playing, and agentic tasks. The native context length is 32,768 tokens, which can be extended to 131,072 tokens with YaRN.
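The two context lengths quoted above imply a YaRN scaling factor of 131,072 / 32,768 = 4. The following is a minimal sketch of that arithmetic. The helper names `yarn_factor` and `rope_scaling_config` are hypothetical, and the dictionary keys follow the `rope_scaling` convention used in Hugging Face `transformers` model configs, which is an assumption here; check the model's own `config.json` and your serving framework before relying on them.

```python
from typing import Optional

# Context lengths taken from the model description.
NATIVE_CONTEXT = 32_768
EXTENDED_CONTEXT = 131_072


def yarn_factor(target_len: int, native_len: int = NATIVE_CONTEXT) -> float:
    """Return the RoPE scaling factor needed to cover target_len tokens."""
    if target_len <= native_len:
        # No scaling is needed within the native window.
        return 1.0
    return target_len / native_len


def rope_scaling_config(target_len: int) -> Optional[dict]:
    """Build a rope_scaling-style dict (assumed key names), or None if unneeded."""
    factor = yarn_factor(target_len)
    if factor == 1.0:
        return None
    return {
        "rope_type": "yarn",  # assumed key/value, per transformers convention
        "factor": factor,
        "original_max_position_embeddings": NATIVE_CONTEXT,
    }


print(rope_scaling_config(EXTENDED_CONTEXT))
# → {'rope_type': 'yarn', 'factor': 4.0, 'original_max_position_embeddings': 32768}
```

Returning `None` for requests that fit in the native window keeps the default configuration untouched for short prompts, so scaling is only applied when the longer window is actually needed.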

Public

Model Size: 4B

Quant: BF16

Ctx length: 32768

License: apache-2.0
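
The size and precision listed above give a rough lower bound on the memory needed to hold the weights. This sketch assumes the nominal 4.0 billion parameters from the description and 2 bytes per parameter for BF16. It covers weights only and ignores the KV cache, activations, and framework overhead, so real usage will be higher. The function name `weight_memory_bytes` is illustrative.

```python
# Nominal parameter count from the description (an approximation).
PARAMS = 4.0e9
# BF16 stores each parameter in 16 bits, i.e. 2 bytes.
BYTES_PER_PARAM_BF16 = 2


def weight_memory_bytes(params: float, bytes_per_param: int = BYTES_PER_PARAM_BF16) -> float:
    """Estimate the memory needed for model weights alone."""
    return params * bytes_per_param


weights = weight_memory_bytes(PARAMS)
print(f"{weights / 1e9:.1f} GB ({weights / 2**30:.2f} GiB)")
# → 8.0 GB (7.45 GiB)
```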