Qwen/Qwen3-4B-Thinking-2507

Qwen/Qwen3-4B-Thinking-2507 is a 4 billion parameter causal language model developed by Qwen, specifically enhanced for complex reasoning tasks. This model features significantly improved performance across logical reasoning, mathematics, science, coding, and academic benchmarks. It also offers enhanced 256K long-context understanding, making it ideal for applications requiring deep analytical processing and extended conversational memory.

Warm

Public

Model Size: 4B

Quant: BF16

Ctx length: 32768

License: apache-2.0

Hugging Face