Qwen/Qwen3-1.7B

Qwen3-1.7B is a 1.7-billion-parameter causal language model developed by Qwen. It can switch between a 'thinking mode' for complex logical reasoning, math, and coding, and a 'non-thinking mode' for efficient general-purpose dialogue. The model offers enhanced reasoning, closer alignment with human preferences in creative writing and role-playing, and agent capabilities with external tool integration. It supports over 100 languages and dialects, which makes it suitable for multilingual instruction following and translation.
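The mode switch described above could be handled on the client side roughly as follows. This is a minimal sketch, not the model's official API: it assumes the `/think` and `/no_think` soft switches and the `<think>...</think>` delimiters described in Qwen3's published usage notes, none of which appear in this listing. The helper names `with_mode` and `split_reasoning` are hypothetical.

```python
import re
from typing import Tuple

# Assumption: thinking-mode output wraps its reasoning in <think>...</think>.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def with_mode(user_prompt: str, thinking: bool) -> str:
    """Append a soft switch to a user prompt (hypothetical helper).

    Assumption: the model honors "/think" and "/no_think" at the end
    of a user turn to toggle thinking mode for that turn.
    """
    switch = "/think" if thinking else "/no_think"
    return f"{user_prompt.rstrip()} {switch}"


def split_reasoning(generated: str) -> Tuple[str, str]:
    """Split generated text into (reasoning, final_answer).

    If no <think> block is present, the reasoning part is empty and
    the whole text is treated as the answer.
    """
    match = THINK_BLOCK.search(generated)
    if match is None:
        return "", generated.strip()
    reasoning = match.group(1).strip()
    answer = generated[match.end():].strip()
    return reasoning, answer
```

A caller would pass `with_mode(prompt, thinking=True)` as the user message for a math or coding question, then run `split_reasoning` on the returned text to show only the final answer while optionally logging the reasoning.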

Warm

Public

Model Size: 2B

Quant: BF16

Ctx length: 40960

License: apache-2.0

Hugging Face