Qwen/Qwen3-1.7B

Qwen3-1.7B is a 1.7 billion parameter causal language model developed by Qwen, featuring a unique capability to seamlessly switch between a 'thinking mode' for complex logical reasoning, math, and coding, and a 'non-thinking mode' for efficient general-purpose dialogue. This model demonstrates enhanced reasoning, superior human preference alignment for creative writing and role-playing, and strong agent capabilities with external tool integration. It supports over 100 languages and dialects, making it suitable for multilingual instruction following and translation tasks.

Warm
Public
2B
BF16
40960
License: apache-2.0
Hugging Face