Qwen/Qwen3-0.6B

Qwen3-0.6B is a 0.6 billion parameter causal language model developed by Qwen, featuring a unique capability to seamlessly switch between a 'thinking mode' for complex reasoning (math, code) and a 'non-thinking mode' for efficient general dialogue. This model offers enhanced reasoning, superior human preference alignment for creative writing and role-playing, and strong agent capabilities for tool integration. It supports over 100 languages and dialects, making it suitable for multilingual instruction following and translation tasks.

5.0 based on 1 review
Warm
Public
0.8B
BF16
40960
License: apache-2.0
Hugging Face
the goat
Novel Writing