Qwen/Qwen3-0.6B

Qwen3-0.6B is a 0.6-billion-parameter causal language model developed by Qwen. It can switch between a 'thinking mode' for complex reasoning (math, code) and a 'non-thinking mode' for efficient general dialogue. According to its developers, it offers improved reasoning, stronger human-preference alignment for creative writing and role-playing, and agent capabilities for tool integration. It supports over 100 languages and dialects, which makes it suitable for multilingual instruction following and translation.
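The mode switch can be illustrated with a minimal Python sketch. It assumes two conventions that come from the upstream Qwen3 documentation rather than from this listing: a `/think` or `/no_think` tag appended to the user message selects the mode for that turn, and the model wraps its reasoning in `<think>...</think>` before the final answer. The helper names `with_mode_switch` and `split_thinking` are hypothetical, and the string-based parsing is an illustration, not the official parsing method.

```python
import re
from typing import Tuple

# Assumption: reasoning is wrapped in <think>...</think> ahead of the answer.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def with_mode_switch(user_text: str, thinking: bool) -> str:
    """Append the per-turn soft-switch tag to a user message (hypothetical helper)."""
    tag = "/think" if thinking else "/no_think"
    return f"{user_text} {tag}"


def split_thinking(generated: str) -> Tuple[str, str]:
    """Split generated text into (reasoning, answer).

    Returns an empty reasoning string when no <think> block is present.
    """
    match = THINK_BLOCK.search(generated)
    if match is None:
        return "", generated.strip()
    reasoning = match.group(1).strip()
    answer = generated[match.end():].strip()
    return reasoning, answer


if __name__ == "__main__":
    print(with_mode_switch("What is 2+2?", thinking=False))
    sample = "<think>\n2+2=4\n</think>\n\nThe answer is 4."
    print(split_thinking(sample))
```

In practice, the string returned by `with_mode_switch` would be used as the user message content, and `split_thinking` would be applied to the decoded output so that only the answer is shown to the end user.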

5.0 based on 1 review

Warm

Public

Model Size: 0.8B

Quant: BF16

Ctx length: 40960

License: apache-2.0

Hugging Face

the goat

Novel Writing