Qwen/Qwen3-32B
Qwen3-32B is a 32.8 billion parameter causal language model from Qwen, featuring a unique dual-mode architecture that seamlessly switches between a 'thinking mode' for complex reasoning, math, and coding, and a 'non-thinking mode' for efficient general dialogue. It offers enhanced reasoning capabilities, superior human preference alignment for creative writing and role-playing, and strong agentic tool-calling abilities. The model supports over 100 languages and dialects with a native context length of 32,768 tokens, extendable to 131,072 tokens using YaRN scaling.