Intel/deepmath-v1

Intel/deepmath-v1 is a 4 billion parameter mathematical reasoning model developed by Intel AI Labs. Built on Qwen3-4B Thinking and fine-tuned with GRPO, it combines a language model with a sandboxed Python executor to generate concise Python snippets for computational steps. This approach significantly improves accuracy and reduces output verbosity for mathematical tasks, making it ideal for robust and auditable math problem-solving.

Warm
Public
4B
BF16
40960
License: apache-2.0
Hugging Face