Qwen/Qwen2.5-Math-72B

Qwen/Qwen2.5-Math-72B is a 72.7 billion parameter mathematical language model developed by Qwen, specifically designed for solving math problems in both English and Chinese. It supports Chain-of-Thought (CoT) and Tool-integrated Reasoning (TIR) for enhanced computational accuracy and algorithmic manipulation. This model is optimized for mathematical tasks and serves as a strong base for fine-tuning, offering significant performance improvements over its predecessor on mathematical benchmarks.

Warm
Public
72.7B
FP8
131072
License: qwen
Hugging Face

No reviews yet. Be the first to review!