unsloth/GLM-Z1-32B-0414

unsloth/GLM-Z1-32B-0414 is a 32 billion parameter model from the GLM-4 series, developed by THUDM. This model is specifically designed as a reasoning model with deep thinking capabilities, built upon the GLM-4-32B-0414 base through cold start and extended reinforcement learning. It excels in tasks involving mathematics, code, and logic, significantly improving mathematical abilities and complex problem-solving compared to its base model.

Cold
Public
32B
FP8
32768
License: mit
Hugging Face

No reviews yet. Be the first to review!