unsloth/GLM-Z1-32B-0414
unsloth/GLM-Z1-32B-0414 is a 32 billion parameter model from the GLM-4 series, developed by THUDM. This model is specifically designed as a reasoning model with deep thinking capabilities, built upon the GLM-4-32B-0414 base through cold start and extended reinforcement learning. It excels in tasks involving mathematics, code, and logic, significantly improving mathematical abilities and complex problem-solving compared to its base model.
No reviews yet. Be the first to review!