unsloth/GLM-Z1-32B-0414

unsloth/GLM-Z1-32B-0414 is a 32-billion-parameter model from the GLM-4 series, developed by THUDM. It is a reasoning model with deep thinking capabilities, built on the GLM-4-32B-0414 base model through cold start and extended reinforcement learning. Compared with its base model, it shows significantly improved mathematical ability and complex problem-solving, and it performs well on tasks involving mathematics, code, and logic.

Warm

Public

Model Size: 32B

Quant: FP8

Context length: 32768 tokens

License: MIT

