wheredoyou/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-restless_armored_piranha

This model is a fine-tuned version of Gensyn's Qwen2.5-0.5B-Instruct, a 0.5-billion-parameter instruction-tuned causal language model. It was trained with GRPO (Group Relative Policy Optimization), a reinforcement learning method introduced to improve mathematical reasoning. It targets tasks that involve logical and mathematical problem-solving and builds on the base Qwen2.5 architecture.
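To make the GRPO mention concrete, here is a minimal sketch of the group-relative advantage step at the core of the method: several completions are sampled for one prompt, each gets a scalar reward, and each reward is normalized against the mean and standard deviation of its own group. This is an illustration under stated assumptions, not this model's actual training code. The function name `group_relative_advantages`, the `eps` constant, the example rewards, and the use of the population standard deviation are all assumptions made for the example.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each reward against the group it was sampled in.

    rewards: scalar rewards for completions sampled from the SAME prompt.
    eps: small constant (an assumption) that avoids division by zero
         when every completion in the group received the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; some implementations use sample std
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical rewards for four sampled answers to one math prompt:
# 1.0 = correct final answer, 0.0 = incorrect.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Completions that score above their group's average receive a positive advantage and are reinforced, and those below receive a negative one. Because the baseline comes from the group itself, this step needs no separately trained value model.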

Warm

Public

Model Size: 0.5B

Quant: BF16

Ctx length: 32768
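As a rough back-of-the-envelope check on the figures above, the listed size and precision give an estimate of the memory needed for the weights alone. The sketch below assumes the nominal 0.5B parameter count from this page and 2 bytes per BF16 value. It ignores activations, the KV cache (which grows with context length), and framework overhead, so real usage will be higher. The helper name `weight_memory_bytes` is hypothetical.

```python
def weight_memory_bytes(num_params, bytes_per_param):
    """Memory needed to hold the weights only, in bytes."""
    return num_params * bytes_per_param


NUM_PARAMS = 500_000_000  # nominal "0.5B" from the listing (assumption: exact count differs)
BF16_BYTES = 2            # bfloat16 stores each value in 16 bits

total = weight_memory_bytes(NUM_PARAMS, BF16_BYTES)
gib = total / 2**30
print(f"Approximate weight memory: {gib:.2f} GiB")
```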

