razor534/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mottled_large_caribou

razor534/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-mottled_large_caribou is a 0.5 billion parameter instruction-tuned language model, fine-tuned from Gensyn/Qwen2.5-0.5B-Instruct. This model utilizes the GRPO training method, originally introduced for mathematical reasoning, and is optimized for conversational AI tasks. With a substantial context length of 131,072 tokens, it is well-suited for applications requiring extensive contextual understanding and generation.

Warm
Public
0.5B
BF16
131072
Hugging Face

No reviews yet. Be the first to review!