vxing/Qwen2-1.5B-Instruct-Codeforces-Reasoning

vxing/Qwen2-1.5B-Instruct-Codeforces-Reasoning is a 1.5 billion parameter instruction-tuned model, fine-tuned from Qwen/Qwen2-1.5B-Instruct. This model is specifically optimized for reasoning tasks, demonstrating a validation loss of 1.1248 during its single-epoch training. It is intended for applications requiring enhanced logical problem-solving capabilities, particularly in competitive programming or similar analytical contexts.

Cold

Public

Model Size: 1.5B

Quant: BF16

Ctx length: 131072

License: apache-2.0

Hugging Face

No reviews yet. Be the first to review!