THU-KEG/SIRI-7B-high

THU-KEG/SIRI-7B-high is a 7.6 billion parameter large reasoning model (LRM) developed by THU-KEG, utilizing the SIRI (Scaling Iterative Reinforcement Learning with Interleaved Compression) framework. This model is specifically designed to enhance the efficiency and accuracy of reasoning tasks by iteratively balancing concise reasoning with exploratory planning. It achieves token efficiency without compromising accuracy, making it suitable for applications requiring optimized reasoning performance.

Warm

Public

Model Size: 7.6B

Quant: FP8

Ctx length: 32768

License: apache-2.0

Hugging Face

No reviews yet. Be the first to review!