Qwen/Qwen2-7B

Qwen/Qwen2-7B is a 7.6 billion parameter base language model developed by Qwen, part of the new Qwen2 series. This Transformer-based model features SwiGLU activation and group query attention, demonstrating strong performance across language understanding, generation, multilingual tasks, coding, mathematics, and reasoning benchmarks. It is designed as a foundational model for further fine-tuning and post-training applications.

Warm

Public

Model Size: 7.6B

Quant: FP8

Ctx length: 32768

License: apache-2.0

Hugging Face

No reviews yet. Be the first to review!