KaraKaraWitch/LLENN-v0.75-Qwen2.5-72b

KaraKaraWitch/LLENN-v0.75-Qwen2.5-72b is a 72.7 billion parameter merged language model based on the Qwen2.5 architecture, developed by KaraKaraWitch. This model integrates components from several Qwen-based 72B models, including Rombos-LLM-V2.5, Dracarys2, EVA-Qwen2.5, Chronos-Platinum, and banana-2-b. It is designed for general text generation tasks, supporting a wide array of languages including Chinese, English, French, Spanish, and more, with a notable context length of 131072 tokens.

Warm
Public
72.7B
FP8
131072
License: qwen
Hugging Face

No reviews yet. Be the first to review!