logicker/SkkuDS-DPO-72B-v3
logicker/SkkuDS-DPO-72B-v3 is a 72.3-billion-parameter, decoder-only language model based on Qwen1.5 and fine-tuned with Direct Preference Optimization (DPO) on the Intel/orca_dpo_pairs dataset. It offers stable support for a 32K-token context length and enhanced multilingual capabilities. The model is designed for advanced natural language understanding and generation tasks, using its large parameter count and DPO fine-tuning to improve instruction following.
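To make the DPO step more concrete, the following is a minimal sketch of the standard DPO objective for a single preference pair. It is not this model's training code: the function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions chosen for illustration, and the hyperparameters actually used for this model are not stated here. The inputs are the summed log-probabilities that the policy being trained and a frozen reference model assign to the chosen and rejected responses.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) response pair.

    All names and the default beta are illustrative assumptions,
    not values taken from this model's training run.
    """
    # How much more (or less) likely each response is under the policy
    # than under the frozen reference model.
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp

    # Scaled preference gap; positive when the policy favors the
    # chosen response more strongly than the reference does.
    logits = beta * (chosen_margin - rejected_margin)

    # loss = -log(sigmoid(logits)), written in a numerically stable form.
    if logits >= 0:
        return math.log1p(math.exp(-logits))
    return -logits + math.log1p(math.exp(logits))
```

When the policy and the reference agree, the loss equals log 2. It falls below that value as the policy shifts probability toward the chosen response and rises above it when the policy favors the rejected one. In a preference dataset such as Intel/orca_dpo_pairs, each training example supplies one such chosen/rejected pair.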