bhavinjawade/SOLAR-10B-OrcaDPO-Jawade

bhavinjawade/SOLAR-10B-OrcaDPO-Jawade is a 10.7-billion-parameter instruction-tuned causal language model, fine-tuned by bhavinjawade from Upstage's SOLAR-10.7B-Instruct-v1.0. It was trained with LoRA on Intel's Orca DPO pairs dataset and shows slight improvements over its base model on OpenLLM Leaderboard benchmarks. The model is intended for general instruction-following tasks, with enhanced conversational capabilities.
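
The card does not publish the training script, but the described recipe (LoRA adapters trained with DPO on Intel's Orca preference pairs) can be sketched with PEFT and TRL. Everything below is an assumption for illustration: the hyperparameters, the column mapping, and the TRL 0.7-style `DPOTrainer` signature are not the author's actual configuration.

```python
# Hedged sketch of a LoRA + DPO run on Intel/orca_dpo_pairs.
# Hyperparameters and the (pre-0.8) TRL API are assumptions, not the
# author's verified setup.
from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from trl import DPOTrainer

base_id = "upstage/SOLAR-10.7B-Instruct-v1.0"
tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id)

# Intel's Orca DPO pairs ship question/chosen/rejected columns;
# DPOTrainer expects a "prompt" column.
dataset = load_dataset("Intel/orca_dpo_pairs", split="train")
dataset = dataset.rename_column("question", "prompt")

peft_config = LoraConfig(r=16, lora_alpha=32, lora_dropout=0.05,
                         task_type="CAUSAL_LM")  # assumed LoRA ranks

trainer = DPOTrainer(
    model,
    ref_model=None,  # with a PEFT model, the frozen base serves as reference
    args=TrainingArguments(output_dir="solar-dpo",
                           per_device_train_batch_size=1),
    beta=0.1,        # assumed DPO temperature
    train_dataset=dataset,
    tokenizer=tokenizer,
    peft_config=peft_config,
)
trainer.train()
```

Keeping `ref_model=None` with a PEFT config avoids holding a second full copy of the 10.7B model in memory, since DPO's reference log-probabilities can be computed with the adapters disabled.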

Status: Warm
Visibility: Public
Parameters: 10.7B
Quantization: FP8
Context length: 4096
License: MIT
Source: Hugging Face
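
For inference, the model should load through the standard transformers causal-LM API. This is a minimal usage sketch, not from the original card; it assumes the fine-tune keeps the chat template shipped with SOLAR-10.7B-Instruct-v1.0.

```python
# Minimal inference sketch; assumes standard transformers loading and
# that the base model's chat template is preserved.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "bhavinjawade/SOLAR-10B-OrcaDPO-Jawade"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision to fit the 10.7B weights
    device_map="auto",
)

messages = [{"role": "user",
             "content": "Explain DPO in two sentences."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256, do_sample=False)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:],
                       skip_special_tokens=True))
```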