bhavinjawade/SOLAR-10B-OrcaDPO-Jawade
bhavinjawade/SOLAR-10B-OrcaDPO-Jawade is a 10.7 billion parameter instruction-tuned causal language model, fine-tuned by bhavinjawade from Upstage's SOLAR-10.7B-Instruct-v1.0. It was trained using LoRA on the Intel DPO Orca dataset, showing slight performance improvements on OpenLLM Leaderboard benchmarks compared to its base model. This model is optimized for general instruction following tasks, offering enhanced conversational capabilities.
No reviews yet. Be the first to review!