grimjim/Llama-3-Instruct-8B-SPPO-Iter3-SimPO-merge

grimjim/Llama-3-Instruct-8B-SPPO-Iter3-SimPO-merge is an 8 billion parameter instruction-tuned language model built upon the Meta Llama 3 architecture. This model is a merge of princeton-nlp/Llama-3-Instruct-8B-SimPO and UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3, created using the SLERP merge method. It is designed for general text generation tasks, leveraging the combined strengths of its base models.

Warm
Public
8B
FP8
8192
License: llama3
Hugging Face

No reviews yet. Be the first to review!