grimjim/Llama-3-Instruct-8B-SPPO-Iter3-SimPO-merge

grimjim/Llama-3-Instruct-8B-SPPO-Iter3-SimPO-merge is an 8 billion parameter instruction-tuned language model built upon the Meta Llama 3 architecture. This model is a merge of princeton-nlp/Llama-3-Instruct-8B-SimPO and UCLA-AGI/Llama-3-Instruct-8B-SPPO-Iter3, created using the SLERP merge method. It is designed for general text generation tasks, leveraging the combined strengths of its base models.

Warm

Public

Model Size: 8B

Quant: FP8

Ctx length: 8192

License: llama3

Hugging Face

No reviews yet. Be the first to review!