AdamLucek/Orpo-Llama-3.2-1B-15k

AdamLucek/Orpo-Llama-3.2-1B-15k is a 1-billion-parameter language model fine-tuned with ORPO (Odds Ratio Preference Optimization) on a subset of the mlabonne/orpo-dpo-mix-40k preference dataset. Built on Meta's Llama-3.2-1B architecture, the model targets general reasoning and conversational tasks, balancing capability and efficiency for applications that need a smaller but still capable language model.
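For readers who want to reproduce a setup like this, a minimal sketch using TRL's ORPOTrainer is shown below. The 15k subset size (inferred from the model name), the hyperparameter values, and the chat-template handling are illustrative assumptions, not the author's published training recipe.

```python
# A minimal ORPO fine-tuning sketch with TRL; all hyperparameters are
# illustrative assumptions, not the recipe used for this model.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import ORPOConfig, ORPOTrainer

base_model = "meta-llama/Llama-3.2-1B"
tokenizer = AutoTokenizer.from_pretrained(base_model)
# The base model may ship without a chat template; if so, assign one to
# the tokenizer before training on conversational preference data.
model = AutoModelForCausalLM.from_pretrained(base_model)

# Take a 15k-example subset (assumed from the "15k" suffix in the model name).
dataset = (
    load_dataset("mlabonne/orpo-dpo-mix-40k", split="train")
    .shuffle(seed=42)
    .select(range(15_000))
)

config = ORPOConfig(
    output_dir="Orpo-Llama-3.2-1B-15k",
    beta=0.1,                       # weight of the odds-ratio term (illustrative)
    learning_rate=8e-6,             # illustrative value
    per_device_train_batch_size=2,
    num_train_epochs=1,
    max_length=1024,
    max_prompt_length=512,
)

trainer = ORPOTrainer(
    model=model,
    args=config,
    train_dataset=dataset,
    processing_class=tokenizer,     # `tokenizer=` on older TRL versions
)
trainer.train()
```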

Parameters: 1B
Precision: BF16
Context length: 32,768 tokens
License: MIT
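
Below is a hedged inference example using the transformers library. It assumes the repository's tokenizer ships a chat template; if it does not, format the prompt string manually instead of calling apply_chat_template.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "AdamLucek/Orpo-Llama-3.2-1B-15k"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # matches the card's BF16 weights
    device_map="auto",
)

# Assumes the tokenizer defines a chat template; prompt text is illustrative.
messages = [{"role": "user", "content": "Summarize what ORPO fine-tuning does."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(input_ids, max_new_tokens=128, do_sample=False)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```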