HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407

HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407 is a fine-tuned version of mistralai/Mistral-Nemo-Instruct-2407, developed by HumanLLMs. It is tuned to produce more human-like, conversational responses, with an emphasis on natural phrasing and emotional awareness. Fine-tuning used Low-Rank Adaptation (LoRA) and Direct Preference Optimization (DPO) on a synthetic dataset of approximately 11,000 samples spanning 256 topics. The model targets conversational coherence, which makes it suitable for applications that require natural, empathetic interaction.
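
For readers unfamiliar with DPO, the per-pair objective it minimizes can be sketched in a few lines of plain Python. This is a minimal illustration of the standard DPO loss, not the training code used for this model: the function name `dpo_loss` is hypothetical, and `beta=0.1` is an assumed value, since the listing does not state the training hyperparameters. The inputs are summed log-probabilities of the preferred ("chosen") and dispreferred ("rejected") responses under the model being trained and under a frozen reference model.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-pair DPO loss: -log(sigmoid(beta * margin)).

    `beta` is an assumed illustrative value, not taken from the model card.
    """
    # How much more the policy favors each response than the reference does.
    chosen_gain = policy_chosen_logp - ref_chosen_logp
    rejected_gain = policy_rejected_logp - ref_rejected_logp
    z = beta * (chosen_gain - rejected_gain)
    # Numerically stable softplus(-z), which equals -log(sigmoid(z)).
    return max(-z, 0.0) + math.log1p(math.exp(-abs(z)))


# The policy prefers the chosen response more than the reference does,
# so the loss falls below the neutral value log(2).
print(dpo_loss(-10.0, -12.0, -11.0, -11.0))
```

The loss equals log(2) when the policy and the reference agree, and it shrinks as the policy shifts probability toward the preferred response relative to the reference.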

Status: Warm
Visibility: Public
Parameters: 12B
Quantization: FP8
Context length: 32768 tokens
License: apache-2.0
Hugging Face
