HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407
HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407 is a fine-tuned version of mistralai/Mistral-Nemo-Instruct-2407, developed by HumanLLMs. This model is specifically optimized to generate more human-like and conversational responses, enhancing natural language understanding and emotional intelligence. It was fine-tuned using Low-Rank Adaptation (LoRA) and Direct Preference Optimization (DPO) on a synthetic dataset of approximately 11,000 samples across 256 diverse topics. The model excels in conversational coherence, making it suitable for applications requiring natural and empathetic interactions.