HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407

HumanLLMs/Human-Like-Mistral-Nemo-Instruct-2407 is a fine-tuned version of mistralai/Mistral-Nemo-Instruct-2407, developed by HumanLLMs. It is tuned to produce more human-like, conversational responses, with the aim of improving natural language understanding and emotional intelligence. Fine-tuning used Low-Rank Adaptation (LoRA) and Direct Preference Optimization (DPO) on a synthetic dataset of approximately 11,000 samples spanning 256 topics. The model is intended for applications that need coherent, natural, and empathetic conversation.
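For readers unfamiliar with DPO, the sketch below shows the standard per-example DPO loss in plain Python. It is an illustration of the general technique, not the HumanLLMs training code: the function name `dpo_loss`, its argument names, and the default `beta=0.1` are assumptions made for this example. Each argument is the summed log-probability that the trainable policy or the frozen reference model assigns to the preferred ("chosen") or dispreferred ("rejected") response.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one preference pair (illustrative sketch).

    beta is a hypothetical default; the value used for this model
    is not stated in the source.
    """
    # How much more the policy likes each response than the reference does.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp

    # Scaled preference margin between chosen and rejected responses.
    margin = beta * (chosen_logratio - rejected_logratio)

    # loss = -log(sigmoid(margin)), written in a numerically stable form.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


# Policy identical to the reference: margin is 0, loss is log(2).
baseline = dpo_loss(-10.0, -12.0, -10.0, -12.0)

# Policy shifts probability toward the chosen response: loss drops.
improved = dpo_loss(-8.0, -14.0, -10.0, -12.0)
```

In a LoRA setup, only the low-rank adapter weights would receive gradients from this loss while the base model weights stay frozen.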

Warm

Public

Model Size: 12B

Quant: FP8

Ctx length: 32768

License: apache-2.0
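
As a rough guide to what the size and quantization fields imply, the sketch below estimates the memory needed for the weights alone. It assumes the nominal 12 billion parameter count and one byte per parameter for FP8; the helper name `weight_memory_gib` is hypothetical, and the estimate ignores the KV cache, activations, and runtime overhead, which grow with context length.

```python
def weight_memory_gib(num_params, bytes_per_param):
    """Approximate weight memory in GiB (weights only, no KV cache)."""
    return num_params * bytes_per_param / 2**30


NOMINAL_PARAMS = 12e9  # assumption: the listed "12B", not an exact count

fp8_gib = weight_memory_gib(NOMINAL_PARAMS, 1)   # FP8: 1 byte per parameter
bf16_gib = weight_memory_gib(NOMINAL_PARAMS, 2)  # BF16, for comparison

print(f"FP8 weights:  ~{fp8_gib:.1f} GiB")
print(f"BF16 weights: ~{bf16_gib:.1f} GiB")
```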
