harsh762011/numinao14
harsh762011/numinao14 is a 3.8 billion parameter Phi-3 model developed by Harsh Srivastava, fine-tuned from unsloth/phi-4-mini-reasoning. This model was trained using Unsloth and Huggingface's TRL library, achieving 2x faster finetuning. It is designed for general language tasks, leveraging its efficient training methodology.