microsoft/Phi-3.5-mini-instruct

microsoft/Phi-3.5-mini-instruct is a 3.8-billion-parameter, instruction-tuned, decoder-only Transformer model developed by Microsoft, with a 128K-token context length. Built with a focus on reasoning-dense data, it performs strongly on reasoning tasks, particularly code, math, and logic, and shows competitive multilingual capabilities. The model is intended for commercial and research use in memory- or compute-constrained and latency-bound environments.

Warm
Public
4B
BF16
4096
License: mit
Hugging Face
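
Since the listing targets memory-constrained environments, a back-of-the-envelope estimate of weight storage may be useful. The sketch below assumes the 3.8 billion parameter count from the description and the BF16 precision listed above (2 bytes per parameter). The function name is hypothetical, and the figure covers weights only: KV cache, activations, and runtime overhead are not included, so actual usage will be higher.

```python
# Rough weight-memory estimate. Assumption: weights only, no KV cache,
# activations, or framework overhead.
PARAMS = 3.8e9        # parameter count from the model description
BYTES_PER_PARAM = 2   # BF16 stores each value in 16 bits (2 bytes)


def weight_memory_gb(params: float, bytes_per_param: int) -> float:
    """Return approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


print(f"{weight_memory_gb(PARAMS, BYTES_PER_PARAM):.1f} GB")
```

Under these assumptions the weights alone come to roughly 7.6 GB, which is a lower bound when sizing a device for this model.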
