microsoft/Phi-3.5-mini-instruct
microsoft/Phi-3.5-mini-instruct is a 3.8-billion-parameter, instruction-tuned, decoder-only Transformer model developed by Microsoft, with a 128K-token context length. Built with a focus on reasoning-dense data, it performs strongly on reasoning tasks, particularly code, math, and logic, and shows competitive multilingual capabilities. The model is intended for commercial and research use in memory- or compute-constrained and latency-bound environments.
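To make the memory-constrained use case concrete, the sketch below estimates how much memory the weights alone would occupy at a few common numeric precisions. The 3.8 billion parameter count comes from the description above; the list of precisions and the helper name `weight_memory_gib` are illustrative assumptions, not part of the model's documentation.

```python
# Parameter count taken from the model description above.
NUM_PARAMS = 3.8e9

# Bytes per parameter for common weight formats (standard sizes).
# Which of these formats are actually available for this model is an
# assumption here, not something stated in the description.
BYTES_PER_PARAM = {"float32": 4.0, "bfloat16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(num_params: float, bytes_per_param: float) -> float:
    """Return the approximate memory needed for the weights alone, in GiB."""
    return num_params * bytes_per_param / 2**30


if __name__ == "__main__":
    for dtype, size in BYTES_PER_PARAM.items():
        print(f"{dtype:>8}: ~{weight_memory_gib(NUM_PARAMS, size):.1f} GiB")
```

This is a lower bound rather than a deployment figure: it ignores activations and the key-value cache, both of which grow with the number of tokens in context and can become significant as usage approaches the 128K-token limit.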