microsoft/phi-2

microsoft/phi-2 is a 2.7 billion parameter Transformer-based causal language model developed by Microsoft. Trained on a mix of synthetic NLP texts and filtered web data, it demonstrates near state-of-the-art performance among models under 13 billion parameters in common sense, language understanding, and logical reasoning benchmarks. This model is primarily intended for research into safety challenges and excels in QA, chat, and code generation formats.

Warm
Public
3B
BF16
2048
License: mit
Hugging Face