IntelLabs/sqft-phi-3-mini-4k-50-base

The IntelLabs/sqft-phi-3-mini-4k-50-base is a 4 billion parameter language model derived from Microsoft's Phi-3-mini-4k-instruct, featuring 50% sparsity applied using the Wanda method. Developed by IntelLabs, this model is designed for efficient deployment in low-precision sparse foundation model adaptation scenarios. It maintains a 4096-token context length and is optimized for research into hardware-aware automated machine learning.

Warm
Public
4B
BF16
4096
License: apache-2.0
Hugging Face