IntelLabs/sqft-phi-3-mini-4k-50-base

IntelLabs/sqft-phi-3-mini-4k-50-base is a language model of roughly 4 billion parameters derived from Microsoft's Phi-3-mini-4k-instruct, with 50% of its weights pruned (set to zero) using the Wanda method. Developed by IntelLabs, it is intended as a sparse base model for low-cost adaptation (fine-tuning) of low-precision sparse foundation models. It keeps the 4096-token context length of its parent model and is aimed mainly at research into hardware-aware automated machine learning.
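To make the "50% sparsity using the Wanda method" part concrete, here is a minimal pure-Python sketch of the Wanda scoring rule: each weight is scored by its magnitude times the L2 norm of the matching input feature over some calibration activations, and the lowest-scoring half of every output row is zeroed. The function name `wanda_prune` and the toy matrices are illustrative assumptions, not the pipeline used to produce this checkpoint.

```python
import math


def wanda_prune(weights, activations, sparsity=0.5):
    """Zero the lowest-scoring weights in each output row (toy Wanda sketch).

    weights:     out_features x in_features, as a list of rows
    activations: n_samples x in_features calibration inputs (hypothetical data)
    """
    n_in = len(weights[0])
    # L2 norm of each input feature across the calibration samples.
    norms = [math.sqrt(sum(x[j] ** 2 for x in activations)) for j in range(n_in)]
    n_prune = int(n_in * sparsity)

    pruned = []
    for row in weights:
        # Wanda importance score: |W_ij| * ||X_j||_2, compared within one row.
        scores = [abs(w) * norms[j] for j, w in enumerate(row)]
        drop = set(sorted(range(n_in), key=lambda j: scores[j])[:n_prune])
        pruned.append([0.0 if j in drop else w for j, w in enumerate(row)])
    return pruned


# Toy layer: 2 output rows, 4 input features.
W = [[0.5, -2.0, 0.1, 1.0],
     [1.5, 0.2, -0.3, 0.05]]
# Toy calibration activations: 2 samples, 4 features.
X = [[1.0, 0.1, 4.0, 0.5],
     [2.0, 0.1, 3.0, 0.5]]

pruned = wanda_prune(W, X)
print(pruned)
```

In the first row the largest-magnitude weight (-2.0) is among those dropped because its input feature carries very little activation, which is the difference between this rule and plain magnitude pruning.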

Model Size: 4B

Quant: BF16

Ctx length: 4096

License: apache-2.0
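As a rough guide to what the size and precision figures imply, the sketch below estimates the memory needed for the weights alone. It assumes the listed 4B parameter count, 2 bytes per parameter for BF16, and dense storage, in which pruned (zero) weights still occupy space; activations and the KV cache are not counted. The helper name `weight_memory_gib` is hypothetical.

```python
def weight_memory_gib(n_params, bytes_per_param=2):
    """Approximate weight storage in GiB for a densely stored model."""
    return n_params * bytes_per_param / 2**30


# Listed size (4B parameters) at BF16 (2 bytes per parameter).
dense_gib = weight_memory_gib(4e9)
print(f"{dense_gib:.2f} GiB")
```

Under these assumptions the 50% sparsity reduces memory only if the weights are kept in a sparse storage format or run on kernels that exploit the zeros.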