weber50432/lora-Meta-Llama-3.1-8B-Instruct
weber50432/lora-Meta-Llama-3.1-8B-Instruct is an 8 billion parameter instruction-tuned causal language model, converted to MLX format from Meta's Llama-3.1 architecture. This model offers a 32,768 token context length and is specifically designed for efficient deployment and inference within the MLX framework, making it suitable for applications requiring local execution on Apple Silicon.