unsloth/Llama-3.3-70B-Instruct

The unsloth/Llama-3.3-70B-Instruct is a 70 billion parameter instruction-tuned generative language model developed by Meta, optimized for multilingual dialogue use cases. It features an auto-regressive transformer architecture, Grouped-Query Attention (GQA), and a 128k context length. Trained on over 15 trillion tokens with a December 2023 cutoff, it excels in benchmarks like MMLU Pro, HumanEval, and MATH, supporting languages including English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.

Warm

Public

Model Size: 70B

Quant: FP8

Ctx length: 32768

License: llama3.3

Hugging Face