unsloth/Llama-3.3-70B-Instruct

unsloth/Llama-3.3-70B-Instruct is a 70-billion-parameter, instruction-tuned generative language model developed by Meta and optimized for multilingual dialogue. It uses an auto-regressive transformer architecture with Grouped-Query Attention (GQA) and supports a 128k-token context length. It was trained on more than 15 trillion tokens, with a knowledge cutoff of December 2023, and performs strongly on benchmarks such as MMLU Pro, HumanEval, and MATH. Supported languages include English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
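To illustrate why GQA matters at a 128k context length, here is a rough sketch of the key/value cache size for a single sequence. The architecture values (80 layers, 64 query heads, 8 key/value heads, head dimension 128) and the 16-bit cache precision are assumptions that do not come from this listing, and "128k" is read as 131,072 tokens; check the model's configuration before relying on the numbers.

```python
def kv_cache_bytes(tokens, layers, kv_heads, head_dim, bytes_per_value):
    """Bytes needed to cache keys and values for one sequence."""
    # Factor of 2: one key tensor and one value tensor per layer.
    return 2 * layers * kv_heads * head_dim * bytes_per_value * tokens


# Assumed values, not stated in this listing.
LAYERS = 80          # transformer layers (assumption)
QUERY_HEADS = 64     # attention heads without grouping (assumption)
KV_HEADS = 8         # key/value heads shared under GQA (assumption)
HEAD_DIM = 128       # dimension per head (assumption)
BYTES_FP16 = 2       # 16-bit cache entries (assumption)
CONTEXT = 128 * 1024 # "128k" interpreted as 131,072 tokens

gqa = kv_cache_bytes(CONTEXT, LAYERS, KV_HEADS, HEAD_DIM, BYTES_FP16)
mha = kv_cache_bytes(CONTEXT, LAYERS, QUERY_HEADS, HEAD_DIM, BYTES_FP16)

print(f"KV cache with GQA:    {gqa / 2**30:.0f} GiB")
print(f"KV cache without GQA: {mha / 2**30:.0f} GiB")
```

Under these assumptions, sharing key/value heads across groups of query heads shrinks the cache by the ratio of query heads to key/value heads, which is what makes long contexts practical to serve.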

Warm
Public
70B
FP8
32768
License: llama3.3
Hugging Face
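
As a rough sizing aid, the sketch below estimates the memory taken by the weights alone from the parameter count and precision shown above. It assumes the "FP8" tag means one byte per stored weight and uses the nominal 70 billion figure; the exact parameter count, activations, and the KV cache are not included.

```python
def weight_bytes(num_params, bytes_per_param):
    """Approximate memory for model weights only."""
    return num_params * bytes_per_param


NOMINAL_PARAMS = 70_000_000_000  # nominal "70B"; the exact count may differ

fp8 = weight_bytes(NOMINAL_PARAMS, 1)   # assumed: 1 byte per weight at FP8
bf16 = weight_bytes(NOMINAL_PARAMS, 2)  # 16-bit weights, for comparison

print(f"FP8 weights:    {fp8 / 1e9:.0f} GB")
print(f"16-bit weights: {bf16 / 1e9:.0f} GB")
```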