unsloth/Llama-3.3-70B-Instruct
unsloth/Llama-3.3-70B-Instruct is a 70-billion-parameter, instruction-tuned generative language model developed by Meta and optimized for multilingual dialogue. It uses an auto-regressive transformer architecture with Grouped-Query Attention (GQA) and supports a 128k-token context length. It was trained on over 15 trillion tokens with a December 2023 training-data cutoff and performs well on benchmarks such as MMLU Pro, HumanEval, and MATH. Supported languages include English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
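To give a sense of why GQA matters at a 128k context length, the following is a rough back-of-the-envelope sketch of key/value cache size for a single sequence. The layer count (80), query-head count (64), KV-head count (8), head dimension (128), and 2-byte bf16 storage are assumptions taken from the commonly published Llama 3 70B configuration, not from the description above, and "128k" is assumed to mean 131,072 tokens. The helper name `kv_cache_bytes` is hypothetical.

```python
def kv_cache_bytes(num_layers, num_kv_heads, head_dim, seq_len, bytes_per_value=2):
    """Approximate KV cache size in bytes for one sequence.

    The leading factor of 2 accounts for storing one key tensor and one
    value tensor per layer.
    """
    return 2 * num_layers * num_kv_heads * head_dim * seq_len * bytes_per_value


# Assumed configuration (not stated in the model description itself).
LAYERS = 80          # transformer layers
QUERY_HEADS = 64     # attention (query) heads per layer
KV_HEADS = 8         # shared key/value heads per layer under GQA
HEAD_DIM = 128       # dimension of each head
CONTEXT = 128 * 1024 # "128k" interpreted as 131,072 tokens

GIB = 1024 ** 3

# With GQA, only KV_HEADS key/value heads are cached per layer.
gqa_bytes = kv_cache_bytes(LAYERS, KV_HEADS, HEAD_DIM, CONTEXT)

# Hypothetical multi-head baseline: one KV head per query head.
mha_bytes = kv_cache_bytes(LAYERS, QUERY_HEADS, HEAD_DIM, CONTEXT)

print(f"GQA cache: {gqa_bytes / GIB:.0f} GiB")
print(f"MHA cache: {mha_bytes / GIB:.0f} GiB")
print(f"Reduction: {mha_bytes // gqa_bytes}x")
```

Under these assumptions the full-context cache comes to about 40 GiB with GQA versus about 320 GiB for a multi-head baseline, an 8x reduction equal to the ratio of query heads to KV heads. Actual memory use depends on the serving stack, cache precision, and batch size.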