unsloth/Llama-3.3-70B-Instruct
The unsloth/Llama-3.3-70B-Instruct is a 70 billion parameter instruction-tuned generative language model developed by Meta, optimized for multilingual dialogue use cases. It features an auto-regressive transformer architecture, Grouped-Query Attention (GQA), and a 128k context length. Trained on over 15 trillion tokens with a December 2023 cutoff, it excels in benchmarks like MMLU Pro, HumanEval, and MATH, supporting languages including English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
No reviews yet. Be the first to review!