meta-llama/Llama-3.3-70B-Instruct

The Meta Llama 3.3 70B Instruct is a 70 billion parameter instruction-tuned multilingual large language model developed by Meta, optimized for dialogue use cases. It features an optimized transformer architecture with Grouped-Query Attention and a 32,768 token context length. Trained on over 15 trillion tokens with a December 2023 data cutoff, this model excels in multilingual text generation and supports tool use, outperforming many chat models on industry benchmarks.

Warm
Public
70B
FP8
32768
License: llama3.3
Hugging Face
Gated

No reviews yet. Be the first to review!