robust-rlhf/Llama-3.3-70B-Instruct_ftjob-1e99f7048485-merged

The robust-rlhf/Llama-3.3-70B-Instruct_ftjob-1e99f7048485-merged model is a 70 billion parameter instruction-tuned Llama 3.3 variant developed by robust-rlhf. This model was fine-tuned using Unsloth and Huggingface's TRL library, achieving 2x faster training. It is designed for instruction-following tasks, leveraging its large parameter count and optimized training process for enhanced performance.

Warm
Public
70B
FP8
32768
License: apache-2.0
Hugging Face