robust-rlhf/Llama-3.3-70B-Instruct_ftjob-1e99f7048485-merged

The robust-rlhf/Llama-3.3-70B-Instruct_ftjob-1e99f7048485-merged model is a 70 billion parameter instruction-tuned Llama 3.3 variant developed by robust-rlhf. This model was fine-tuned using Unsloth and Huggingface's TRL library, achieving 2x faster training. It is designed for instruction-following tasks, leveraging its large parameter count and optimized training process for enhanced performance.

Warm

Public

Model Size: 70B

Quant: FP8

Ctx length: 32768

License: apache-2.0

Hugging Face