facebook/KernelLLM

KernelLLM is an 8 billion parameter large language model developed by Meta, based on Llama 3.1 Instruct, specifically fine-tuned for authoring GPU kernels using Triton. It translates PyTorch modules into efficient Triton kernel implementations, aiming to democratize GPU programming. The model demonstrates competitive performance against much larger models on kernel generation tasks, achieving a score of 20.2 (pass@1) on KernelBench-Triton. Its primary strength lies in automating the generation of high-performance Triton kernels from PyTorch code.

Status: Warm
Visibility: Public
Parameters: 8B
Quantization: FP8
Context length: 32768 tokens
License: other
Source: Hugging Face
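
Below is a minimal sketch of prompting KernelLLM to rewrite a small PyTorch module as a Triton kernel, using the standard Hugging Face transformers generation API. The prompt wording, dtype choice, and generation settings are illustrative assumptions, not the model's documented template.

```python
# Sketch: ask KernelLLM to translate a toy PyTorch module into a Triton kernel.
# Assumes standard transformers text generation; prompt format is illustrative.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "facebook/KernelLLM"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # assumption: bf16 fits the 8B weights on one GPU
    device_map="auto",
)

# A toy PyTorch module the model is asked to translate.
pytorch_source = '''
import torch
import torch.nn as nn

class Model(nn.Module):
    def forward(self, x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
        return torch.relu(x + y)
'''

prompt = (
    "Rewrite the following PyTorch module as an equivalent, efficient "
    "Triton kernel with a Python wrapper:\n" + pytorch_source
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=1024, do_sample=False)

# Print only the newly generated tokens (the proposed Triton implementation).
generated = outputs[0][inputs["input_ids"].shape[1]:]
print(tokenizer.decode(generated, skip_special_tokens=True))
```

Generated kernels should be validated for numerical correctness against the original PyTorch module before use, as is standard for benchmarks like KernelBench-Triton.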
