AI-Sweden-Models/gpt-sw3-6.7b-v2-instruct

AI-Sweden-Models/gpt-sw3-6.7b-v2-instruct is a 7.1 billion parameter decoder-only transformer language model developed by AI Sweden in collaboration with RISE and the WASP WARA for Media and Language. It is an instruction-tuned variant of GPT-SW3, which was pretrained on a 320 billion token dataset spanning Swedish, Norwegian, Danish, Icelandic, English, and programming code. The model generates coherent text in these five natural languages and four programming languages, and can carry out a variety of text tasks when prompted with instructions.
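
Below is a minimal sketch of instruction-based generation with this model using the Hugging Face transformers library. The load-and-generate calls are standard transformers API; the Swedish example instruction and the sampling settings are illustrative assumptions, and the exact chat/instruction prompt format should be checked against the model card.

```python
# Minimal usage sketch for AI-Sweden-Models/gpt-sw3-6.7b-v2-instruct.
# Assumes the standard transformers causal-LM API; prompt wording and
# generation parameters are examples, not the model card's exact recipe.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "AI-Sweden-Models/gpt-sw3-6.7b-v2-instruct"
device = "cuda" if torch.cuda.is_available() else "cpu"

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.float16)
model.to(device).eval()

# Example instruction in Swedish: "Summarize the following text in one sentence."
prompt = "Sammanfatta följande text i en mening: Träd är viktiga för klimatet eftersom de binder koldioxid."

input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(device)
with torch.no_grad():
    output = model.generate(
        input_ids,
        max_new_tokens=100,
        do_sample=True,
        temperature=0.6,
        top_p=0.95,
    )
print(tokenizer.decode(output[0], skip_special_tokens=True))
```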

Status: Cold
Visibility: Public
Parameters: 7.1B
Precision: FP8
Context length: 2048 tokens
License: other
Source: Hugging Face
