AI-Sweden-Models/gpt-sw3-1.3b
GPT-Sw3 1.3B is a decoder-only transformer language model with roughly 1.4 billion parameters, developed by AI Sweden in collaboration with RISE and the WASP WARA for Media and Language. It was pretrained on 320 billion tokens of Swedish, Norwegian, Danish, Icelandic, English, and programming-language text. The model generates coherent text in these five natural languages and in four programming languages, and can be prompted to perform text tasks it was not explicitly trained for by casting them as text-generation tasks.
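
Since the card describes prompt-driven text generation, here is a minimal sketch of loading the checkpoint and sampling a continuation with the Hugging Face transformers library; the prompt text and sampling parameters are illustrative assumptions, not settings recommended by the model card, and access to the gated weights is assumed.

```python
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM

model_name = "AI-Sweden-Models/gpt-sw3-1.3b"
device = "cuda" if torch.cuda.is_available() else "cpu"

# Load tokenizer and weights from the Hub (requires accepting the model license).
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name).to(device)
model.eval()

# Illustrative Swedish prompt ("Trees are nice because"); any of the five
# pretraining languages should work the same way.
prompt = "Träd är fina för att"
input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(device)

with torch.no_grad():
    output_ids = model.generate(
        input_ids,
        max_new_tokens=100,  # cap the length of the continuation
        do_sample=True,      # sample instead of greedy decoding
        temperature=0.6,     # assumed sampling settings, tune to taste
        top_p=1.0,
    )

print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

The same pattern covers instruction-style use: phrasing a task (summarize, translate, continue code) directly in the prompt and letting the model complete it as ordinary text generation.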