sail/Sailor-4B

Sailor-4B is a 4 billion parameter causal language model developed by sail, built upon the Qwen 1.5 architecture. It is specifically tailored for South-East Asian (SEA) languages, including Indonesian, Thai, Vietnamese, Malay, and Lao, with a context length of 32768 tokens. The model excels at understanding and generating text in these diverse linguistic landscapes, demonstrating proficiency in tasks like question answering and commonsense reasoning in SEA languages.

Loading
Public
4B
BF16
32768
License: apache-2.0
Hugging Face