sail/Sailor-4B-Chat

Sailor-4B-Chat is a 4 billion parameter instruction-tuned causal language model developed by sail, built upon the Qwen 1.5 architecture. It is specifically tailored for South-East Asian languages including Indonesian, Thai, Vietnamese, Malay, and Lao, while maintaining proficiency in English and Chinese. The model features a 32768-token context length and excels in tasks like question answering and commonsense reasoning across these diverse linguistic landscapes.

Cold
Public
4B
BF16
32768
License: apache-2.0
Hugging Face