JetBrains-Research/Qwen3-8B-am

JetBrains-Research/Qwen3-8B-am is an 8 billion parameter language model developed by JetBrains-Research. This model is a variant of the Qwen architecture, designed for general language understanding and generation tasks. With a context length of 32768 tokens, it is suitable for applications requiring processing of longer inputs and generating comprehensive responses. Its primary strength lies in its ability to handle diverse linguistic challenges effectively.

Warm
Public
8B
FP8
32768
Hugging Face

No reviews yet. Be the first to review!