AlexHung29629/add_vision_3
AlexHung29629/add_vision_3 is a 24-billion-parameter language model with a 32,768-token context length. It is designed to process and understand visual inputs in addition to text. This vision integration is its primary differentiator and makes it suitable for multimodal applications that require both textual and visual comprehension.
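
To give a feel for what the two stated figures imply in practice, the following is a rough sketch of the arithmetic, not a measurement of this model. It estimates the memory needed for the weights alone at a few common numeric precisions and checks whether a request fits in the context window. The dtype list, the helper names `weight_memory_gib` and `fits_in_context`, and the idea that the prompt and generated tokens share one window are illustrative assumptions; the model card does not state the precision the weights are stored in or how image inputs are counted.

```python
# Back-of-the-envelope sizing from the two figures in the description.
# Everything except PARAM_COUNT and CONTEXT_LENGTH is an assumption
# made for illustration.

PARAM_COUNT = 24_000_000_000  # "24 billion parameters"
CONTEXT_LENGTH = 32_768       # maximum context length in tokens

# Standard storage sizes per parameter; which of these applies to the
# published weights is not stated in the source.
BYTES_PER_PARAM = {
    "float32": 4.0,
    "bfloat16": 2.0,
    "int8": 1.0,
    "int4": 0.5,
}


def weight_memory_gib(param_count: int, bytes_per_param: float) -> float:
    """Memory for the weights alone, in GiB (excludes activations and KV cache)."""
    return param_count * bytes_per_param / 2**30


def fits_in_context(prompt_tokens: int, max_new_tokens: int,
                    context_length: int = CONTEXT_LENGTH) -> bool:
    """True if the prompt plus the requested generation fits in the window."""
    return prompt_tokens + max_new_tokens <= context_length


if __name__ == "__main__":
    for dtype, size in BYTES_PER_PARAM.items():
        print(f"{dtype:>8}: {weight_memory_gib(PARAM_COUNT, size):6.1f} GiB")
    print(fits_in_context(prompt_tokens=30_000, max_new_tokens=2_000))
```

These numbers cover the weights only; actual usage will be higher once activations, the key-value cache, and any vision components are included. If image inputs are converted into tokens that count against the same window, as is common in multimodal models, `prompt_tokens` should include them.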