AIDC-AI/Marco-o1
AIDC-AI/Marco-o1 is a 7.6 billion parameter large language model developed by the MarcoPolo Team at AI Business, Alibaba International Digital Commerce. It is optimized for complex real-world problem-solving and open-ended reasoning tasks, leveraging Chain-of-Thought (CoT) fine-tuning, Monte Carlo Tree Search (MCTS), and reflection mechanisms. The model demonstrates enhanced reasoning capabilities on datasets like MGSM (English and Chinese) and shows proficiency in nuanced machine translation tasks.