miromind-ai/MiroThinker-14B-DPO-v0.2

MiroThinker-14B-DPO-v0.2 is a 14 billion parameter open-source agentic model developed by miromind-ai, designed as a research agent for complex, long-horizon problem solving. It integrates capabilities such as task decomposition, multi-hop reasoning, retrieval-augmented generation, code execution, web browsing, and document/file processing. This DPO-trained model features richer training data from English and Chinese sources and an extended context length of 32768 tokens, showing significant gains in general research agent capabilities on benchmarks like GAIA-Text-103 and BrowseComp-ZH.

Cold

Public

Model Size: 14B

Quant: FP8

Ctx length: 32768

License: apache-2.0

Hugging Face