PokeeAI/pokee_research_7b

PokeeResearch-7B by Pokee AI is a 7.6-billion-parameter, tool-augmented LLM research agent fine-tuned from Qwen2.5-7B-Instruct. The description states a 131072-token context length, while the Ctx length field of this listing specifies 32768 tokens. The model combines Reinforcement Learning from AI Feedback (RLAIF) with a robust reasoning scaffold to carry out complex, multi-step research workflows, including self-correction and synthesis. It is optimized for deep research automation: it autonomously decomposes queries, retrieves external sources, and synthesizes factual, verifiable answers.
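The workflow described above (decompose, retrieve, synthesize, self-correct) can be pictured as a simple loop. The following is a minimal sketch under stated assumptions, not Pokee AI's implementation: `run_research`, the `toy_*` helpers, and `CORPUS` are hypothetical names invented for illustration, and a real agent would call the model and external search tools where the stand-ins appear.

```python
from typing import Callable, List


def run_research(
    question: str,
    decompose: Callable[[str, List[str]], List[str]],
    retrieve: Callable[[str], List[str]],
    synthesize: Callable[[str, List[str]], str],
    verify: Callable[[str, str, List[str]], bool],
    max_rounds: int = 3,
) -> str:
    """Hypothetical research loop: decompose, retrieve, synthesize, verify."""
    notes: List[str] = []
    answer = ""
    for _ in range(max_rounds):
        # Break the question into sub-queries, given what is already known.
        for sub_query in decompose(question, notes):
            notes.extend(retrieve(sub_query))
        answer = synthesize(question, notes)
        # Self-correction step: stop only once the draft passes the check,
        # otherwise run another round of retrieval and synthesis.
        if verify(question, answer, notes):
            break
    return answer


# Toy stand-ins (assumptions): a tiny in-memory "corpus" replaces web search.
CORPUS = {
    "base model": "Fine-tuned from Qwen2.5-7B-Instruct.",
    "license": "Released under apache-2.0.",
}


def toy_decompose(question: str, notes: List[str]) -> List[str]:
    # Ask only about topics mentioned in the question and not yet covered.
    return [t for t in CORPUS if t in question and CORPUS[t] not in notes]


def toy_retrieve(sub_query: str) -> List[str]:
    return [CORPUS[sub_query]]


def toy_synthesize(question: str, notes: List[str]) -> str:
    return " ".join(notes)


def toy_verify(question: str, answer: str, notes: List[str]) -> bool:
    # Accept the draft only if every retrieved note is reflected in it.
    return bool(notes) and all(note in answer for note in notes)


answer = run_research(
    "What is the base model and license?",
    toy_decompose,
    toy_retrieve,
    toy_synthesize,
    toy_verify,
)
print(answer)
```

Passing the four stages in as callables keeps the loop independent of any particular model or search backend, which is a design choice of this sketch rather than something the listing specifies.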

Warm

Public

Model Size: 7.6B

Quant: FP8

Ctx length: 32768

License: apache-2.0

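As a rough back-of-the-envelope check on the Model Size and Quant fields, the weight footprint can be estimated from the parameter count and the bytes stored per parameter. This sketch assumes FP8 stores one byte per weight and ignores the KV cache, activations, and any layers kept at higher precision, so actual memory use will be higher. The helper name `weight_gb` is hypothetical.

```python
# Parameter count taken from the listing (7.6B).
PARAMS = 7.6e9

# Bytes per parameter for common storage formats (assumption: dense weights).
BYTES_PER_PARAM = {"fp8": 1, "fp16": 2, "fp32": 4}


def weight_gb(params: float, dtype: str) -> float:
    """Approximate weight size in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * BYTES_PER_PARAM[dtype] / 1e9


fp8_gb = weight_gb(PARAMS, "fp8")
fp16_gb = weight_gb(PARAMS, "fp16")
print(f"FP8 weights:  ~{fp8_gb:.1f} GB")
print(f"FP16 weights: ~{fp16_gb:.1f} GB")
```

Under these assumptions the FP8 weights take about half the space of an FP16 copy of the same model.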