qwen3-1b7
JameSand/qwen3-1.7b-base-adam-3e-6-bs128-kl0.0-global_step_200
0
3
qwen25-3b
akcit-motion/qwen2.5-3b-instruct-motion-base
1
92
qwen2-0b5
AnotherMiner/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-hibernating_agile_marmot
68
mlabonne/Qwen3-1.7B-abliterated
15
158
llama32-1b
distil-labs/Distil-PII-Llama-3.2-1B-Instruct
6
195
ericoh929/qwen3-1.7b-huggingfaceh4-instruction-data-lora-instruction-tuned
77
Klingspor/StarPO-1.7B
49
qwen2-1b5
cdomingoenrich/qwen15_code200tok_step1750
24
qwen3-0b6
ellamind/propella-1-0.6b
2
86
llama32-3b
Evangelinejy/llama3b-midtrain-data_sft_50k_leon_nemotron_thinking-bs4-epoch1.0-ctx8192-ga1-lr5e-06-wr0.1-n4
5
0xBonge/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-flexible_fierce_owl
106
phi2-3b
marcel/phi-2-openhermes-30k
67
qwen3-4b
mlxha/Qwen3-4B-grpo-medmcqa
103
qwen15-0b5
FreedomIntelligence/Apollo-0.5B
295
gemma-2b
Edcastro/gemma-2b-it-edcastr_JavaScript-v8
71
akshayballal/Qwen3-1.7B-Pubmed-16bit-GRPO
458
ahmadmakk/Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-slithering_scampering_anteater
80
delinkz/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-lightfooted_humming_gull
72
qwen3-14b
JetBrains-Research/Qwen3-14B-am
CriteriaPO/qwen2.5-3b-dpo-coarse
50