Model Releases

qwen3-1b7

JameSand/qwen3-1.7b-base-adam-3e-6-bs128-kl0.0-global_step_200

0

3

J

qwen25-3b

akcit-motion/qwen2.5-3b-instruct-motion-base

1

92

A

qwen2-0b5

AnotherMiner/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-hibernating_agile_marmot

0

68

A

qwen3-1b7

mlabonne/Qwen3-1.7B-abliterated

15

158

M

llama32-1b

distil-labs/Distil-PII-Llama-3.2-1B-Instruct

6

195

D

qwen3-1b7

ericoh929/qwen3-1.7b-huggingfaceh4-instruction-data-lora-instruction-tuned

0

77

E

qwen3-1b7

Klingspor/StarPO-1.7B

0

49

K

qwen2-1b5

cdomingoenrich/qwen15_code200tok_step1750

0

24

C

qwen3-0b6

ellamind/propella-1-0.6b

2

86

E

llama32-3b

Evangelinejy/llama3b-midtrain-data_sft_50k_leon_nemotron_thinking-bs4-epoch1.0-ctx8192-ga1-lr5e-06-wr0.1-n4

0

5

E

qwen2-0b5

0xBonge/Qwen2.5-0.5B-Instruct-Gensyn-Swarm-flexible_fierce_owl

0

106

0

phi2-3b

marcel/phi-2-openhermes-30k

0

67

M

qwen3-4b

mlxha/Qwen3-4B-grpo-medmcqa

2

103

M

qwen15-0b5

FreedomIntelligence/Apollo-0.5B

3

295

F

gemma-2b

Edcastro/gemma-2b-it-edcastr_JavaScript-v8

0

71

E

qwen3-1b7

akshayballal/Qwen3-1.7B-Pubmed-16bit-GRPO

0

458

A

qwen2-1b5

ahmadmakk/Qwen2.5-Coder-1.5B-Instruct-Gensyn-Swarm-slithering_scampering_anteater

0

80

A

qwen2-0b5

delinkz/Qwen2.5-Coder-0.5B-Instruct-Gensyn-Swarm-lightfooted_humming_gull

0

72

D

qwen3-14b

JetBrains-Research/Qwen3-14B-am

0

77

J

qwen25-3b

CriteriaPO/qwen2.5-3b-dpo-coarse

0

50

C