Model Releases

mistral-24b | coder3101/Magidonia-24B-v4.3-heretic-v2 | 4 | 37
qwen25-3b | Ellbendls/Qwen-2.5-3b-Text_to_SQL | 9 | 350
gemma2-9b | anthracite-org/magnum-v3-9b-customgemma2 | 20 | 67
qwen2-7b | Haiintel/HaiJava-Surgeon-Qwen2.5-Coder-7B-SFT-v1 | 3 | 7
mistral-24b | CrucibleLab/M3.2-24B-Loki-V2 | 22 | 20
qwen3-1b7 | MultiRL/qwen3_1.7b_new_standard_A_sft_overfit_lr_5e_6__global_step_192 | 0 | 48
qwen3-1b7 | MultiRL/qwen3_1.7b_new_standard_A_sft_overfit_lr_5e_6__global_step_96 | 0 | 50
gemma3t-1b | RLLab/gemma-3-1b-text-it | 0 | 20
qwen3-1b7 | MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_geo_ms_seq_is | 0 | 45
qwen3-1b7 | MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_geo_ms_seq_is_epoch3 | 0 | 2
llama31-70b | AlignmentResearch/hr_sdf_pisces_explicit_Llama-3.1-70B-Instruct_3_epochs_v3_merged | 0 | 47
qwen2-7b | alexgusevski/Qwen2.5-7B-Instruct-1M-Thinking-Claude-Gemini-GPT5.2-DISTILL-mlx-fp16 | 0 | 277
llama31-8b | usr256864/ee_lm8_grpo | 0 | 57
llama31-70b | AlignmentResearch/hr_hand_crafted_Llama-3.3-70B_medium_parity_15_epochs_merged_v1 | 0 | 78
qwen3-1b7 | MultiRL/qwen3_1.7b_easy_rl_ours_adv_fixed_geo_ms_token_tis | 0 | 48
qwen2-7b | synthetic-code-training/qwen25-coder-7b-swe-gym-2291i-no-docstring-gen-5e-0-00005lr-bs16-bf16 | 0 | 5
qwen3-8b | fullgoal/affine-g15-5EhM3q9z5Yj4Vf2sgUSEbBTuqCvdMqQvFrnA3N9ZHnbxv7jG | 0 | 4
llama32-3b | rrvaswin/32b_SFT | 0 | 21
qwen2-7b | zeynebnk/qwen7b_bcb_grpo_step100 | 0 | 5
llama32-1b | lakshyaixi/Llama_3_2_1B_Conversation_v8_SFT | 0 | 480