rinna/qwen2.5-bakeneko-32b-instruct-v2

rinna/qwen2.5-bakeneko-32b-instruct-v2 is a 32.8 billion parameter instruction-tuned causal language model developed by rinna, based on the Qwen2.5 architecture. This model is fine-tuned using Chat Vector and Odds Ratio Preference Optimization (ORPO) to enhance instruction-following capabilities. It demonstrates performance comparable to a reasoning model on Japanese MT-Bench without requiring additional reasoning processes, making it suitable for complex Japanese language tasks.

Warm

Public

Model Size: 32.8B

Quant: FP8

Ctx length: 131072

License: apache-2.0

Hugging Face