jinaai/ReaderLM-v2

ReaderLM-v2 by Jina AI is a 1.54 billion parameter autoregressive, decoder-only transformer model with a 512K token context window. It specializes in converting raw HTML into formatted Markdown or JSON with high accuracy, supporting 29 languages. The model excels at HTML parsing, transformation, and text extraction, particularly for generating complex elements and structured JSON output.

Warm
Public
1.5B
BF16
131072
License: cc-by-nc-4.0
Hugging Face