DocScope-R1
cosmos reason1 / docscopeocr / visionocr / captioner relaxed
Qwen Image LoRAs
Florence-2-large / Florence-2-base
OCR, VQA, Thinking and Object Detection.
High-accuracy vision & reasoning for complex tasks
Experiment with the Tiny VLMs here
Qwen3-VL / Qwen2.5-VL
Generate answers from images and text