Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
PaddlePaddle 's Collections
PaddleOCR-VL
PP-StructureV3
PP-OCRv5
PP-OCRv4
PP-OCRv3

PaddleOCR-VL

updated 10 days ago

Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Upvote
19

  • PaddlePaddle/PaddleOCR-VL

    Image-Text-to-Text • 1.0B • Updated 4 days ago • 17.3k • 1.11k

  • Running
    145
    145

    PaddleOCR-VL Online Demo

    📈

    Recognize text and elements in images


  • PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

    Paper • 2510.14528 • Published 12 days ago • 71
Upvote
19
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs