Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
			
	
	PaddlePaddle
company
						
	Verified
						
						
						AI & ML interests
Deep Learning Framework
Recent Activity
	View all activity
	
				Papers
		GraphNet: A Large-Scale Computational Graph Dataset for Tensor Compiler Research
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese
			
	
	Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model
			
	
	
PP-StructureV3 is a SOTA document parsing solution on OmniDocBench, supporting the conversion of PDFs and do cument images to Markdown and JSON.
			
	
	PP-OCRv5 is the latest text recognition solution, supporting Simplified Chinese, Chinese Pinyin, Traditional Chinese, English, and Japanese