ibm-granite/granite-docling-258M Image-Text-to-Text • 0.3B • Updated Sep 23, 2025 • 190k • 1.07k
openai/clip-vit-large-patch14 Zero-Shot Image Classification • 0.4B • Updated Sep 15, 2023 • 7.71M • 1.94k
HuggingFaceTB/SmolVLM2-2.2B-Instruct Image-Text-to-Text • 2B • Updated Apr 8, 2025 • 95k • 295
google/vit-base-patch16-224-in21k Image Feature Extraction • 86.4M • Updated Feb 5, 2024 • 1.3M • 392
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated 25 days ago • 216k • 1.56k
microsoft/Phi-3-vision-128k-instruct Text Generation • 4B • Updated 25 days ago • 22.1k • 969
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • 71B • Updated Apr 13, 2025 • 4.13k • • 2.06k
microsoft/table-transformer-structure-recognition-v1.1-all Object Detection • 28.8M • Updated Nov 18, 2023 • 366k • 78