Models: 785

Active filters: vision-language

Mattimax/DATA-AI_Smol256M-Instruct

0.3B • Updated Feb 16 • 2

ctranslate2-4you/GOT-OCR2_0-Customized

Image-Text-to-Text • 0.7B • Updated Feb 17 • 2

sbintuitions/sarashina2-vision-8b

Image-to-Text • 8B • Updated Mar 27 • 212 • 8

sbintuitions/sarashina2-vision-14b

Image-to-Text • 14B • Updated Mar 27 • 52 • 9

aihpi/food-waste-vlm

8B • Updated Apr 1 • 1

jpark677/internvl2-8b-mmbench-lora-ep-1-waa-false

Image-to-Text • 8B • Updated Apr 3

jpark677/internvl2-8b-mmbench-lora-ep-2-waa-false

Image-to-Text • 8B • Updated Apr 3

mradermacher/SpaceQwen2.5-VL-3B-Instruct-GGUF

Robotics • 3B • Updated Jul 31 • 383 • 1

mradermacher/SpaceQwen2.5-VL-3B-Instruct-i1-GGUF

Robotics • 3B • Updated Jul 11 • 453 • 1

TheEighthDay/SeekWorld_RL_PLUS

8B • Updated Apr 19 • 243 • 1

mradermacher/SeekWorld_RL_PLUS-GGUF

8B • Updated Jul 31 • 112

nkkbr/ViCA-ARKitScenes

Video-Text-to-Text • 8B • Updated May 7 • 2

nkkbr/ViCA-ScanNet

Video-Text-to-Text • 8B • Updated May 7

nkkbr/ViCA-base

Video-Text-to-Text • 8B • Updated May 7

nkkbr/ViCA

Video-Text-to-Text • 8B • Updated May 28 • 2

nkkbr/ViCA-ScanNetPP

Video-Text-to-Text • 8B • Updated May 7

nkkbr/ViCA2-stage1-align

Video-Text-to-Text • 8B • Updated May 15 • 1

nkkbr/ViCA2-stage2-onevision-ft

Video-Text-to-Text • 8B • Updated May 15 • 2

nkkbr/ViCA2

Video-Text-to-Text • 8B • Updated May 28

nkkbr/ViCA2-init

Video-Text-to-Text • 8B • Updated May 15

remyxai/SpaceOm

Image-Text-to-Text • 4B • Updated Jul 6 • 207 • 12

ChongyuWang/ShowUI_Grounding_Qwen_2B_pretrained

Updated Apr 26 • 1

kevin510/friday

Text Generation • 4B • Updated Sep 23 • 3

lusxvr/nanoVLM-222M

Image-Text-to-Text • 0.2B • Updated May 8 • 263 • 96

yemalin/furniture-captioner

0.2B • Updated May 4

ragunath-ravi/blip-histopathology-finetuned

Image-to-Text • 0.2B • Updated May 4 • 3 • 4

nkkbr/ViCA2-thinkng

Video-Text-to-Text • 8B • Updated May 15 • 2

nkkbr/ViCA-thinking

Video-Text-to-Text • 8B • Updated May 7

aosm/Qwen2-VL-7B-PMC-VQA

ariG23498/nanoVLM-demo

Image-Text-to-Text • Updated May 7 • 2