StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling
- mengwei0427/StreamVLN_Video_qwen_1_5_r2r_rxr_envdrop_scalevln
  Robotics • 8B • 138 • 2
- mengwei0427/StreamVLN_Video_qwen_1_5_r2r_rxr_envdrop_scalevln_v1_3
  Text Generation • 8B • 107
- mengwei0427/StreamVLN_Video_qwen_1_5_r2r_rxr_envdrop_scalevln_real_world
  8B • 73
- chchnii/StreamVLN-ScanQA-SQA3D-Data
  Viewer • 53.1k • 27 • 1