bosonai/higgs-audio-v2-generation-3B-base Text-to-Speech • 6B • Updated Jul 28, 2025 • 206k • 652
microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • 6B • Updated Dec 10, 2025 • 151k • 1.56k
moonshotai/Kimi-VL-A3B-Thinking-2506 Image-Text-to-Text • 16B • Updated Aug 18, 2025 • 162k • 335