Chonky models segment text into semantically meaningful chunks. They could be useful as the text-chunking module of a RAG system.
- mirth/chonky_distilbert_base_uncased_1
  Token Classification • 66.4M params • 27.2k downloads • 15 likes
- mirth/chonky_mmbert_small_multilingual_1
  Token Classification • 0.1B params • 177 downloads • 23 likes
- mirth/chonky_modernbert_base_1
  Token Classification • 0.1B params • 32.2k downloads • 6 likes
- mirth/chonky_modernbert_large_1
  Token Classification • 0.4B params • 1.98k downloads • 2 likes
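
Since these are token-classification models, one plausible way to use them for chunking is to run the standard `transformers` token-classification pipeline and cut the text wherever the model tags a token. The sketch below is a rough illustration, not the author's reference usage: the helper names `split_at_boundaries` and `chunk_with_model` are hypothetical, and it assumes that every tagged span marks the end of a chunk. It also does not handle inputs longer than the model's maximum sequence length.

```python
from typing import Iterable, List


def split_at_boundaries(text: str, boundaries: Iterable[int]) -> List[str]:
    """Cut `text` at each character offset in `boundaries`; drop empty pieces."""
    chunks, start = [], 0
    for end in sorted(set(boundaries)):
        # Ignore offsets that are out of range or do not advance the cursor.
        if start < end <= len(text):
            chunks.append(text[start:end].strip())
            start = end
    chunks.append(text[start:].strip())
    return [c for c in chunks if c]


def chunk_with_model(
    text: str,
    model_id: str = "mirth/chonky_distilbert_base_uncased_1",
) -> List[str]:
    """Hypothetical wrapper: tag tokens with the model, then split the text."""
    # Imported lazily so split_at_boundaries works without transformers installed.
    from transformers import pipeline

    tagger = pipeline(
        "token-classification",
        model=model_id,
        aggregation_strategy="simple",
    )
    # Assumption: each aggregated entity is a span that closes a chunk,
    # so its "end" character offset is used as the cut point.
    return split_at_boundaries(text, [ent["end"] for ent in tagger(text)])
```

Calling `chunk_with_model(document_text)` would download the model on first use. The splitting logic is kept separate so it can be checked without the model, and so a different boundary convention can be swapped in if the labels turn out to mean something else.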