SpeakerLM: End-to-End Versatile Speaker Diarization and Recognition with Multimodal Large Language Models Paper • 2508.06372 • Published Aug 8 • 2