Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published Aug 21, 2025 • 86
gpt-oss Collection Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 396
Article • Introducing HELMET: Holistically Evaluating Long-context Language Models • Apr 16, 2025 • 40
Article • Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference • Jan 16, 2025 • 76
Article • Accelerate StarCoder with 🤗 Optimum Intel on Xeon: Q8/Q4 and Speculative Decoding • Jan 30, 2024 • 9
Distributed Speculative Inference of Large Language Models Paper • 2405.14105 • Published May 23, 2024 • 18