view article Article Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques By jmamou and 8 others • Mar 24 • 20
view article Article Universal Assisted Generation: Faster Decoding with Any Assistant Model Oct 29, 2024 • 59