Article: DeepMath: A lightweight math reasoning Agent with smolagents • Dec 4, 2025 • 39
Article: Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models • Sep 29, 2025 • 23
Speculative Decoding Draft Models • Collection of OpenVINO-optimized efficient draft models for speculative decoding • 5 items • Updated 25 days ago • 10
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation Paper • 2408.02545 • Published Aug 5, 2024 • 39