Article Nemotron 3 Nano - A new Standard for Efficient, Open, and Intelligent Agentic Models 10 days ago • 98
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments Paper • 2404.07972 • Published Apr 11, 2024 • 51
Research Papers/Reviews/Literature Collection Daily Research papers and review including older relevant content. • 67 items • Updated Nov 14 • 2
Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Paper • 2501.18585 • Published Jan 30 • 61
JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence Paper • 2510.23538 • Published Oct 27 • 96
BEV-SUSHI: Multi-Target Multi-Camera 3D Detection and Tracking in Bird's-Eye View Paper • 2412.00692 • Published Dec 1, 2024 • 1
Physical AI Collection Collection of open, commercial-grade datasets for physical AI developers • 23 items • Updated 1 day ago • 101
Article Llama‑Embed‑Nemotron‑8B Text Embedding Model Ranks First on Multilingual MTEB Leaderboard Oct 21 • 14
nvidia/llama-nemoretriever-colembed-3b-v1 Visual Document Retrieval • 4B • Updated 1 day ago • 844 • 67
Cache-to-Cache: Direct Semantic Communication Between Large Language Models Paper • 2510.03215 • Published Oct 3 • 97
Paper2Video: Automatic Video Generation from Scientific Papers Paper • 2510.05096 • Published Oct 6 • 118