361 3

Grant Singleton

grantsing

AI & ML interests

Computer vision, robotics, LLMs

Recent Activity

commented on a paper about 13 hours ago

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

commented on a paper about 19 hours ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

commented on a paper 1 day ago

Next-Embedding Prediction Makes Strong Vision Learners

View all activity

Organizations

None yet

commented a paper about 13 hours ago

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

Paper • 2512.16969 • Published 6 days ago • 101 •

commented a paper about 19 hours ago

LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling

Paper • 2511.20785 • Published 29 days ago • 153 •

commented 2 papers 1 day ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published 6 days ago • 77 •

WorldGen: From Text to Traversable and Interactive 3D Worlds

Paper • 2511.16825 • Published Nov 20 • 21 •

commented 3 papers 3 days ago

commented 4 papers 4 days ago

Generalist Foundation Models Are Not Clinical Enough for Hospital Operations

Paper • 2511.13703 • Published Nov 17 • 21 •

Kling-Omni Technical Report

Paper • 2512.16776 • Published 6 days ago • 154 •

MMGR: Multi-Modal Generative Reasoning

Paper • 2512.14691 • Published 8 days ago • 114 •

Step-GUI Technical Report

Paper • 2512.15431 • Published 7 days ago • 121 •

commented 2 papers 5 days ago

The Universal Weight Subspace Hypothesis

Paper • 2512.05117 • Published 20 days ago • 1 •

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 20 days ago • 168 •

commented 6 papers 6 days ago

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Paper • 2512.08765 • Published 15 days ago • 125 •

CLaRa: Bridging Retrieval and Generation with Continuous Latent Reasoning

Paper • 2511.18659 • Published about 1 month ago • 18 •

Docling Technical Report

Paper • 2408.09869 • Published Aug 19, 2024 • 2 •

ColPali: Efficient Document Retrieval with Vision Language Models

Paper • 2407.01449 • Published Jun 27, 2024 • 49 •

Kuwain 1.5B: An Arabic SLM via Language Injection

Paper • 2504.15120 • Published Apr 21 • 121 •

MMSearch-Plus: A Simple Yet Challenging Benchmark for Multimodal Browsing Agents

Paper • 2508.21475 • Published Aug 29 • 2 •

commented a paper 13 days ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published 22 days ago • 228 •

Grant Singleton

AI & ML interests

Recent Activity

Organizations

grantsing's activity