Building on HF

Prithiv Sakthi PRO

prithivMLmods

hugging-science

https://linktr.ee/prithivsakthi

AI & ML interests

computer vision, nlp, multimodality - HuggingFace Fellow🤗

Recent Activity

liked a model about 1 hour ago

prithivMLmods/QIE-2511-Cinematic-FlatLog-Control

published a model about 1 hour ago

prithivMLmods/QIE-2511-Cinematic-FlatLog-Control

new activity about 1 hour ago

prithivMLmods/QIE-2511-Cinematic-FlatLog-Control:upload squashed weights

upvoted a paper about 14 hours ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 268

upvoted a paper about 20 hours ago

Kimi K2.5: Visual Agentic Intelligence

Paper • 2602.02276 • Published 4 days ago • 196

upvoted 4 papers about 21 hours ago

WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning

Paper • 2602.04634 • Published 1 day ago • 80

Rethinking the Trust Region in LLM Reinforcement Learning

Paper • 2602.04879 • Published 1 day ago • 25

SoMA: A Real-to-Sim Neural Simulator for Robotic Soft-body Manipulation

Paper • 2602.02402 • Published 4 days ago • 31

ERNIE 5.0 Technical Report

Paper • 2602.04705 • Published 1 day ago • 209

upvoted 2 articles 2 days ago

Article

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

3 days ago • 24

Article

TruthTensor: LLM Evaluation in Prediction Markets Under Drift and Market Baseline

7 days ago • 18

upvoted a changelog 3 days ago

Changelog

Find All Your Blog Drafts in One Place

4 days ago • 26

upvoted 3 papers 5 days ago

Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report

Paper • 2601.21051 • Published 8 days ago • 12

DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation

Paper • 2601.22153 • Published 7 days ago • 68

MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods

Paper • 2601.21821 • Published 8 days ago • 57

upvoted a changelog 8 days ago

Changelog

View Running Jobs Count from the User Menu

8 days ago • 39

upvoted a collection 9 days ago

HunyuanImage

4 items • Updated about 24 hours ago • 13

upvoted a paper 10 days ago

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published 29 days ago • 52

upvoted a collection 12 days ago

QIE Jan 23, 26

adapter LoRA developed for Qwen’s Qwen-Image-Edit-2511 image-to-image model • 7 items • Updated 3 days ago • 3

upvoted 4 papers 14 days ago

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Paper • 2601.15165 • Published 16 days ago • 71

EvoCUA: Evolving Computer Use Agents via Learning from Scalable Synthetic Experience

Paper • 2601.15876 • Published 15 days ago • 89

BayesianVLA: Bayesian Decomposition of Vision Language Action Models via Latent Action Queries

Paper • 2601.15197 • Published 16 days ago • 54

GutenOCR: A Grounded Vision-Language Front-End for Documents

Paper • 2601.14490 • Published 16 days ago • 36