lancer

lancer001010

AI & ML interests

None yet

Recent Activity

upvoted a paper 8 days ago

Memory in the Age of AI Agents

upvoted an article about 1 month ago

Continuous batching from first principles

upvoted an article 2 months ago

Supercharge your OCR Pipelines with Open Models

View all activity

Organizations

None yet

upvoted a paper 8 days ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published 11 days ago • 112

upvoted an article about 1 month ago

Article

Continuous batching from first principles

Nov 25

•

285

upvoted an article 2 months ago

Article

Supercharge your OCR Pipelines with Open Models

Oct 21

•

282

upvoted 2 articles 3 months ago

Article

mem-agent: Equipping LLM Agents with Memory Using RL

Oct 9

•

Article

From GRPO to DAPO and GSPO: What, Why, and How

Aug 9

•

upvoted a paper 3 months ago

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published Sep 2 • 227

liked a model 4 months ago

deepseek-ai/DeepSeek-V3.1-Base

Text Generation • 685B • Updated Aug 26 • 6.23k • 1k

upvoted a paper 6 months ago

MemOS: A Memory OS for AI System

Paper • 2507.03724 • Published Jul 4 • 157

published a Space 6 months ago

ChatCat

💬

Interact with a friendly chatbot

upvoted an article 7 months ago

Article

Vision Language Models (Better, faster, stronger)

May 12

•

573

updated 2 collections 7 months ago

RL

Collection

2 items • Updated May 30

KV Cache 优化

Collection

3 items • Updated May 30

upvoted 2 articles 8 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7

•

262

Article

I trained a Language Model to schedule events with GRPO!

Apr 29

•

upvoted a paper 9 months ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published Apr 10 • 133

liked a model 9 months ago

agentica-org/DeepCoder-14B-Preview

Text Generation • 15B • Updated May 11 • 1.75k • • 682

liked a model 10 months ago

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11 • 97.8k • • 2.87k

lancer

AI & ML interests

Recent Activity

Organizations

lancer001010's activity

Continuous batching from first principles

Supercharge your OCR Pipelines with Open Models

mem-agent: Equipping LLM Agents with Memory Using RL

From GRPO to DAPO and GSPO: What, Why, and How

ChatCat

Vision Language Models (Better, faster, stronger)

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

I trained a Language Model to schedule events with GRPO!