Not All Bits Are Equal: Scale-Dependent Memory Optimization Strategies for Reasoning Models Paper • 2510.10964 • Published Oct 13 • 3
Devstral 2 Collection A pair of agentic LLMs for software engineering tasks, excelling at using tools to explore codebases and edit multiple files, and at powering SWE agents. • 3 items • Updated 18 days ago • 37
Nemotron-Post-Training-v3 Collection Collection of datasets used in the post-training phase of Nemotron Nano v3. • 7 items • Updated 4 days ago • 50
Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples Paper • 2510.07192 • Published Oct 8 • 5
Balancing Diversity and Risk in LLM Sampling: How to Select Your Method and Parameter for Open-Ended Text Generation Paper • 2408.13586 • Published Aug 24, 2024 • 3
Lingshu MLLMs Collection Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning • 4 items • Updated Oct 9 • 21
Kimi-VL-A3B Collection Moonshot's efficient MoE VLMs, excelling at agentic, long-context, and reasoning tasks. • 7 items • Updated Oct 30 • 77
RpR Models Collection RpR (RolePlay with Reasoning) models which are built on RPMax datasets with properly trained multi-turn reasoning. • 8 items • Updated Jun 25 • 16
GPT-OSS General (4.2B to 20B) Collection Collection of pruned GPT-OSS models spanning 1 to 32 experts, maintaining general capabilities across domains while reducing computational requirements. • 29 items • Updated Aug 13 • 10
Qwen3-Coder Collection The Qwen3-Coder models deliver SOTA performance on agentic coding and code tasks. Includes Qwen3-Coder-480B-A35B. • 9 items • Updated 3 days ago • 16
GLM-4.5 Collection GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated Aug 11 • 250
Qwen 2.5 Coder Collection Complete collection of the code-specific Qwen2.5 model series in bnb 4-bit, 16-bit, and GGUF formats. • 35 items • Updated 3 days ago • 36
NTQ AI LM Collection A collection of language models (LLMs) fine-tuned on diverse datasets. • 4 items • Updated Feb 14 • 3