Yazan Agha-Schrader PRO

phi0112358

AI & ML interests

Brain, EEG, BCI, consciousness, autism, octopus, automation, a.i., etymology, numbers, spirituality, astronomy

Recent Activity

liked a model 24 days ago

Brianpuz/Mistral-Small-3.1-DRAFT-0.5B-Q4_K_M-GGUF

liked a model 24 days ago

mradermacher/Devstral-Small-2507-DRAFT-0.5B-GGUF

liked a model 24 days ago

zai-org/GLM-4.5-Air

upvoted a collection about 2 months ago

💧 LFM2

Collection

LFM2 is a new generation of hybrid models, designed for on-device deployment. • 21 items • Updated 5 days ago • 114

upvoted 2 collections 2 months ago

Multimodal GGUFs

Collection

Vision and audio models compatible with llama-server and llama-mtmd-cli • 13 items • Updated Aug 20 • 12

Draft Models

Collection

Tiny "draft" models for speculative decoding. • 32 items • Updated 1 day ago • 4

upvoted a paper 2 months ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 140

upvoted a collection 4 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 172

upvoted 2 papers 4 months ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 129

Turning large language models into cognitive models

Paper • 2306.03917 • Published Jun 6, 2023 • 5

upvoted 3 collections 5 months ago

upvoted a collection 6 months ago

Qwen3

Collection

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 25 days ago • 226

upvoted a collection 7 months ago

Gemma 3 QAT

Collection

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 15 items • Updated Jul 10 • 209

upvoted a collection 10 months ago

GGUF LoRA adapters

Collection

Adapters extracted from fine tuned models, using mergekit-extract-lora • 16 items • Updated Aug 20 • 4

upvoted 3 collections about 1 year ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models in 7 sizes: 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Jul 21 • 647

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated Apr 30 • 308

✂️ Abliteration

Collection

Uncensored models using abliteration. See this article for more information: huggingface.co/blog/mlabonne/abliteration • 34 items • Updated Jul 25 • 127

upvoted a paper about 1 year ago

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Paper • 2405.04434 • Published May 7, 2024 • 21

upvoted an article over 1 year ago

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 704

upvoted a collection almost 2 years ago

Recent models: last 100 repos, sorted by creation date

Collection

The last 100 repos I have created. Sorted by creation date descending, so the most recently created repos appear at the top. • 121 items • Updated Jan 31, 2024 • 562
