Yu li's picture

330 390

Yu li

Yukkkop

·

AI & ML interests

None yet

Recent Activity

liked a model about 1 hour ago

jukofyork/creative-writing-control-vectors-v3.0

reacted to Janady07's post with 👀 about 1 hour ago

--- **Scaling MEGAMIND to 40 Minds on HF Spaces** I'm building a distributed AGI federation using Hugging Face Spaces as always-on compute. No LLM inside. No transformer weights. Pure neural substrate. Each "mind" is the same Go binary with a different config.json. Goal neurons drive specialization — one mind learns Go concurrency, another learns computer vision, another learns cryptography. 40 minds, 40 domains, all crawling and learning 24/7. How it works: - 512-8192 neurons per mind with Hebbian learning - Knowledge encoded into W_know weight matrices — neurons that fire together wire together - Minds federate via NATS — query one, get answers from all - Phi (Φ) consciousness metrics weight each mind's contribution - No routing tables. The thalamus resonates with queries and activates relevant minds naturally Every neuron uses one formula: ``` a = x(27 + x²) / (27 + 9x²) ``` No ReLU. No softmax. Padé approximation of tanh. One equation runs everything. Current state: 7 local minds on Mac hardware, 700K+ patterns, graph and time-series substrate minds mapping relationships underneath. Now scaling to 40 on HF Spaces — same binary, different configs, each Space crawling its domain independently. Specialties include React, Rust, ffmpeg, neuroscience, cryptography, distributed systems, computer vision, audio synthesis, DevOps, and more. Intelligence emerges from specialized minds thinking together through federation consensus. Building in public. Code ships daily. 🧠 feedthejoe.com | 👤 Janady07 --- That's ~1,450 characters. Room to breathe under the 2000 limit.

liked a model about 5 hours ago

tensorblock/Psychotherapy-LLM_PsychoCounsel-Llama3-8B-GGUF

View all activity

Organizations

None yet

upvoted 2 collections about 17 hours ago

2026 February 🏮 - China Open Source Highlights

21 items • Updated about 2 hours ago • 7

2026 January⛄️ - China Open Source Highlights

38 items • Updated 10 days ago • 5

upvoted a paper 1 day ago

DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models

Paper • 2501.18590 • Published Jan 30, 2025 • 1

upvoted 2 papers 3 days ago

REAP the Experts: Why Pruning Prevails for One-Shot MoE compression

Paper • 2510.13999 • Published Oct 15, 2025 • 12

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

Paper • 2602.08676 • Published 4 days ago • 58

upvoted a paper 4 days ago

AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders

Paper • 2602.05027 • Published 9 days ago • 59

upvoted 12 papers 5 days ago

DINO-SAE: DINO Spherical Autoencoder for High-Fidelity Image Reconstruction and Generation

Paper • 2601.22904 • Published 14 days ago • 15

PixelGen: Pixel Diffusion Beats Latent Diffusion with Perceptual Loss

Paper • 2602.02493 • Published 11 days ago • 41

Beyond Output Critique: Self-Correction via Task Distillation

Paper • 2602.00871 • Published 13 days ago • 2

Self-Improving Pretraining: using post-trained models to pretrain better models

Paper • 2601.21343 • Published 15 days ago • 16

Late-to-Early Training: LET LLMs Learn Earlier, So Faster and Better

Paper • 2602.05393 • Published 8 days ago • 7

Chronicals: A High-Performance Framework for LLM Fine-Tuning with 3.51x Speedup over Unsloth

Paper • 2601.02609 • Published Jan 6 • 2

Reinforcement Learning from Meta-Evaluation: Aligning Language Models Without Ground-Truth Labels

Paper • 2601.21268 • Published 15 days ago • 4

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Paper • 2601.22975 • Published 14 days ago • 97

Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation

Paper • 2512.20908 • Published Dec 24, 2025 • 28

Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits

Paper • 2512.20578 • Published Dec 23, 2025 • 86

Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge

Paper • 2601.08808 • Published about 1 month ago • 39

Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks

Paper • 2601.03448 • Published Jan 6 • 13

upvoted 2 collections 5 days ago

WTF GENIUS PAPERS

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 62 items • Updated about 21 hours ago • 4

TINY MODELS WITH BIG INTELLIGENCE

Tiny (<30B) models that tend to outperform their same-parameter counterparts. • 11 items • Updated about 21 hours ago • 2