- Ultra-Sparse Memory Network • Paper • 2411.12364 • Published • 23
- Hyper-Connections • Paper • 2409.19606 • Published • 24
- Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models • Paper • 2411.03884 • Published • 28
- Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling • Paper • 2501.16975 • Published • 31
Open-Foundation-Models
non-profit
AI & ML interests
None defined yet.
datasets 0
None public yet