YAML Metadata
		Warning:
	empty or missing yaml metadata in repo card
	(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Model Card
- Source: https://arxiv.org/abs/2509.02046
 - Optimizer: 
sophia - Model size: 
130m - Data size: 
10B 
Best configuration
| Hyperparameter | Value | 
|---|---|
| beta1 | 0.95 | 
| beta2 | 0.99 | 
| epsilon | 1e-07 | 
| gamma | 0.0125 | 
| learning_rate | 0.004 | 
| max_grad_norm | 1 | 
| min_lr_ratio | 0 | 
| train_batch_size | 128 | 
| warmup | 4000 | 
| weight_decay | 0.2 | 
- Downloads last month
 - 12
 
	Inference Providers
	NEW
	
	
	This model isn't deployed by any Inference Provider.
	๐
			
		Ask for provider support