Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Menan Velayuthan's picture
3 8 2

Menan Velayuthan

velmen
ramithuh's profile picture pcuenq's profile picture
Β·

AI & ML interests

Machine learning with graphs

Recent Activity

reacted to Jaward's post with ❀️ 5 days ago
nanoBLT: Simplified lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 (n_layers_encoder, n_layers_latent, n_layers_decoder) layer deep trained on ~1M bytes of tiny Shakespeare with a patch size of 4. Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb
upvoted an article 6 days ago
Tokenization in Transformers v5: Simpler, Clearer, and More Modular
upvoted an article 10 days ago
Gotchas in Tokenizer Behavior Every Developer Should Know
View all activity

Organizations

The National Languages Processing Centre's profile picture nanochat students's profile picture

New activity in nanochat-students/README 2 months ago

Let's Gooooo! Let us know if you're on board.

😎 1
14
#1 opened 2 months ago by
burtenshaw
New activity in GAIR/MathPile almost 2 years ago

Issue with TypeError in GAIR/MathPile Dataset Loading

πŸ‘ 1
5
#2 opened almost 2 years ago by
BaiXue
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs