Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Common Crawl Foundation

Enterprise
non-profit
Verified
https://commoncrawl.org
commoncrawl
commoncrawl
Activity Feed

AI & ML interests

Crawled data and metadata

Recent Activity

malteos  updated a Space 4 days ago
commoncrawl/cc-citations
tvaughan  updated a dataset 15 days ago
commoncrawl/statistics
dalhuijsen  updated a dataset 24 days ago
commoncrawl/gneissweb-annotation-host-testing-v1
View all activity

malteos's profile picture Pedro Ortiz Suarez's profile picture Laurie Burchell's profile picture Luca's profile picture Sebastian Nagel's profile picture d's profile picture Jason Grey's profile picture Hande Celikkanat's profile picture Thom Vaughan's profile picture Paul Lazar's profile picture Greg Lindahl's profile picture Ford H's profile picture Jen English's profile picture Thijs Dalhuijsen's profile picture

commoncrawl 's datasets 5

commoncrawl/statistics

Viewer • Updated 15 days ago • 586k • 523 • 25

commoncrawl/gneissweb-annotation-host-testing-v1

Viewer • Updated 24 days ago • 617M • 23

commoncrawl/gneissweb-annotation-url-testing-v1

Viewer • Updated 24 days ago • 11.5B • 801

commoncrawl/citations

Viewer • Updated Oct 16, 2025 • 9.18k • 104 • 1

commoncrawl/eot2024_hostlevel_logs

Viewer • Updated Oct 9, 2024 • 271k • 9 • 1
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs