Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
appvoid 's Collections
main releases
cool datasets

cool datasets

updated 7 days ago

some interesting datasets to use for language modeling

Upvote
-

  • appvoid/raw-corpus

    Viewer • Updated Feb 23 • 1.6M • 17

  • pszemraj/simple_wikipedia

    Viewer • Updated Sep 9, 2023 • 238k • 943 • 7

  • common-pile/youtube

    Viewer • Updated Jun 6 • 1.13M • 107 • 10

  • srinivasbilla/self-instruct-base

    Viewer • Updated Jan 24, 2023 • 82.6k • 43 • 5

  • agentlans/high-quality-english-sentences

    Viewer • Updated Oct 1, 2024 • 1.71M • 1.22k • 18

  • agentlans/note-taking-v2

    Viewer • Updated Sep 22 • 17.6k • 38
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs