appvoid's Collections
cool datasets
Updated 7 days ago
Some interesting datasets to use for language modeling.
appvoid/raw-corpus • Updated Feb 23 • 1.6M • 17
pszemraj/simple_wikipedia • Updated Sep 9, 2023 • 238k • 943 • 7
common-pile/youtube • Updated Jun 6 • 1.13M • 107 • 10
srinivasbilla/self-instruct-base • Updated Jan 24, 2023 • 82.6k • 43 • 5
agentlans/high-quality-english-sentences • Updated Oct 1, 2024 • 1.71M • 1.22k • 18
agentlans/note-taking-v2 • Updated Sep 22 • 17.6k • 38