Lumees
company
AI & ML interests
LLM, OCR, Embedding Models, Private Intelligence
- lumees/turkish-corpus-100b
  Viewer • Updated • 107M • 1.31k • 4
- lumees/multilingual-safety-classification-dataset
  Viewer • Updated • 213k • 3.43k • 4
- lumees/bulgarian-corpus-33b
  Viewer • Updated • 34.9M • 176 • 3
- lumees/dutch-corpus-200b
  Viewer • Updated • 170M • 109 • 4
Comprehensive collection of high-quality multilingual datasets for NLP research and production.