H. Aldhaheri
aenawi
AI & ML interests
LLMs Agents
Organizations
None yet
Text2Image LLMs
LLMs
Spaces For Demos
Models-Support-Arabic
Speech-to-Speech
Token-Classification
-
hatmimoha/arabic-ner
Token Classification • 0.1B • Updated • 42.2k • • 21 -
Ammar-alhaj-ali/arabic-MARBERT-poetry-classification
Text Classification • Updated • 1.22k • • 3 -
CAMeL-Lab/bert-base-arabic-camelbert-mix-ner
Token Classification • Updated • 22.4k • • 14 -
SinaLab/ArabicNER-Wojood
Token Classification • Updated • 55 • 10
Neo4j-Cypher
Coding
DeepResearch Models
Translation-Models
-
tencent/Hunyuan-MT-7B
Translation • 8B • Updated • 21.4k • 714 -
tencent/Hunyuan-MT-Chimera-7B
Translation • 8B • Updated • 1.54k • 86 -
swiss-ai/Apertus-8B-Instruct-2509
Text Generation • 8B • Updated • 466k • • 416 -
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale
Paper • 2509.14008 • Published • 88
Speech-To-Text
Papers - Researches
Arabic Datasets
Embedding Models
-
WhereIsAI/UAE-Large-V1
Feature Extraction • 0.3B • Updated • 1.29M • • 237 -
intfloat/multilingual-e5-large
Feature Extraction • 0.6B • Updated • 3M • • 1.11k -
sentence-transformers/distiluse-base-multilingual-cased-v1
Sentence Similarity • 0.1B • Updated • 601k • • 127 -
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
Sentence Similarity • 0.1B • Updated • 24.5M • • 1.09k
Datasets
-
ahmedheakl/resume-atlas
Viewer • Updated • 13.4k • 134 • 10 -
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
Paper • 2506.20920 • Published • 75 -
Sleeping279
Infinite Dataset Hub
♾279Search and save datasets generated with a LLM in real time
-
IntrEx: A Dataset for Modeling Engagement in Educational Conversations
Paper • 2509.06652 • Published • 24
Train-On-Datasets
Cybersecurity Models
Animation
DeepResearch Models
Text2Image LLMs
Translation-Models
-
tencent/Hunyuan-MT-7B
Translation • 8B • Updated • 21.4k • 714 -
tencent/Hunyuan-MT-Chimera-7B
Translation • 8B • Updated • 1.54k • 86 -
swiss-ai/Apertus-8B-Instruct-2509
Text Generation • 8B • Updated • 466k • • 416 -
Hala Technical Report: Building Arabic-Centric Instruction & Translation Models at Scale
Paper • 2509.14008 • Published • 88
LLMs
Speech-To-Text
Spaces For Demos
Papers - Researches
Models-Support-Arabic
Arabic Datasets
Speech-to-Speech
Embedding Models
-
WhereIsAI/UAE-Large-V1
Feature Extraction • 0.3B • Updated • 1.29M • • 237 -
intfloat/multilingual-e5-large
Feature Extraction • 0.6B • Updated • 3M • • 1.11k -
sentence-transformers/distiluse-base-multilingual-cased-v1
Sentence Similarity • 0.1B • Updated • 601k • • 127 -
sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2
Sentence Similarity • 0.1B • Updated • 24.5M • • 1.09k
Token-Classification
-
hatmimoha/arabic-ner
Token Classification • 0.1B • Updated • 42.2k • • 21 -
Ammar-alhaj-ali/arabic-MARBERT-poetry-classification
Text Classification • Updated • 1.22k • • 3 -
CAMeL-Lab/bert-base-arabic-camelbert-mix-ner
Token Classification • Updated • 22.4k • • 14 -
SinaLab/ArabicNER-Wojood
Token Classification • Updated • 55 • 10
Datasets
-
ahmedheakl/resume-atlas
Viewer • Updated • 13.4k • 134 • 10 -
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language
Paper • 2506.20920 • Published • 75 -
Sleeping279
Infinite Dataset Hub
♾279Search and save datasets generated with a LLM in real time
-
IntrEx: A Dataset for Modeling Engagement in Educational Conversations
Paper • 2509.06652 • Published • 24
Neo4j-Cypher
Train-On-Datasets
Coding
Cybersecurity Models