Mahmud ElHuseyni 🇵🇸's picture

113 122

Mahmud ElHuseyni 🇵🇸

MElHuseyni

·

AI & ML interests

Computer Vision NLP Machine Learning

Recent Activity

liked a dataset about 22 hours ago

uv-scripts/ocr

upvoted a collection about 23 hours ago

MM Grounding DINO

liked a model 1 day ago

tencent/Hunyuan-1.8B-Instruct

View all activity

Organizations

upvoted a collection about 23 hours ago

MM Grounding DINO

See: https://github.com/huggingface/transformers/pull/37925 • 8 items • Updated Jun 26 • 3

upvoted an article 4 days ago

Article

Mixture of Experts Explained

By

and 5 others •

Dec 11, 2023

• 809

upvoted 2 articles 8 days ago

Article

Introducing Command A Vision: Multimodal AI built for Business

By

and 3 others •

8 days ago

• 61

Article

Bamba: Inference-Efficient Hybrid Mamba2 Model

By

and 28 others •

Dec 18, 2024

• 58

upvoted a collection 8 days ago

Meta CLIP

Scaling CLIP data with transparent training distribution from an end-to-end pipeline. • 7 items • Updated 18 days ago • 4

upvoted a collection 9 days ago

ARPO

The official datasets and model checkpoints of ARPO • 9 items • Updated 10 days ago • 3

upvoted a paper 9 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 15 days ago • 273

upvoted a paper 14 days ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 64

upvoted a collection 20 days ago

Medical & Clinical NER

State-of-the-art medical, biomedical, and clinical Named Entity Recognition models • 389 items • Updated 21 days ago • 24

upvoted an article 20 days ago

Article

Introducing ColQwen-Omni: Retrieve in every modality

By

and 4 others •

22 days ago

• 63

upvoted a paper 20 days ago

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published 22 days ago • 70

upvoted a collection 20 days ago

VisionThink

Efficient Reasoning Vision Language Model • 7 items • Updated 21 days ago • 5

upvoted an article 21 days ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

By

and 3 others •

21 days ago

• 47

upvoted a collection 23 days ago

GLM-4.1V-Thinking

5 items • Updated Jul 2 • 52

upvoted a paper 23 days ago

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published 30 days ago • 44

upvoted 2 collections 28 days ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 2 items • Updated 27 days ago • 115

MedGemma Release

Collection of Gemma 3 variants for performance on medical text and image comprehension to accelerate building healthcare-based AI applications. • 7 items • Updated 28 days ago • 272

upvoted an article 30 days ago

Article

Upskill your LLMs with Gradio MCP Servers

By

•

about 1 month ago

• 18

upvoted a paper 30 days ago

QARI-OCR: High-Fidelity Arabic Text Recognition through Multimodal Large Language Model Adaptation

Paper • 2506.02295 • Published Jun 2 • 5

upvoted an article about 1 month ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

By

and 1 other •

about 1 month ago

• 638