Andrea Gemelli

andreagemelli

https://www.andreagemelli.me

AI & ML interests

Natural Language Processing, Computer Vision, Generative Models, Document Analysis

Recent Activity


New activity in Xkev/LLaVA-CoT-100k 11 days ago

many img dirs are missing

#4 opened 29 days ago by boydcheung

liked a model 14 days ago

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled

Image-Text-to-Text • 28B • Updated 11 days ago • 585k • 2.68k

liked a dataset 25 days ago

PleIAs/SYNTH

Viewer • Updated Nov 11, 2025 • 68M • 35.1k • 260

upvoted an article 25 days ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025 • 619

updated a model 25 days ago

andreagemelli/UNet2DModel-fashion_mnist

Updated 25 days ago • 16

published a model 25 days ago

andreagemelli/UNet2DModel-fashion_mnist

Updated 25 days ago • 16

liked a dataset about 2 months ago

VLR-CVC/DocVQA-2026

Viewer • Updated Mar 12 • 73 • 3.63k • 69

liked a Space 4 months ago

Evaluation Guidebook

📝

302

Explore LLM benchmark trends over time

upvoted an article 5 months ago

Article

Vision Language Model Alignment in TRL ⚡️

Aug 7, 2025 • 109

upvoted 2 papers 5 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 72

Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

Paper • 2401.08417 • Published Jan 16, 2024 • 37

liked a Space 6 months ago

The Smol Training Playbook

📚

3.11k

The secrets to building world-class LLMs

upvoted 2 articles 6 months ago

Article

Supercharge your OCR Pipelines with Open Models

Oct 21, 2025 • 308

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025 • 297

upvoted a collection 6 months ago

Qwen3-VL

Collection

37 items • Updated Dec 31, 2025 • 695

upvoted an article 6 months ago

Article

Preference Optimization for Vision Language Models

Jul 10, 2024

•

New activity in letxbe/DocExplainer 7 months ago

Which license is it?

#1 opened 7 months ago by plamb

updated a model 7 months ago

letxbe/DocExplainer

Visual Question Answering • Updated Sep 20, 2025 • 11

upvoted a collection 7 months ago

Holo1.5

Collection

Holo1.5 - Open Foundation Models for Computer Use Agents • 5 items • Updated Sep 15, 2025 • 35

liked a model 7 months ago

blowing-up-groundhogs/emuru

Text-to-Image • 0.7B • Updated Jul 31, 2025 • 446 • 13
