Apolinário from multimodal AI art's picture

Building on HF

Apolinário from multimodal AI art PRO

multimodalart

·

https://multimodal.art

AI & ML interests

None yet

Recent Activity

liked a Space about 2 hours ago

AI4Editing/MagicQuillV2

upvoted a paper about 2 hours ago

MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues

upvoted a changelog about 13 hours ago

Duplicate Datasets

View all activity

Organizations

upvoted a paper about 2 hours ago

MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues

Paper • 2512.03046 • Published 1 day ago • 9

upvoted a changelog about 13 hours ago

Changelog

Duplicate Datasets

about 21 hours ago

• 36

upvoted 2 collections 1 day ago

Ministral 3

A collection of edge models, with Base, Instruct and Reasoning variants, in 3 different sizes: 3B, 8B and 14B. All with vision capabilities. • 9 items • Updated 1 day ago • 100

Mistral Large 3

A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 1 day ago • 64

upvoted a collection 7 days ago

Z-Image

4 items • Updated 3 days ago • 66

upvoted an article 9 days ago

Article

Diffusers welcomes FLUX-2

+6

9 days ago

•

152

upvoted an article 15 days ago

Article

Introducing Cogito v2.1

15 days ago

•

17

upvoted a paper 16 days ago

Back to Basics: Let Denoising Generative Models Denoise

Paper • 2511.13720 • Published 17 days ago • 61

upvoted a collection 17 days ago

The Bestiary

Decensored language models made using Heretic (https://github.com/p-e-w/heretic) • 6 items • Updated 18 days ago • 68

upvoted an article 22 days ago

Article

We’re open-sourcing our text-to-image model and the process behind it

22 days ago

•

73

upvoted an article about 1 month ago

Article

What makes good reasoning data

Oct 30

•

33

upvoted a paper about 1 month ago

The Principles of Diffusion Models

Paper • 2510.21890 • Published Oct 24 • 58

upvoted a collection about 1 month ago

Emu3.5

Native Multimodal Models are World Learners 🌍 • 4 items • Updated 21 days ago • 71

upvoted an article about 1 month ago

Article

Granite 4.0 Nano: Just how small can you go?

Oct 28

•

120

upvoted a paper about 1 month ago

Group Relative Attention Guidance for Image Editing

Paper • 2510.24657 • Published Oct 28 • 25

upvoted a paper about 2 months ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published Oct 16 • 65

upvoted an article about 2 months ago

Article

Model statistics of the 50 most downloaded entities on Hugging Face

Oct 13

•

33

upvoted 2 papers about 2 months ago

Phoenix-VAD: Streaming Semantic Endpoint Detection for Full-Duplex Speech Interaction

Paper • 2509.20410 • Published Sep 24 • 2

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2 • 94

upvoted a paper 2 months ago

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

Paper • 2509.26346 • Published Sep 30 • 18