Pawel

Pwlot

Pwlot
Pwlot

AI & ML interests

AGI

Recent Activity

liked a model 7 days ago

Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

liked a Space about 2 months ago

microsoft/TRELLIS.2

liked a Space 8 months ago

nanotron/ultrascale-playbook

View all activity

Organizations

liked a model 7 days ago

Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

Text-to-Speech • 2B • Updated 3 days ago • 228k • 835

liked a Space about 2 months ago

TRELLIS.2

🏢

898

High-fidelity 3D Generation from images

liked a Space 8 months ago

The Ultra-Scale Playbook

🌌

3.67k

The ultimate guide to training LLM on large GPU Clusters

liked a model 11 months ago

lerobot/pi0_old

Robotics • 4B • Updated Sep 19, 2025 • 1.2k • 306

liked a dataset about 1 year ago

HuggingFaceTB/finemath

Viewer • Updated Feb 6, 2025 • 48.3M • 7.4k • 348

liked a model over 1 year ago

jasperai/flash-sdxl

Text-to-Image • Updated Jul 3, 2024 • 74 • • 35

liked a dataset over 1 year ago

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11, 2025 • 3.5B • 345k • 930

liked a model over 1 year ago

stabilityai/stable-zero123

Text-to-3D • Updated Jul 10, 2024 • 754

reacted to Sentdex's post with 👍 over 1 year ago

Post

10232

Okay, first pass over KAN: Kolmogorov–Arnold Networks, it looks very interesting!

Interpretability of KAN model:
May be considered mostly as a safety issue these days, but it can also be used as a form of interaction between the user and a model, as this paper argues and I think they make a valid point here. With MLP, we only interact with the outputs, but KAN is an entirely different paradigm and I find it compelling.

Scalability:
KAN shows better parameter efficiency than MLP. This likely translates also to needing less data. We're already at the point with the frontier LLMs where all the data available from the internet is used + more is made synthetically...so we kind of need something better.

Continual learning:
KAN can handle new input information w/o catastrophic forgetting, which helps to keep a model up to date without relying on some database or retraining.

Sequential data:
This is probably what most people are curious about right now, and KANs are not shown to work with sequential data yet and it's unclear what the best approach might be to make it work well both in training and regarding the interpretability aspect. That said, there's a rich long history of achieving sequential data in variety of ways, so I don't think getting the ball rolling here would be too challenging.

Mostly, I just love a new paradigm and I want to see more!

KAN: Kolmogorov-Arnold Networks (2404.19756)

5 replies

liked a Space over 1 year ago

StoryDiffusion

👁

610

Generate images from text prompts with optional reference images

liked a dataset almost 2 years ago

HuggingFaceFW/fineweb

Viewer • Updated Jul 11, 2025 • 52.5B • 203k • 2.63k

liked 4 Spaces almost 2 years ago

InstantMesh

📚

1.57k

Create a 3D model from an image in 10 seconds!

Repo duplicator

😻

326

Duplicate Hugging Face repositories

DragGan - Drag Your GAN

👆

1.03k

Manipulate images by dragging points

Open VLM Leaderboard

🌎

978

VLMEvalKit Evaluation Results Collection

reacted to clem's post with 👍 about 2 years ago

Post

Is synthetic data the future of AI? 🔥🔥🔥

@HugoLaurencon @Leyo & @VictorSanh are introducing HuggingFaceM4/WebSight , a multimodal dataset featuring 823,000 pairs of synthetically generated HTML/CSS codes along with screenshots of the corresponding rendered websites to train GPT4-V-like models 🌐💻

While crafting their upcoming foundation vision language model, they faced the challenge of converting website screenshots into usable HTML/CSS codes. Most VLMs suck at this and there was no public dataset available for this specific task, so they decided to create their own.

They prompted existing LLMs to generate 823k HTML/CSS codes of very simple websites. Through supervised fine-tuning of a vision language model on WebSight, they were able to generate the code to reproduce a website component, given a screenshot.

You can explore the dataset here: HuggingFaceM4/WebSight

What do you think?