DNA, mRNA, proteins, AI. I spent the last year going deep into computational biology as an ML engineer. This is Part I of what I found.
In 2024, AlphaFold won the Nobel Prize in Chemistry.
By 2026, the open-source community had built alternatives that outperform it.
That's the story I find most interesting about protein AI right now. Not just the science (which is incredible), but the speed at which open-source caught up. Multiple teams, independently, reproduced and then exceeded AlphaFold 3's accuracy with permissive licenses. The field went from prediction to generation: we're not just modeling known proteins anymore, we're designing new ones.
I spent months mapping this landscape for ML engineers. What the architectures actually are (spoiler: transformers and diffusion models), which tools to use for what, and which ones you can actually ship commercially.
Public reports allege that Anthropic gobbled up trillions of tokens of copyrighted material and public data to build their castle. Now that they're sitting on top, they're begging for special laws to protect their profits while pulling the ladder up behind them.
But the hypocrisy meter just broke! They are accusing Chinese labs like DeepSeek, Minimax, and Kimi of "huge distillation attacks." The reality is that you can't just loot the entire internet's library, lock the door, and then sue everyone else for reading through the window. Stop trying to gatekeep tech you didn't own in the first place. Read the complete article here: https://huggingface.co/blog/Ujjwal-Tyagi/the-dark-underbelly-of-anthropic
Stop sending sensitive data across the network. Sanitize it directly in the browser.
A recent blog post by A. Christmas provides a practical guide to achieving exactly that. It demonstrates a powerful form of anonymization: PII masking at the edge. The vision is simple but profound: keep sensitive data off the network entirely by sanitizing it in the browser.
The Ai4Privacy pii-masking-200k dataset served as the foundation for their work, providing the high-quality, diverse examples of PII needed to fine-tune a specialized DistilBERT model: one that is accurate, fast, and light enough to run client-side.
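The masking step itself is simple once the model has done its job. Here's a minimal, self-contained sketch of that last mile: in practice the entity spans would come from the fine-tuned DistilBERT token classifier running in the browser, but `mask_pii`, the example text, and the spans below are all hypothetical illustrations.

```python
def mask_pii(text, spans):
    """Replace predicted PII spans with typed placeholders.

    spans: list of (start, end, label) character offsets, as a token-
    classification model would predict. Spans are applied right-to-left
    so earlier offsets stay valid after each replacement.
    """
    for start, end, label in sorted(spans, key=lambda s: s[0], reverse=True):
        text = text[:start] + f"[{label}]" + text[end:]
    return text


# Hypothetical model output for a sample message:
message = "My name is Jane Doe, contact me at jane.doe@example.com."
predicted_spans = [(11, 19, "NAME"), (35, 55, "EMAIL")]

print(mask_pii(message, predicted_spans))
# -> My name is [NAME], contact me at [EMAIL].
```

The point is that only the sanitized string ever leaves the client; the raw message and the model both stay in the browser.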
This is the future we are working towards: a world where developers are empowered with the tools and data to build powerful AI systems that respect user privacy by design. This is exactly why we build our datasets, and we're thrilled to showcase this project that turns the principles of data privacy into a practical, deployable solution.
@CohereLabs just released Tiny Aya: a fully open-source 3B parameter model that speaks 70+ languages! But there's a catch:
Tiny Aya is just a language model. It doesn't support tool calling, the key capability that turns frontier models into powerful *agents*. So the real question is:
How hard is it to turn Tiny Aya into an agent?
Turns out… it's simple, thanks to Hugging Face TRL. We're sharing a hands-on example showing how to train Tiny Aya to turn it into a tool-calling agent using TRL, unlocking what could become the first *massively multilingual open agent*.
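To make concrete what "training for tool calling" targets, here is a sketch of the kind of training example involved: a tool schema plus a conversation where the assistant turn is a structured tool call rather than free text. This is an assumed, generic chat format; the exact schema depends on the model's chat template and the TRL trainer used, and `get_weather` is a hypothetical tool.

```python
import json

# Hypothetical tool definition in the common JSON-schema style.
get_weather = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}

# One multilingual training example: the model learns that the correct
# assistant turn is a structured tool call, not a free-text answer.
example = {
    "tools": [get_weather],
    "messages": [
        {"role": "user", "content": "¿Qué tiempo hace en Madrid?"},
        {
            "role": "assistant",
            "tool_calls": [{
                "type": "function",
                "function": {
                    "name": "get_weather",
                    "arguments": json.dumps({"city": "Madrid"}),
                },
            }],
        },
    ],
}

print(json.dumps(example, ensure_ascii=False, indent=2))
```

Fine-tuning on a corpus of examples like this, rendered through the model's chat template, is what teaches a plain language model to emit parseable tool calls.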