Muhtasham Oblokulov's picture

Muhtasham Oblokulov PRO

muhtasham

·

https://www.linkedin.com/in/muhtasham/

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

Qwen/Qwen-Image

liked a model 3 days ago

KittenML/kitten-tts-nano-0.1

liked a model 3 days ago

rednote-hilab/dots.ocr

View all activity

Organizations

upvoted a paper 12 days ago

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published 23 days ago • 41

upvoted 2 papers 19 days ago

OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique

Paper • 2507.09075 • Published 28 days ago • 13

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset

Paper • 2504.16891 • Published Apr 23 • 24

upvoted a collection 19 days ago

OpenReasoning-Nemotron

Collection of models for OpenReasoning-Nemotron which are trained on 5M reasoning traces for Math, Code and Science. • 6 items • Updated 9 days ago • 39

upvoted 3 articles 23 days ago

Article

Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever.

By

•

23 days ago

• 133

Article

Introducing ColQwen-Omni: Retrieve in every modality

By

and 4 others •

23 days ago

• 64

Article

Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders

By

and 5 others •

24 days ago

• 55

upvoted a collection 28 days ago

NextCoder

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 6 items • Updated about 1 month ago • 69

upvoted a paper about 1 month ago

How Well Does GPT-4o Understand Vision? Evaluating Multimodal Foundation Models on Standard Computer Vision Tasks

Paper • 2507.01955 • Published Jul 2 • 34

upvoted 2 collections about 1 month ago

Speech-To-Text

https://kyutai.org/next/stt • 6 items • Updated Jun 19 • 12

EmoNet

The full collection of our EmoNet effort. More info available at: https://huggingface.co/blog/felfri/emonet • 8 items • Updated Jun 22 • 5

upvoted a paper about 1 month ago

SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning

Paper • 2506.21355 • Published Jun 26 • 9

upvoted 2 collections about 1 month ago

Gemma 3n

24 items • Updated 27 days ago • 11

Gemma 3n

4 items • Updated 30 days ago • 204

upvoted a paper about 2 months ago

DeepFilterNet: Perceptually Motivated Real-Time Speech Enhancement

Paper • 2305.08227 • Published May 14, 2023 • 1

upvoted an article about 2 months ago

Article

How to generate text: using different decoding methods for language generation with Transformers

By

•

Mar 1, 2020

• 231

upvoted 2 papers about 2 months ago

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published Apr 10 • 29

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 126

upvoted 2 articles about 2 months ago

Article

SmolVLA: Efficient Vision-Language-Action Model trained on Lerobot Community Data

By

and 8 others •

Jun 3

• 221

Article

LTX-Video LoRA training study (Single image/style training)

By

•

Jan 14

• 4