2 165 140

Sergiu Han

hgsg

https://sergiudm.github.io/

sergiudm

AI & ML interests

NLP, agent

Recent Activity

liked a model about 10 hours ago

openai-community/gpt2

liked a model about 12 hours ago

BlinkDL/temp-latest-training-models

liked a model about 16 hours ago

tencent/HunyuanWorld-Mirror

View all activity

Organizations

None yet

upvoted a paper 1 day ago

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published 5 days ago • 52

upvoted a paper 2 days ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published 4 days ago • 90

upvoted 3 papers 4 days ago

PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model

Paper • 2510.14528 • Published 10 days ago • 66

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published 9 days ago • 40

FineVision: Open Data Is All You Need

Paper • 2510.17269 • Published 6 days ago • 52

upvoted a paper 7 days ago

Detect Anything via Next Point Prediction

Paper • 2510.12798 • Published 11 days ago • 42

upvoted a paper 8 days ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published 12 days ago • 86

upvoted a paper 9 days ago

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published 22 days ago • 92

upvoted 3 papers 11 days ago

MiniMax-Speech: Intrinsic Zero-Shot Text-to-Speech with a Learnable Speaker Encoder

Paper • 2505.07916 • Published May 12 • 132

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 151

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 151

upvoted a paper 12 days ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published 12 days ago • 157

upvoted a paper 14 days ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published 16 days ago • 240

upvoted 2 papers 17 days ago

ASPO: Asymmetric Importance Sampling Policy Optimization

Paper • 2510.06062 • Published 18 days ago • 13

Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer

Paper • 2510.06590 • Published 18 days ago • 69

upvoted a paper 18 days ago

VideoNSA: Native Sparse Attention Scales Video Understanding

Paper • 2510.02295 • Published 23 days ago • 9

upvoted a paper 19 days ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 78

upvoted 2 papers 22 days ago

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published 23 days ago • 49

The Unreasonable Effectiveness of Scaling Agents for Computer Use

Paper • 2510.02250 • Published 23 days ago • 24

upvoted a collection 23 days ago

Granite 4.0 Language Models

Collection

11 items • Updated 17 days ago • 148

Sergiu Han

AI & ML interests

Recent Activity

Organizations

hgsg's activity