1 46 2

YanxingLiu

lyx98

YanxingLiu

AI & ML interests

Computer Vision

Recent Activity

updated a collection 19 days ago

InternVL3_5_Flash-HF

updated a model 19 days ago

lyx98/InternVL3_5_Flash-2B-HF

updated a model 19 days ago

lyx98/InternVL3_5_Flash-1B-HF

View all activity

Organizations

None yet

updated a collection 19 days ago

InternVL3_5_Flash-HF

Collection

3 items • Updated 19 days ago

updated 3 models 19 days ago

updated a collection 19 days ago

InternVL3_5_Flash-HF

Collection

3 items • Updated 19 days ago

published 3 models 19 days ago

lyx98/InternVL3_5_Flash-4B-HF

Image-Text-to-Text • 5B • Updated 19 days ago • 19

lyx98/InternVL3_5_Flash-2B-HF

Image-Text-to-Text • 2B • Updated 19 days ago • 22

lyx98/InternVL3_5_Flash-1B-HF

Image-Text-to-Text • 1.0B • Updated 19 days ago • 19

upvoted a paper about 1 month ago

Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer

Paper • 2511.22699 • Published Nov 27, 2025 • 218

upvoted 2 papers about 2 months ago

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published Nov 12, 2025 • 201

DeepEyesV2: Toward Agentic Multimodal Model

Paper • 2511.05271 • Published Nov 7, 2025 • 42

upvoted 2 papers 4 months ago

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

Paper • 2509.09286 • Published Sep 11, 2025 • 11

Visual Representation Alignment for Multimodal Large Language Models

Paper • 2509.07979 • Published Sep 9, 2025 • 83

liked a dataset 4 months ago

HuggingFaceM4/FineVision

Viewer • Updated Oct 21, 2025 • 24.2M • 105k • 463

upvoted 4 papers 4 months ago

Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic

Paper • 2509.01363 • Published Sep 1, 2025 • 58

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25, 2025 • 211

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Paper • 2508.21148 • Published Aug 28, 2025 • 140

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21, 2025 • 259

upvoted a collection 5 months ago

👁️ LFM2-VL

Collection

LFM2-VL is our first series of vision-language models, designed for on-device deployment. • 10 items • Updated 4 days ago • 60

upvoted a paper 5 months ago

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Paper • 2508.08221 • Published Aug 11, 2025 • 50

YanxingLiu

AI & ML interests

Recent Activity

Organizations

lyx98's activity