jingyun

hjy

huajingyun

AI & ML interests

NLP

Recent Activity

upvoted a paper 2 days ago

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

authored a paper 10 days ago

KlingAvatar 2.0 Technical Report

liked a model 13 days ago

facebook/mms-1b-all

View all activity

Organizations

None yet

upvoted a paper 2 days ago

GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models

Paper • 2512.15560 • Published 15 days ago • 22

authored a paper 10 days ago

KlingAvatar 2.0 Technical Report

Paper • 2512.13313 • Published 17 days ago • 40

liked a model 13 days ago

facebook/mms-1b-all

Automatic Speech Recognition • 1.0B • Updated Jun 15, 2023 • 166k • 168

upvoted a paper 13 days ago

Kling-Omni Technical Report

Paper • 2512.16776 • Published 14 days ago • 163

upvoted 2 papers about 2 months ago

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration

Paper • 2510.10395 • Published Oct 12, 2025 • 30

When Modalities Conflict: How Unimodal Reasoning Uncertainty Governs Preference Dynamics in MLLMs

Paper • 2511.02243 • Published Nov 4, 2025 • 24

authored a paper 3 months ago

AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration

Paper • 2510.10395 • Published Oct 12, 2025 • 30

authored a paper 4 months ago

Kwai Keye-VL 1.5 Technical Report

Paper • 2509.01563 • Published Sep 1, 2025 • 37

liked a Space 4 months ago

FineVision: Open Data is All You Need

📝

215

A new open-source dataset for training VLMs

liked 2 models 4 months ago

Kwai-Keye/Keye-VL-1_5-8B

Video-Text-to-Text • 9B • Updated Sep 4, 2025 • 49.8k • 59

deepseek-ai/DeepSeek-V3.1

Text Generation • 685B • Updated Sep 5, 2025 • 58.4k • • 809

liked a model 5 months ago

facebook/dinov3-vit7b16-pretrain-lvd1689m

Image Feature Extraction • 7B • Updated Aug 19, 2025 • 12.9k • 198

upvoted a paper 6 months ago

Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese

Paper • 2110.06696 • Published Oct 13, 2021 • 2

authored 4 papers 6 months ago

Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese

Paper • 2110.06696 • Published Oct 13, 2021 • 2

HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models

Paper • 2502.20811 • Published Feb 28, 2025 • 3

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Paper • 2504.10068 • Published Apr 14, 2025 • 30

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2, 2025 • 130

upvoted a paper 6 months ago

Kwai Keye-VL Technical Report

Paper • 2507.01949 • Published Jul 2, 2025 • 130

liked a model 6 months ago

moonshotai/Kimi-VL-A3B-Thinking-2506

Image-Text-to-Text • 16B • Updated Aug 18, 2025 • 171k • 330

liked a model 7 months ago

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B

Text Generation • 8B • Updated May 29, 2025 • 473k • • 1.01k

jingyun

AI & ML interests

Recent Activity

Organizations

hjy's activity

FineVision: Open Data is All You Need