Vchitect

non-profit

https://vchitect.intern-ai.org.cn/

Vchitect

Activity Feed Request to join this org

AI & ML interests

generative models, video generation

Recent Activity

ynhe updated a dataset 2 days ago

Vchitect/VBench_human_annotation

Mqleet authored a paper 2 days ago

LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint

Mqleet authored a paper 2 days ago

Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1

View all activity

Papers

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

View all Papers

ynhe

updated a dataset 2 days ago

Vchitect/VBench_human_annotation

Preview • Updated 2 days ago • 35 • 1

Mqleet

authored 2 papers 2 days ago

LED-Merging: Mitigating Safety-Utility Conflicts in Model Merging with Location-Election-Disjoint

Paper • 2502.16770 • Published Feb 24

Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1

Paper • 2510.19600 • Published 4 days ago • 63

ynhe

updated a dataset 4 days ago

Vchitect/VBench-2.0_human_annotation

Preview • Updated 4 days ago • 56 • 1

jackyhate

updated a dataset 5 days ago

Vchitect/Uni-MMMU-Eval

Updated 5 days ago • 29 • 2

jackyhate

in Vchitect/Uni-MMMU-Eval 10 days ago

Update dataset card: Add task categories, tags, paper link, sample usage, and complete citation

#1 opened 10 days ago by

jackyhate

authored a paper 11 days ago

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Paper • 2510.13759 • Published 11 days ago • 9

jackyhate

published a dataset 12 days ago

Vchitect/Uni-MMMU-Eval

Updated 5 days ago • 29 • 2

Ziqi

authored 8 papers 19 days ago

VBench: Comprehensive Benchmark Suite for Video Generative Models

Paper • 2311.17982 • Published Nov 29, 2023 • 9

Talk-to-Edit: Fine-Grained Facial Editing via Dialog

Paper • 2109.04425 • Published Sep 9, 2021

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Paper • 2501.08453 • Published Jan 14 • 1

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Paper • 2506.21356 • Published Jun 26 • 22

Cut2Next: Generating Next Shot via In-Context Tuning

Paper • 2508.08244 • Published Aug 11 • 13

CineScale: Free Lunch in High-Resolution Cinematic Visual Generation

Paper • 2508.15774 • Published Aug 21 • 20

Stencil: Subject-Driven Generation with Context Guidance

Paper • 2509.17120 • Published Sep 21 • 5

VChain: Chain-of-Visual-Thought for Reasoning in Video Generation

Paper • 2510.05094 • Published 20 days ago • 35

yumingj

authored 2 papers about 1 month ago

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

Paper • 2509.21268 • Published Sep 25 • 100

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation

Paper • 2509.15212 • Published Sep 18 • 21

Alexislhb

updated a model about 1 month ago

Vchitect/ShotVL-7B

Image-Text-to-Text • 8B • Updated Sep 19 • 306 • 14

Alexislhb

updated a dataset about 1 month ago

Vchitect/ShotQA

Preview • Updated Sep 12 • 102 • 3