ISEKAI

community

https://github.com/isekai-portal/Link-Context-Learning

isekai-portal

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

weepiess2383 authored a paper 9 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

liuziwei7 authored a paper 4 months ago

EgoTwin: Dreaming Body and View in First Person

liuziwei7 authored a paper 6 months ago

PhysX: Physical-Grounded 3D Asset Generation

View all activity

weepiess2383

authored a paper 9 days ago

The Prism Hypothesis: Harmonizing Semantic and Pixel Representations via Unified Autoencoding

Paper • 2512.19693 • Published 12 days ago • 61

liuziwei7

authored a paper 4 months ago

EgoTwin: Dreaming Body and View in First Person

Paper • 2508.13013 • Published Aug 18, 2025 • 20

liuziwei7

authored a paper 6 months ago

PhysX: Physical-Grounded 3D Asset Generation

Paper • 2507.12465 • Published Jul 16, 2025 • 43

liuziwei7

authored a paper 8 months ago

3D Scene Generation: A Survey

Paper • 2505.05474 • Published May 8, 2025 • 21

liuziwei7

authored 3 papers 9 months ago

GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography

Paper • 2504.07083 • Published Apr 9, 2025 • 22

Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency

Paper • 2503.20785 • Published Mar 26, 2025 • 22

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Paper • 2503.21755 • Published Mar 27, 2025 • 33

weepiess2383

authored 2 papers 9 months ago

Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models

Paper • 2501.08453 • Published Jan 14, 2025 • 1

CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models

Paper • 2503.18886 • Published Mar 24, 2025 • 24

Amoik

authored a paper 10 months ago

REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding

Paper • 2503.07413 • Published Mar 10, 2025 • 2

liuziwei7

authored 3 papers 10 months ago

liuziwei7

authored 2 papers 11 months ago

Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment

Paper • 2502.04328 • Published Feb 6, 2025 • 29

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23, 2025 • 23

liuziwei7

authored 2 papers 12 months ago

CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Paper • 2501.08983 • Published Jan 15, 2025 • 22

RepVideo: Rethinking Cross-Layer Representation for Video Generation

Paper • 2501.08994 • Published Jan 15, 2025 • 15

weepiess2383

authored a paper 12 months ago

RepVideo: Rethinking Cross-Layer Representation for Video Generation

Paper • 2501.08994 • Published Jan 15, 2025 • 15

liuziwei7

authored 2 papers 12 months ago

Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives

Paper • 2501.04003 • Published Jan 7, 2025 • 27

Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control

Paper • 2501.03847 • Published Jan 7, 2025 • 22

AI & ML interests

Recent Activity

Team members 3

ISEKAI-Portal's activity