Lupi

Chodevil

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

upvoted a collection 27 days ago

RDT 2

upvoted a collection about 1 month ago

VLA-Adapter Models

View all activity

Organizations

None yet

upvoted a paper 22 days ago

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

Paper • 2510.00406 • Published 23 days ago • 63

upvoted a collection 27 days ago

RDT 2

Collection

RDT 2, the sequel to RDT-1B, is the first foundation model that achieves zero-shot deployment on unseen embodiments for simple open-vocabulary tasks. • 4 items • Updated 27 days ago • 15

upvoted a collection about 1 month ago

VLA-Adapter Models

Collection

The models of VLA-Adapter • 8 items • Updated 23 days ago • 9

upvoted a paper about 1 month ago

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11 • 231

upvoted 2 papers 5 months ago

Shifting AI Efficiency From Model-Centric to Data-Centric Compression

Paper • 2505.19147 • Published May 25 • 144

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

Paper • 2505.12448 • Published May 18 • 10

liked a model 5 months ago

OpenHelix/openhelix

Updated May 20 • 3

upvoted 2 papers 11 months ago

CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction

Paper • 2412.06782 • Published Dec 9, 2024 • 7

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Paper • 2411.17686 • Published Nov 26, 2024 • 20

upvoted a paper about 1 year ago

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Paper • 2409.07239 • Published Sep 11, 2024 • 15

liked 2 Spaces over 1 year ago

3.5k

InstantID

😻

Generate images preserving face identity

441

InstantStyle

👁

Style-Preserving Text-to-Image Generation

upvoted a paper over 1 year ago

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

Paper • 2403.14520 • Published Mar 21, 2024 • 35

liked a model over 1 year ago

han1997/cobra

Updated Aug 19, 2024 • 19

liked a Space over 1 year ago

Cobra

🐍

Cobra: Extending Mamba to MLLM for Efficient Inference