In a Training Loop 🔄

5 5 4

Martin Ziqiao Ma PRO

marstin

https://ziqiaoma.com/

AI & ML interests

https://huggingface.co/Seed42Lab

Recent Activity

authored a paper 3 months ago

SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds

authored a paper 3 months ago

Next-Embedding Prediction Makes Strong Vision Learners

upvoted a paper 3 months ago

Next-Embedding Prediction Makes Strong Vision Learners

View all activity

Organizations

authored 2 papers 3 months ago

SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds

Paper • 2512.01078 • Published Nov 30, 2025 • 34

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 87

upvoted 2 papers 3 months ago

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 87

SimWorld: An Open-ended Realistic Simulator for Autonomous Agents in Physical and Social Worlds

Paper • 2512.01078 • Published Nov 30, 2025 • 34

authored 4 papers 4 months ago

AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies

Paper • 2508.08113 • Published Aug 11, 2025 • 11

From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens

Paper • 2510.02292 • Published Oct 2, 2025 • 1

Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry

Paper • 2510.25595 • Published Oct 29, 2025

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Paper • 2511.01163 • Published Nov 3, 2025 • 32

upvoted a paper 4 months ago

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Paper • 2511.01163 • Published Nov 3, 2025 • 32

liked a dataset 5 months ago

cheryyunl/ROVER

Viewer • Updated Nov 8, 2025 • 1.31k • 55 • 9

updated a Space 5 months ago

VLM-Lens

👀

[EMNLP 2025 Demo] VLM-Lens: Extracting VLM representations

published a Space 5 months ago

VLM-Lens

👀

[EMNLP 2025 Demo] VLM-Lens: Extracting VLM representations

New activity in sled-umich/InfEdit 6 months ago

License?

#2 opened about 2 years ago by

cian0

What

#3 opened about 2 years ago by

NoenD

updated a model 8 months ago

sled-umich/groundhog-7b

Updated Jul 22, 2025

updated 3 datasets 8 months ago

authored a paper 8 months ago

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation

Paper • 2506.21876 • Published Jun 27, 2025 • 28

upvoted a paper 9 months ago

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Paper • 2506.18890 • Published Jun 23, 2025 • 6

Martin Ziqiao Ma PRO

AI & ML interests

Recent Activity

Organizations

marstin's activity

VLM-Lens

VLM-Lens

License?

What