Models for What Changed? Detecting and Evaluating Instruction-Guided Image Edits
with Multimodal Large Language Models [ICCV 2025]
AI & ML interests
None defined yet.
Recent Activity
Models and data for the paper "Recurrence Meets Transformers for Universal Multimodal Retrieval" (arXiv 2509.08897)
-
aimagelab/ReT2-M2KR-CLIP-ViT-B
Visual Document Retrieval • 0.2B • Updated • 148 • 1 -
aimagelab/ReT2-M2KR-CLIP-ViT-L
Visual Document Retrieval • 0.4B • Updated • 17 -
aimagelab/ReT2-M2KR-SigLIP2-ViT-L
Visual Document Retrieval • 0.9B • Updated • 8 • 1 -
aimagelab/ReT2-M2KR-ColBERT-CLIP-ViT-L
Visual Document Retrieval • 0.4B • Updated • 8
Models for What Changed? Detecting and Evaluating Instruction-Guided Image Edits
with Multimodal Large Language Models [ICCV 2025]
Models and data for the paper "Recurrence Meets Transformers for Universal Multimodal Retrieval" (arXiv 2509.08897)
-
aimagelab/ReT2-M2KR-CLIP-ViT-B
Visual Document Retrieval • 0.2B • Updated • 148 • 1 -
aimagelab/ReT2-M2KR-CLIP-ViT-L
Visual Document Retrieval • 0.4B • Updated • 17 -
aimagelab/ReT2-M2KR-SigLIP2-ViT-L
Visual Document Retrieval • 0.9B • Updated • 8 • 1 -
aimagelab/ReT2-M2KR-ColBERT-CLIP-ViT-L
Visual Document Retrieval • 0.4B • Updated • 8
models
37
aimagelab/DICE_coherence_Idefics
Updated
aimagelab/DICE_differencedet_Idefics
Updated
aimagelab/ReT2-M2KR-ColBERT-SigLIP2-ViT-L
Visual Document Retrieval
•
0.4B
•
Updated
•
18
aimagelab/ReT2-MBEIR-SigLIP2-ViT-L
Visual Document Retrieval
•
0.9B
•
Updated
•
13
•
1
aimagelab/ReT2-MBEIR-CLIP-ViT-L
Visual Document Retrieval
•
0.4B
•
Updated
•
8
aimagelab/ReT2-M2KR-ColBERT-CLIP-ViT-L
Visual Document Retrieval
•
0.4B
•
Updated
•
8
aimagelab/ReT2-M2KR-SigLIP2-ViT-L
Visual Document Retrieval
•
0.9B
•
Updated
•
8
•
1
aimagelab/ReT2-M2KR-OpenCLIP-ViT-H
Visual Document Retrieval
•
1B
•
Updated
•
7
aimagelab/ReT2-M2KR-CLIP-ViT-L
Visual Document Retrieval
•
0.4B
•
Updated
•
17
aimagelab/ReT2-M2KR-CLIP-ViT-B
Visual Document Retrieval
•
0.2B
•
Updated
•
148
•
1