II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models Paper • 2406.05862 • Published Jun 9, 2024 • 4
Cambrian-S: Towards Spatial Supersensing in Video Paper • 2511.04670 • Published 23 days ago • 34
Diffusion Transformers with Representation Autoencoders Paper • 2510.11690 • Published Oct 13 • 162
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities Paper • 2507.06261 • Published Jul 7 • 64
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models Paper • 2406.05862 • Published Jun 9, 2024 • 4
Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework Paper • 2505.17019 • Published May 22 • 4
Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework Paper • 2505.17019 • Published May 22 • 4 • 3
Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework Paper • 2505.17019 • Published May 22 • 4
Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework Paper • 2505.17019 • Published May 22 • 4 • 3