System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts Paper • 2505.18962 • Published 14 days ago • 12
Investigating Prompting Techniques for Zero- and Few-Shot Visual Question Answering Paper • 2306.09996 • Published Jun 16, 2023
Benchmarking Vision Language Models for Cultural Understanding Paper • 2407.10920 • Published Jul 15, 2024
Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding Paper • 2306.08832 • Published Jun 15, 2023
Rendering-Aware Reinforcement Learning for Vector Graphics Generation Paper • 2505.20793 • Published 12 days ago • 11
FACT: Examining the Effectiveness of Iterative Context Rewriting for Multi-fact Retrieval Paper • 2410.21012 • Published Oct 28, 2024
R$^3$Mem: Bridging Memory Retention and Retrieval via Reversible Compression Paper • 2502.15957 • Published Feb 21
GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks Paper • 2504.12764 • Published Apr 17 • 41
Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images using a View-based Representation Paper • 2003.14166 • Published Mar 23, 2020
StarVector: Generating Scalable Vector Graphics Code from Images Paper • 2312.11556 • Published Dec 17, 2023 • 36
Capture the Flag: Uncovering Data Insights with Large Language Models Paper • 2312.13876 • Published Dec 21, 2023 • 1
WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? Paper • 2403.07718 • Published Mar 12, 2024 • 2
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content Paper • 2406.11811 • Published Jun 17, 2024 • 16