arxiv:2509.19282

OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps

Published on Sep 23 · Submitted by Xiang Zhang on Sep 26


Authors:

Bingnan Li, Chen-Yu Wang, Xiang Zhang,

Abstract

A new benchmark and metric are introduced to evaluate layout-to-image generation models on complex overlapping bounding boxes, along with a fine-tuned model to improve performance.

AI-generated summary

Despite steady progress in layout-to-image generation, current methods still struggle with layouts containing significant overlap between bounding boxes. We identify two primary challenges: (1) large overlapping regions and (2) overlapping instances with minimal semantic distinction. Through both qualitative examples and quantitative analysis, we demonstrate how these factors degrade generation quality. To systematically assess this issue, we introduce OverLayScore, a novel metric that quantifies the complexity of overlapping bounding boxes. Our analysis reveals that existing benchmarks are biased toward simpler cases with low OverLayScore values, limiting their effectiveness in evaluating model performance under more challenging conditions. To bridge this gap, we present OverLayBench, a new benchmark featuring high-quality annotations and a balanced distribution across different levels of OverLayScore. As an initial step toward improving performance on complex overlaps, we also propose CreatiLayout-AM, a model fine-tuned on a curated amodal mask dataset. Together, our contributions lay the groundwork for more robust layout-to-image generation under realistic and challenging scenarios. Project link: https://mlpc-ucsd.github.io/OverLayBench.
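The abstract describes OverLayScore only at a high level, as a measure of how complex a layout's overlapping boxes are, driven by the size of the overlap and by how semantically similar the overlapping instances are. The sketch below is an illustration under stated assumptions, not the paper's definition: it sums the pairwise IoU of overlapping boxes and optionally weights each pair by a caller-supplied similarity. The names `box_iou`, `overlap_complexity`, and the `similarity` callback are hypothetical and do not come from the paper or its code release.

```python
from itertools import combinations


def box_iou(a, b):
    """Intersection over union of two boxes given as (x_min, y_min, x_max, y_max)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def overlap_complexity(boxes, similarity=None):
    """Illustrative overlap score (an assumption, not the paper's formula).

    Sums IoU over every pair of boxes that overlap. If `similarity` is given,
    it is called as similarity(i, j) and should return a weight for the pair,
    for example a text-embedding similarity between the two instance captions.
    """
    score = 0.0
    for i, j in combinations(range(len(boxes)), 2):
        iou = box_iou(boxes[i], boxes[j])
        if iou > 0:
            weight = 1.0 if similarity is None else similarity(i, j)
            score += iou * weight
    return score
```

Under this sketch, a layout of disjoint boxes scores 0, and the score rises as boxes overlap more or as overlapping instances become harder to tell apart. A benchmark could bin layouts by such a score to balance easy and hard cases, which is the kind of balance the abstract attributes to OverLayBench.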


Community

zx1239856

Paper author Paper submitter 26 days ago

OverLayBench - a novel benchmark and metric (OverLayScore) for evaluating layout-to-image models on densely overlapping object layouts


