The Scaling Properties of Implicit Deductive Reasoning in Transformers
Abstract
Deep Transformers with a bidirectional prefix mask exhibit implicit deductive reasoning that approaches explicit chain-of-thought performance across graph topologies and problem widths, though depth extrapolation still requires CoT.
We investigate the scaling properties of implicit deductive reasoning over Horn clauses in depth-bounded Transformers. By systematically decorrelating provability from spurious features and enforcing algorithmic alignment, we find that in sufficiently deep models with a bidirectional prefix mask, implicit reasoning approaches explicit CoT performance across graph topologies and problem widths, though CoT remains necessary for depth extrapolation.
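For a concrete picture of the bidirectional prefix mask the abstract refers to, here is a minimal PyTorch sketch (my own illustration, not the paper's released code): positions in the prefix, which would hold the Horn-clause problem statement, attend to one another bidirectionally, while any generated suffix stays causal. The function name and example sizes are assumptions for illustration.

import torch

def prefix_lm_mask(seq_len: int, prefix_len: int) -> torch.Tensor:
    # Boolean attention mask: entry (i, j) is True when query i may attend to key j.
    # Start from a standard causal (lower-triangular) mask: j <= i.
    mask = torch.tril(torch.ones(seq_len, seq_len)).bool()
    # Make the prefix block fully bidirectional: every prefix query sees
    # every prefix key. Suffix queries already see the whole prefix through
    # the causal part and remain causal among themselves.
    mask[:prefix_len, :prefix_len] = True
    return mask

# Example: 6 tokens, the first 4 forming the bidirectional prefix.
print(prefix_lm_mask(6, 4).int())

A boolean mask with this convention (True = may attend) can be passed directly as attn_mask to torch.nn.functional.scaled_dot_product_attention; per the abstract, this prefix-bidirectional variant is the setting in which sufficiently deep models bring implicit reasoning close to explicit CoT.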
Community
Code, datasets, and models, although reproducible from the paper, will be made public upon publication. For joint research, contact {enrico.vompa}@gmail.com; I'm open to collaboration.
An interesting breakdown of this paper is available on arXivLens: https://arxivlens.com/PaperView/Details/the-scaling-properties-of-implicit-deductive-reasoning-in-transformers-274-3e1290bd
It covers the executive summary, detailed methodology, and practical applications.
Get this paper in your agent:
hf papers read 2605.04330
Don't have the latest CLI? Install it with:
curl -LsSf https://hf.co/cli/install.sh | bash