36

Alfie Devine

alf16Devine

AI & ML interests

structured visual recognition

Organizations

None yet

upvoted 11 papers 4 days ago

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs

Paper • 2603.05890 • Published 8 days ago • 81

HiAR: Efficient Autoregressive Long Video Generation via Hierarchical Denoising

Paper • 2603.08703 • Published 4 days ago • 29

CARE-Edit: Condition-Aware Routing of Experts for Contextual Image Editing

Paper • 2603.08589 • Published 4 days ago • 34

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Paper • 2603.03269 • Published 10 days ago • 52

How Far Can Unsupervised RLVR Scale LLM Training?

Paper • 2603.08660 • Published 4 days ago • 44

Believe Your Model: Distribution-Guided Confidence Calibration

Paper • 2603.03872 • Published 10 days ago • 36

Holi-Spatial: Evolving Video Streams into Holistic 3D Spatial Intelligence

Paper • 2603.07660 • Published 5 days ago • 77

Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets

Paper • 2602.22207 • Published 16 days ago • 42

Enhancing Spatial Understanding in Image Generation via Reward Modeling

Paper • 2602.24233 • Published 14 days ago • 52

CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation

Paper • 2602.24286 • Published 14 days ago • 86

dLLM: Simple Diffusion Language Modeling

Paper • 2602.22661 • Published 16 days ago • 128

upvoted a paper 14 days ago

The Trinity of Consistency as a Defining Principle for General World Models

Paper • 2602.23152 • Published 15 days ago • 196

upvoted 8 papers 8 months ago

MOSPA: Human Motion Generation Driven by Spatial Audio

Paper • 2507.11949 • Published Jul 16, 2025 • 25

SpatialTrackerV2: 3D Point Tracking Made Easy

Paper • 2507.12462 • Published Jul 16, 2025 • 19

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Paper • 2507.11412 • Published Jul 15, 2025 • 31

DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering

Paper • 2507.11527 • Published Jul 15, 2025 • 35

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Paper • 2507.12463 • Published Jul 16, 2025 • 27

Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs

Paper • 2507.09477 • Published Jul 13, 2025 • 88

PhysX: Physical-Grounded 3D Asset Generation

Paper • 2507.12465 • Published Jul 16, 2025 • 44

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16, 2025 • 43