RoboLab: A High-Fidelity Simulation Benchmark for Analysis of Task Generalist Policies Paper • 2604.09860 • Published 8 days ago • 2
STRIDE Applications Collection Benchmarks, proxy corpora, contamination manifests, and checkpoints for STRIDE data-attribution and benchmark-leakage experiments. • 3 items • Updated 3 days ago
VoMP: Predicting Volumetric Mechanical Property Fields Paper • 2510.22975 • Published Oct 27, 2025 • 9
Activation Space Interventions Can Be Transferred Between Large Language Models Paper • 2503.04429 • Published Mar 6, 2025 • 2
TinySQL: A Progressive Text-to-SQL Dataset for Mechanistic Interpretability Research Paper • 2503.12730 • Published Mar 17, 2025 • 4
Squeeze3D: Your 3D Generation Model is Secretly an Extreme Neural Compressor Paper • 2506.07932 • Published Jun 9, 2025 • 12
Can Vision-Language Models Answer Face to Face Questions in the Real-World? Paper • 2503.19356 • Published Mar 25, 2025 • 2
AirLetters: An Open Video Dataset of Characters Drawn in the Air Paper • 2410.02921 • Published Oct 3, 2024
NeRF-US: Removing Ultrasound Imaging Artifacts from Neural Radiance Fields in the Wild Paper • 2408.10258 • Published Aug 13, 2024
Astroformer: More Data Might not be all you need for Classification Paper • 2304.05350 • Published Apr 3, 2023 • 1