11 23 19

Ellis Brown PRO

ellisbrown

http://ellisbrown.github.io

AI & ML interests

AI, Deep Learning, Computer Vision, Representation Learning, Self-Supervised Learning

Recent Activity

authored a paper 2 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

upvoted a paper 2 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

updated a dataset 4 days ago

PaintBench/pixels

View all activity

Organizations

authored a paper 2 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 3 days ago • 68

upvoted a paper 2 days ago

Beyond Language Modeling: An Exploration of Multimodal Pretraining

Paper • 2603.03276 • Published 3 days ago • 68

updated a dataset 4 days ago

PaintBench/pixels

Viewer • Updated 1 day ago • 1.04k • 43

published a dataset 4 days ago

PaintBench/pixels

Viewer • Updated 1 day ago • 1.04k • 43

upvoted a paper 8 days ago

Solaris: Building a Multiplayer Video World Model in Minecraft

Paper • 2602.22208 • Published 9 days ago • 27

upvoted 2 collections 8 days ago

Solaris-Models

Collection

Model weights for Solaris: Building a Multiplayer Video World Model in Minecraft • 1 item • Updated 4 days ago • 3

Solaris-Data

Collection

Training and evaluation datasets collected for Solaris: Building a Multiplayer Video World Model in Minecraft • 2 items • Updated 11 days ago • 3

updated a dataset 16 days ago

ellisbrown/objaverse_sims

Preview • Updated 16 days ago • 16

published a dataset 16 days ago

ellisbrown/objaverse_sims

Preview • Updated 16 days ago • 16

updated a dataset about 1 month ago

spatial-training/objaverse_vida

Preview • Updated Jan 29 • 48

published a dataset about 1 month ago

spatial-training/objaverse_vida

Preview • Updated Jan 29 • 48

authored a paper about 1 month ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Paper • 2601.16208 • Published Jan 22 • 53

upvoted a paper about 1 month ago

Scaling Text-to-Image Diffusion Transformers with Representation Autoencoders

Paper • 2601.16208 • Published Jan 22 • 53

liked 2 datasets 4 months ago

ellisbrown/TOMATO

Viewer • Updated Oct 12, 2025 • 1.48k • 206 • 1

nyu-visionx/VSI-Train-10k

Viewer • Updated Nov 7, 2025 • 10k • 345 • 4

updated a dataset 4 months ago

nyu-visionx/VSI-Bench

Viewer • Updated Nov 11, 2025 • 10.3k • 19.7k • 59

authored 3 papers 4 months ago

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6, 2025 • 38

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts

Paper • 2511.04655 • Published Nov 6, 2025 • 8

SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding

Paper • 2511.04668 • Published Nov 6, 2025 • 5

updated a dataset 4 months ago

ellisbrown/SIMS-VSI

Viewer • Updated Nov 7, 2025 • 242k • 97 • 6

Ellis Brown PRO

AI & ML interests

Recent Activity

Organizations

ellisbrown's activity