Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Arijit's picture
4 6 3

Arijit

array
ellisbrown's profile picture Tonic's profile picture
·
https://arijitray1993.github.io/
  • array93
  • arijitray1993

AI & ML interests

None yet

Recent Activity

updated a model 12 days ago
array/Qwen2.5-VL-SIMS
published a model 12 days ago
array/Qwen2.5-VL-SIMS
updated a model 14 days ago
array/Qwen2.5-VL-SAT
View all activity

Organizations

spatial training's profile picture

upvoted 4 papers about 2 months ago

Cambrian-S: Towards Spatial Supersensing in Video

Paper • 2511.04670 • Published Nov 6 • 37

Benchmark Designers Should "Train on the Test Set" to Expose Exploitable Non-Visual Shortcuts

Paper • 2511.04655 • Published Nov 6 • 7

COLA: How to adapt vision-language models to Compose Objects Localized with Attributes?

Paper • 2305.03689 • Published May 5, 2023 • 3

SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding

Paper • 2511.04668 • Published Nov 6 • 4
upvoted a paper 4 months ago

SAT: Dynamic Spatial Aptitude Training for Multimodal Language Models

Paper • 2412.07755 • Published Dec 10, 2024 • 2
upvoted a paper over 1 year ago

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

Paper • 2408.00754 • Published Aug 1, 2024 • 23
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs