Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

internlm
/
Spatial-SSRL-3B

Image-Text-to-Text
Transformers
Safetensors
English
multimodal
spatial
sptial understanding
self-supervised learning
conversational
Model card Files Files and versions
xet
Community
Spatial-SSRL-3B / assets
15 MB
  • 2 contributors
History: 3 commits
yuhangzang's picture
yuhangzang
Delete assets/nothing.txt
d81b3ec verified 4 days ago
  • comparison_1029final.png
    3.38 MB
    xet
    Upload 4 files 4 days ago
  • exp_result.png
    216 kB
    xet
    Upload 4 files 4 days ago
  • pipeline_1029final.png
    7.84 MB
    xet
    Upload 4 files 4 days ago
  • teaser_1029final.png
    3.58 MB
    xet
    Upload 4 files 4 days ago