M4U: Evaluating Multilingual Understanding and Reasoning for Large Multimodal Models Paper • 2405.15638 • Published May 24, 2024 • 1
BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation Paper • 2506.07530 • Published Jun 9 • 20
MoTE: Mixture of Ternary Experts for Memory-efficient Large Multimodal Models Paper • 2506.14435 • Published Jun 17 • 8
hongyuw/ft-bitvla-bitsiglipL-224px-libero_spatial-bf16 Robotics • 3B • Updated about 1 month ago • 23
BitVLA Collection 1-bit Vision-Language-Action Models for Robotics Manipulation • 9 items • Updated Jun 30 • 3
BitVLA Collection 1-bit Vision-Language-Action Models for Robotics Manipulation • 9 items • Updated Jun 30 • 3
hongyuw/ft-bitvla-bitsiglipL-224px-libero_spatial-bf16 Robotics • 3B • Updated about 1 month ago • 23
BitVLA Collection 1-bit Vision-Language-Action Models for Robotics Manipulation • 9 items • Updated Jun 30 • 3
BitVLA Collection 1-bit Vision-Language-Action Models for Robotics Manipulation • 9 items • Updated Jun 30 • 3