Collections
Discover the best community collections!
Collections including paper arxiv:2310.18660
-
Attention Is All You Need
Paper ⢠1706.03762 ⢠Published ⢠77 -
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Paper ⢠2307.08691 ⢠Published ⢠9 -
Mixtral of Experts
Paper ⢠2401.04088 ⢠Published ⢠160 -
Mistral 7B
Paper ⢠2310.06825 ⢠Published ⢠51
-
Foundation Models for Generalist Geospatial Artificial Intelligence
Paper ⢠2310.18660 ⢠Published ⢠11 -
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization
Paper ⢠2309.16020 ⢠Published -
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
Paper ⢠2312.02155 ⢠Published ⢠15 -
GPSFormer: A Global Perception and Local Structure Fitting-based Transformer for Point Cloud Understanding
Paper ⢠2407.13519 ⢠Published
-
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper ⢠2409.02097 ⢠Published ⢠35 -
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
Paper ⢠2409.11406 ⢠Published ⢠28 -
Diffusion Models Are Real-Time Game Engines
Paper ⢠2408.14837 ⢠Published ⢠127 -
Segment Anything with Multiple Modalities
Paper ⢠2408.09085 ⢠Published ⢠23
-
Visual Instruction Tuning
Paper ⢠2304.08485 ⢠Published ⢠17 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper ⢠2311.05437 ⢠Published ⢠51 -
Improved Baselines with Visual Instruction Tuning
Paper ⢠2310.03744 ⢠Published ⢠38 -
Aligning Large Multimodal Models with Factually Augmented RLHF
Paper ⢠2309.14525 ⢠Published ⢠31
-
Foundation Models for Generalist Geospatial Artificial Intelligence
Paper ⢠2310.18660 ⢠Published ⢠11 -
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization
Paper ⢠2309.16020 ⢠Published -
GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis
Paper ⢠2312.02155 ⢠Published ⢠15 -
GPSFormer: A Global Perception and Local Structure Fitting-based Transformer for Point Cloud Understanding
Paper ⢠2407.13519 ⢠Published
-
LinFusion: 1 GPU, 1 Minute, 16K Image
Paper ⢠2409.02097 ⢠Published ⢠35 -
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion
Paper ⢠2409.11406 ⢠Published ⢠28 -
Diffusion Models Are Real-Time Game Engines
Paper ⢠2408.14837 ⢠Published ⢠127 -
Segment Anything with Multiple Modalities
Paper ⢠2408.09085 ⢠Published ⢠23
-
Visual Instruction Tuning
Paper ⢠2304.08485 ⢠Published ⢠17 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper ⢠2311.05437 ⢠Published ⢠51 -
Improved Baselines with Visual Instruction Tuning
Paper ⢠2310.03744 ⢠Published ⢠38 -
Aligning Large Multimodal Models with Factually Augmented RLHF
Paper ⢠2309.14525 ⢠Published ⢠31
-
Attention Is All You Need
Paper ⢠1706.03762 ⢠Published ⢠77 -
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Paper ⢠2307.08691 ⢠Published ⢠9 -
Mixtral of Experts
Paper ⢠2401.04088 ⢠Published ⢠160 -
Mistral 7B
Paper ⢠2310.06825 ⢠Published ⢠51