Cosmos Collection ⚠️ This collection is archived. https://huggingface.co/collections/nvidia/cosmos-predict25 • 31 items • Updated 2 days ago • 299
Evaluating Language Models as Synthetic Data Generators Paper • 2412.03679 • Published Dec 4, 2024 • 48
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated May 1 • 574
A Simulation Benchmark for Autonomous Racing with Large-Scale Human Data Paper • 2407.16680 • Published Jul 23, 2024 • 12
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy Paper • 2406.20095 • Published Jun 28, 2024 • 18
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion Paper • 2407.01392 • Published Jul 1, 2024 • 44
Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models Paper • 2404.02575 • Published Apr 3, 2024 • 50
SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion Paper • 2403.12008 • Published Mar 18, 2024 • 20
Larimar: Large Language Models with Episodic Memory Control Paper • 2403.11901 • Published Mar 18, 2024 • 33
Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers Paper • 2403.12943 • Published Mar 19, 2024 • 15
SceneScript: Reconstructing Scenes With An Autoregressive Structured Language Model Paper • 2403.13064 • Published Mar 19, 2024 • 31
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework Paper • 2403.13248 • Published Mar 20, 2024 • 78
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper • 2401.10774 • Published Jan 19, 2024 • 59
Cached Transformers: Improving Transformers with Differentiable Memory Cache Paper • 2312.12742 • Published Dec 20, 2023 • 13
Generative Multimodal Models are In-Context Learners Paper • 2312.13286 • Published Dec 20, 2023 • 36
PowerInfer: Fast Large Language Model Serving with a Consumer-grade GPU Paper • 2312.12456 • Published Dec 16, 2023 • 44