explorations - a andreydelpozo Collection

explorations

updated 9 days ago

random things

teknium/OpenHermes-2.5-Mistral-7B

Text Generation • 7B • Updated Feb 19, 2024 • 157k • 872
ByteDance/SDXL-Lightning

Text-to-Image • Updated Apr 3, 2024 • 165k • 2.1k
google/gemma-7b-it

Text Generation • 9B • Updated Aug 14, 2024 • 151k • 1.21k
dphn/dolphin-2.2.1-mistral-7b

Text Generation • 7B • Updated May 20, 2024 • 615 • 198
dphn/dolphin-2.5-mixtral-8x7b

Text Generation • 47B • Updated May 21, 2024 • 1.42k • 1.24k
dphn/dolphin-2.6-mistral-7b-dpo-laser

Text Generation • 7B • Updated Mar 4, 2024 • 58 • 119
ise-uiuc/Magicoder-Evol-Instruct-110K

Viewer • Updated Dec 28, 2023 • 111k • 895 • 166
Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation

Paper • 2408.15239 • Published Aug 27, 2024 • 30
WebShaper: Agentically Data Synthesizing via Information-Seeking Formalization

Paper • 2507.15061 • Published Jul 20 • 59
Ovi: Twin Backbone Cross-Modal Fusion for Audio-Video Generation

Paper • 2510.01284 • Published 17 days ago • 30
OBS-Diff: Accurate Pruning For Diffusion Models in One-Shot

Paper • 2510.06751 • Published 10 days ago • 21
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning

Paper • 2509.24372 • Published 19 days ago • 6
DINOv3

Paper • 2508.10104 • Published Aug 13 • 273
MATRIX: Mask Track Alignment for Interaction-aware Video Generation

Paper • 2510.07310 • Published 10 days ago • 35
Real-Time Object Detection Meets DINOv3

Paper • 2509.20787 • Published 23 days ago • 10
A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10 • 183
A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 257
Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10 • 157
T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published Jul 8 • 118
SingLoRA: Low Rank Adaptation Using a Single Matrix

Paper • 2507.05566 • Published Jul 8 • 112
Easy Dataset: A Unified and Extensible Framework for Synthesizing LLM Fine-Tuning Data from Unstructured Documents

Paper • 2507.04009 • Published Jul 5 • 48
Radial Attention: O(n log n) Sparse Attention with Energy Decay for Long Video Generation

Paper • 2506.19852 • Published Jun 24 • 41
KV Cache Steering for Inducing Reasoning in Small Language Models

Paper • 2507.08799 • Published Jul 11 • 40
PartCrafter: Structured 3D Mesh Generation via Compositional Latent Diffusion Transformers

Paper • 2506.05573 • Published Jun 5 • 79
Qwen3 Embedding: Advancing Text Embedding and Reranking Through Foundation Models

Paper • 2506.05176 • Published Jun 5 • 74
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Paper • 2506.09790 • Published Jun 11 • 53
SpatialLM: Training Large Language Models for Structured Indoor Modeling

Paper • 2506.07491 • Published Jun 9 • 50
Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 302
BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 97
Flow-GRPO: Training Flow Matching Models via Online RL

Paper • 2505.05470 • Published May 8 • 84
Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 81
ZeroSearch: Incentivize the Search Capability of LLMs without Searching

Paper • 2505.04588 • Published May 7 • 65