Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
EladofWar 's Collections
cool
samsegmentation
fast-text-to-image

cool

updated 9 days ago
Upvote
-

  • Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models

    Paper • 2504.02821 • Published Apr 3 • 10

  • TimeChat-Online: 80% Visual Tokens are Naturally Redundant in Streaming Videos

    Paper • 2504.17343 • Published Apr 24 • 12

  • ViSMaP: Unsupervised Hour-long Video Summarisation by Meta-Prompting

    Paper • 2504.15921 • Published Apr 22 • 7

  • Causal-Copilot: An Autonomous Causal Analysis Agent

    Paper • 2504.13263 • Published Apr 17 • 7

  • Distilling semantically aware orders for autoregressive image generation

    Paper • 2504.17069 • Published Apr 23 • 6

  • VideoDeepResearch: Long Video Understanding With Agentic Tool Using

    Paper • 2506.10821 • Published Jun 12 • 20

  • Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models

    Paper • 2506.07177 • Published Jun 8 • 22

  • lym00/Wan2.2_T2V_A14B_VACE-test

    17B • Updated 11 days ago • 27.6k • 27
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs