Apply for a GPU community grant: Personal project
Hi Hugging Face team,
I'm building a small side project to provide a free, open-source API endpoint for the Qwen1.5-0.5B-Chat model. The goal is to let developers and researchers experiment with a lightweight chat model without needing their own GPU.
I have already set up a Space with the model and a Flask-based API (/v1/chat/completions). However, on the free CPU instance the model runs out of memory (or is extremely slow), and float16 inference is not supported on CPU. I've tried the optimisations available to me, but the hardware is simply insufficient.
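For context, here is a minimal sketch of the endpoint's shape. The model call is stubbed out (generate_reply is a hypothetical placeholder name for illustration; the real Space invokes Qwen1.5-0.5B-Chat at that point), and the response mirrors the usual /v1/chat/completions layout:

```python
from flask import Flask, jsonify, request

app = Flask(__name__)

def generate_reply(messages):
    # Placeholder: the actual Space runs Qwen1.5-0.5B-Chat here.
    return "stub reply"

@app.route("/v1/chat/completions", methods=["POST"])
def chat_completions():
    body = request.get_json(force=True)
    reply = generate_reply(body.get("messages", []))
    return jsonify({
        "object": "chat.completion",
        "choices": [
            {
                "index": 0,
                "message": {"role": "assistant", "content": reply},
                "finish_reason": "stop",
            }
        ],
    })
```

Keeping the response schema compatible with the common chat-completions format means existing client libraries can point at the Space with only a base-URL change.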
If granted a T4 small (or any GPU with at least 4GB VRAM), I can:
· Run the model in float16 mode, which fits comfortably.
· Keep the API publicly accessible for the open-source community.
· Document the setup so others can replicate it.
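The "fits comfortably" claim follows from simple arithmetic: roughly 0.5B parameters at 2 bytes each in float16 is under 1 GB of weights, leaving headroom for activations and the KV cache on a 4 GB card. A quick sketch (weight_memory_gb is a hypothetical helper for illustration, and it counts weights only, not runtime overhead):

```python
def weight_memory_gb(n_params: float, bytes_per_param: int) -> float:
    """Approximate memory needed for the model weights alone."""
    return n_params * bytes_per_param / 1024**3

fp16 = weight_memory_gb(0.5e9, 2)  # float16: 2 bytes per parameter
fp32 = weight_memory_gb(0.5e9, 4)  # float32: 4 bytes per parameter
print(f"float16 weights: ~{fp16:.2f} GB")  # ~0.93 GB
print(f"float32 weights: ~{fp32:.2f} GB")  # ~1.86 GB
```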
The Space is public and the code will remain open source. This is purely a non-commercial, educational project.
Thank you for considering my request!
Best,
[wd21]
Space: [model]