---
title: AI-Inferoxy AI Hub
emoji: 🚀
colorFrom: purple
colorTo: blue
sdk: gradio
app_file: app.py
pinned: false
hf_oauth: true
hf_oauth_authorized_org:
- nazdev
---
## 🚀 AI‑Inferoxy AI Hub
A focused, multi‑modal AI workspace. Chat, create images, transform images, generate short videos, and synthesize speech — all routed through AI‑Inferoxy for secure, quota‑aware token management and provider failover.
### Highlights
- Chat, Image, Image‑to‑Image, Video, and TTS in one app
- Works with any HF model exposed by your proxy
- Multi‑provider routing; default provider is `auto`
- Streaming chat and curated examples
### Quick Start (Hugging Face Space)
Add Space secrets:
- `PROXY_URL`: AI‑Inferoxy server URL (e.g., `https://proxy.example.com`)
- `PROXY_KEY`: API key for your proxy
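Space secrets are exposed to the running app as environment variables, so the two values above can be read as in the minimal sketch below. The helper name `load_proxy_config` is hypothetical and is not taken from the app's source; it only illustrates failing early when a secret is missing.

```python
import os


def load_proxy_config(env=None):
    """Return (proxy_url, proxy_key), failing early if either secret is missing.

    `load_proxy_config` is a hypothetical helper, not part of the app's code.
    """
    env = os.environ if env is None else env
    missing = [name for name in ("PROXY_URL", "PROXY_KEY") if not env.get(name)]
    if missing:
        raise RuntimeError("Missing Space secrets: " + ", ".join(missing))
    # Strip a trailing slash so later path joins do not produce "//".
    return env["PROXY_URL"].rstrip("/"), env["PROXY_KEY"]
```

Passing a plain dict instead of `os.environ` makes the helper easy to exercise locally without touching real secrets.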
Org access control: rather than a custom `ALLOWED_ORGS` secret with runtime checks, restrict access by organization in the README metadata using `hf_oauth_authorized_org`, as described in the HF Spaces OAuth docs. Example:
```yaml
hf_oauth: true
hf_oauth_authorized_org:
- your-org-slug
- another-org
```
The app reads these at runtime — no extra setup required.
### How It Works
1. The app requests a valid token from AI‑Inferoxy for each call.
2. Requests are sent to the selected provider (or `auto`).
3. Status is reported back for rotation and telemetry.
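The three steps above can be sketched as a small wrapper. This is an illustration under stated assumptions, not the app's actual implementation: `get_token`, `call_provider`, and `report_status` are hypothetical callables standing in for whatever the proxy client provides, and the `"success"` / `"error"` status strings are placeholders rather than values defined by AI‑Inferoxy.

```python
from typing import Callable, TypeVar

T = TypeVar("T")


def run_with_managed_token(
    get_token: Callable[[], str],
    call_provider: Callable[[str], T],
    report_status: Callable[[str, str], None],
) -> T:
    """Run one inference call inside the token lifecycle described above.

    All three callables are hypothetical stand-ins for the proxy client.
    """
    token = get_token()  # step 1: obtain a valid token from the proxy
    try:
        result = call_provider(token)  # step 2: send the request to the provider
    except Exception:
        # step 3 (failure path): report so the proxy can rotate the token
        report_status(token, "error")
        raise
    # step 3 (success path): report for telemetry
    report_status(token, "success")
    return result
```

Injecting the callables keeps the lifecycle logic independent of any particular HTTP client, so it can be tested with simple fakes.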
### Using the App
- Chat: message → choose model/provider (`auto` by default) → tune temperature/top‑p/max tokens.
- Image: prompt → optional width/height (multiples of 8), steps, guidance, seed, negative prompt.
- Image‑to‑Image: upload base image → describe the change → generate.
- Video: brief motion prompt → optional steps/guidance/seed.
- TTS: text → pick TTS model → adjust voice/style if supported.
### Configuration
- Model id only (e.g., `openai/gpt-oss-20b`, `stabilityai/stable-diffusion-xl-base-1.0`).
- Provider from dropdown. Default is `auto`.
### Providers
Compatible with providers configured in AI‑Inferoxy, including `auto` (default), `hf-inference`, `cerebras`, `cohere`, `groq`, `together`, `fal-ai`, `replicate`, `nebius`, `nscale`, and others.
### Security
- HF OAuth validates account; org membership is enforced by Space metadata (`hf_oauth_authorized_org`).
- Inference uses proxy‑managed tokens. Secrets are Space secrets.
- RBAC, rotation, and quarantine handled by AI‑Inferoxy.
### Troubleshooting
- 401/403: verify secrets and org access.
- 402/quota: handled by proxy; retry later or switch provider.
- Image size: width/height must be divisible by 8.
- Slow/failures: try smaller models, fewer steps, or another provider.
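For the image-size rule above, a minimal sketch of checking or correcting dimensions before a request is sent. Both function names are hypothetical, and rounding down (rather than up or to nearest) is an assumption made for illustration.

```python
def snap_to_multiple_of_8(value: int, minimum: int = 8) -> int:
    """Round value down to the nearest multiple of 8, never below minimum."""
    return max(minimum, (value // 8) * 8)


def validate_size(width: int, height: int) -> None:
    """Raise ValueError if either dimension is not a positive multiple of 8."""
    for name, value in (("width", width), ("height", height)):
        if value <= 0 or value % 8 != 0:
            raise ValueError(f"{name}={value} must be a positive multiple of 8")
```

For example, `snap_to_multiple_of_8(1023)` yields `1016`, which then passes `validate_size`.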
### License
This project is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0); see the [LICENSE](LICENSE) file for details.
### Links
- Live Space: [huggingface.co/spaces/nazdridoy/inferoxy-hub](https://huggingface.co/spaces/nazdridoy/inferoxy-hub)
- AI‑Inferoxy docs: [ai-inferoxy/huggingface-hub-integration](https://nazdridoy.github.io/ai-inferoxy/)
- Gradio docs: [gradio.app/docs](https://gradio.app/docs/)
— Built with AI‑Inferoxy for intelligent token management.