Spaces:
Running
Running
File size: 2,184 Bytes
6cd48b1 a6ac13d 6cd48b1 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90 91 92 93 94 |
---
title: URL To Audio Summary
emoji: π
colorFrom: gray
colorTo: pink
sdk: gradio
sdk_version: 5.34.2
app_file: app.py
pinned: false
license: mit
short_description: URL to Audio Summary Agent
---
# π£οΈ URL to Audio Summary Agent (LangChain + Hugging Face)
This project is an advanced LangChain-powered agent that takes any article/blog URL, summarizes it using a CPU-friendly language model, and generates a spoken audio summary. Designed to run entirely on **CPU**, it's perfect for deploying on **Hugging Face Spaces**.
---
## π Features
- π Fetches and parses web content from any URL
- π§ Summarizes long articles using `flan-t5-base`
- π Converts summaries into speech using `espnet/kan-bayashi_ljspeech_vits`
- π₯οΈ CPU-only β no GPU or API keys required
- π§± Built using LangChain, Gradio, Transformers
---
## π Tech Stack
- **LangChain**: Document loader, prompt chaining
- **Transformers**: `flan-t5-base` for summarization
- **ESPnet/VITS**: Natural-sounding voice TTS model
- **Gradio**: Easy interface for Hugging Face Spaces
---
## π§© Usage
1. Paste any blog or article URL (e.g. from Medium, BBC, etc.)
2. The agent will:
- Load and extract article content
- Summarize the article intelligently
- Convert the summary into audio
3. You get both:
- π Text Summary
- π Downloadable Audio Summary (WAV)
---
## π¦ Installation
To run locally or on Spaces:
### `requirements.txt`
```txt
gradio
langchain
transformers
torch
sentencepiece
beautifulsoup4
```
---
## π‘ SEO Tags
`AI article summarizer`, `Text-to-Speech summarizer`, `LangChain agent`, `Hugging Face CPU summarizer`, `URL to audio`, `audio content AI`, `gradio langchain tts`
---
## π§ Example Models Used
- `google/flan-t5-base` (Summarization)
- `espnet/kan-bayashi_ljspeech_vits` (TTS)
---
## π Ideal For
- Podcasters turning news into voice
- Accessibility tools
- EdTech content summarization
- Personal AI readers
---
## πͺͺ License
MIT β fork, remix, and deploy freely.
---
## π Credits
Built using [LangChain](https://www.langchain.com/) and [Hugging Face Transformers](https://huggingface.co/transformers/).
|