Spaces:
Running
Running
title: URL To Audio Summary | |
emoji: π | |
colorFrom: gray | |
colorTo: pink | |
sdk: gradio | |
sdk_version: 5.34.2 | |
app_file: app.py | |
pinned: false | |
license: mit | |
short_description: URL to Audio Summary Agent | |
# π£οΈ URL to Audio Summary Agent (LangChain + Hugging Face) | |
This project is an advanced LangChain-powered agent that takes any article/blog URL, summarizes it using a CPU-friendly language model, and generates a spoken audio summary. Designed to run entirely on **CPU**, it's perfect for deploying on **Hugging Face Spaces**. | |
--- | |
## π Features | |
- π Fetches and parses web content from any URL | |
- π§ Summarizes long articles using `flan-t5-base` | |
- π Converts summaries into speech using `espnet/kan-bayashi_ljspeech_vits` | |
- π₯οΈ CPU-only β no GPU or API keys required | |
- π§± Built using LangChain, Gradio, Transformers | |
--- | |
## π Tech Stack | |
- **LangChain**: Document loader, prompt chaining | |
- **Transformers**: `flan-t5-base` for summarization | |
- **ESPnet/VITS**: Natural-sounding voice TTS model | |
- **Gradio**: Easy interface for Hugging Face Spaces | |
--- | |
## π§© Usage | |
1. Paste any blog or article URL (e.g. from Medium, BBC, etc.) | |
2. The agent will: | |
- Load and extract article content | |
- Summarize the article intelligently | |
- Convert the summary into audio | |
3. You get both: | |
- π Text Summary | |
- π Downloadable Audio Summary (WAV) | |
--- | |
## π¦ Installation | |
To run locally or on Spaces: | |
### `requirements.txt` | |
```txt | |
gradio | |
langchain | |
transformers | |
torch | |
sentencepiece | |
beautifulsoup4 | |
``` | |
--- | |
## π‘ SEO Tags | |
`AI article summarizer`, `Text-to-Speech summarizer`, `LangChain agent`, `Hugging Face CPU summarizer`, `URL to audio`, `audio content AI`, `gradio langchain tts` | |
--- | |
## π§ Example Models Used | |
- `google/flan-t5-base` (Summarization) | |
- `espnet/kan-bayashi_ljspeech_vits` (TTS) | |
--- | |
## π Ideal For | |
- Podcasters turning news into voice | |
- Accessibility tools | |
- EdTech content summarization | |
- Personal AI readers | |
--- | |
## πͺͺ License | |
MIT β fork, remix, and deploy freely. | |
--- | |
## π Credits | |
Built using [LangChain](https://www.langchain.com/) and [Hugging Face Transformers](https://huggingface.co/transformers/). | |