Spaces:
Running
Running
metadata
title: URL To Audio Summary
emoji: π
colorFrom: gray
colorTo: pink
sdk: gradio
sdk_version: 5.34.2
app_file: app.py
pinned: false
license: mit
short_description: URL to Audio Summary Agent
π£οΈ URL to Audio Summary Agent (LangChain + Hugging Face)
This project is an advanced LangChain-powered agent that takes any article/blog URL, summarizes it using a CPU-friendly language model, and generates a spoken audio summary. Designed to run entirely on CPU, it's perfect for deploying on Hugging Face Spaces.
π Features
- π Fetches and parses web content from any URL
- π§ Summarizes long articles using
flan-t5-base
- π Converts summaries into speech using
espnet/kan-bayashi_ljspeech_vits
- π₯οΈ CPU-only β no GPU or API keys required
- π§± Built using LangChain, Gradio, Transformers
π Tech Stack
- LangChain: Document loader, prompt chaining
- Transformers:
flan-t5-base
for summarization - ESPnet/VITS: Natural-sounding voice TTS model
- Gradio: Easy interface for Hugging Face Spaces
π§© Usage
- Paste any blog or article URL (e.g. from Medium, BBC, etc.)
- The agent will:
- Load and extract article content
- Summarize the article intelligently
- Convert the summary into audio
- You get both:
- π Text Summary
- π Downloadable Audio Summary (WAV)
π¦ Installation
To run locally or on Spaces:
requirements.txt
gradio
langchain
transformers
torch
sentencepiece
beautifulsoup4
π‘ SEO Tags
AI article summarizer
, Text-to-Speech summarizer
, LangChain agent
, Hugging Face CPU summarizer
, URL to audio
, audio content AI
, gradio langchain tts
π§ Example Models Used
google/flan-t5-base
(Summarization)espnet/kan-bayashi_ljspeech_vits
(TTS)
π Ideal For
- Podcasters turning news into voice
- Accessibility tools
- EdTech content summarization
- Personal AI readers
πͺͺ License
MIT β fork, remix, and deploy freely.
π Credits
Built using LangChain and Hugging Face Transformers.