
|

history blame contribute delete

1.59 kB

A newer version of the Gradio SDK is available: 5.43.1

Upgrade

metadata

title: MCP Video Analysis with Llama 3
emoji: 🎥
colorFrom: purple
colorTo: blue
sdk: gradio
sdk_version: 5.33.1
app_file: app.py
pinned: false
license: mit
short_description: AI-powered video analysis with Llama 3 and Modal

🎥 MCP Video Analysis with Llama 3

This application analyzes videos by using the Model Context Protocol (MCP) to combine several AI components:

🔧 Technology Stack

Modal Backend: Scalable cloud compute for video processing
Whisper: Speech-to-text transcription
Computer Vision Models: Object detection, action recognition, and captioning
Meta Llama 3: Advanced AI for intelligent content analysis, hosted on Modal
MCP: Model Context Protocol, used to connect the services above

🎯 Features

Transcription: Extract spoken content from videos
Visual Analysis: Identify objects, actions, and scenes
Content Understanding: AI-powered insights and summaries
Custom Queries: Ask specific questions about video content

🚀 Usage

Enter a video URL (YouTube or direct link)
Optionally ask a specific question
Click "Analyze Video" to get comprehensive insights
Review both Llama 3's intelligent analysis and raw data
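
The first two steps above can be sketched in Python as a small input check that runs before any backend call. This is only an illustration: the function name `build_analysis_request` and the payload keys `video_url` and `question` are assumptions, not the request format actually used by `app.py`.

```python
from typing import Optional
from urllib.parse import urlparse


def build_analysis_request(video_url: str, question: Optional[str] = None) -> dict:
    """Validate the form inputs and assemble a payload (hypothetical field names)."""
    url = video_url.strip()
    parsed = urlparse(url)
    # Accept only absolute http(s) URLs, e.g. a YouTube page or a direct file link.
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"Not a valid http(s) video URL: {video_url!r}")

    payload = {"video_url": url}
    # The question is optional, so include it only when the user typed something.
    if question and question.strip():
        payload["question"] = question.strip()
    return payload
```

Rejecting malformed URLs up front avoids spending backend compute on a request that cannot succeed.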

🔒 Environment Variables

MODAL_LLAMA3_ENDPOINT_URL: The URL for the deployed Llama 3 Modal service.
MODAL_VIDEO_ANALYSIS_ENDPOINT_URL: The URL for the video processing Modal service (optional, has a default value).
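
A minimal sketch of how these two variables might be read at startup, assuming the app fails fast when the required one is missing. The helper `load_endpoints` is hypothetical, and `DEFAULT_VIDEO_ANALYSIS_URL` is a placeholder: this README says a default exists but does not state its value.

```python
import os

# Placeholder only: the real default is not given in this README.
DEFAULT_VIDEO_ANALYSIS_URL = "https://example.invalid/video-analysis"


def load_endpoints(env=None) -> dict:
    """Read the Modal endpoint URLs from the environment (hypothetical helper)."""
    env = os.environ if env is None else env

    # Required: there is no fallback for the Llama 3 service.
    llama3_url = env.get("MODAL_LLAMA3_ENDPOINT_URL", "").strip()
    if not llama3_url:
        raise RuntimeError("MODAL_LLAMA3_ENDPOINT_URL is required but not set")

    # Optional: fall back to the default when unset or empty.
    video_url = (
        env.get("MODAL_VIDEO_ANALYSIS_ENDPOINT_URL", "").strip()
        or DEFAULT_VIDEO_ANALYSIS_URL
    )
    return {"llama3": llama3_url, "video_analysis": video_url}
```

On a Hugging Face Space, these values would typically be set as Space secrets or variables rather than hard-coded.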

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference