MoA (Model of Agents) is a method that lets multiple AI agents (different LLMs) collaborate to generate a higher-quality response than a single model could.
MoA Chat implements a simple version of this architecture by:
Source: Together MoA Architecture Concept
MoA Chat is optimized to run on Hugging Face Spaces or any platform that supports Python 3.11+, Flask, and Docker-based containers.
Requires setting your API keys via Hugging Face's Secrets system (never expose them to the frontend).
app.py
β Flask backend server.llm/agents.py
β Query and aggregation logic for MoA system.llm/model_config.json
β Define available models and providers.templates/
β Contains index.html
and docs.html
.static/
β Contains style.css
and script.js
.Made with β€οΈ in PanamΓ‘ by Until Dot. Inspired by Together's MoA architecture.