MoA Chat Documentation

What is MoA Chat?

MoA (Mixture of Agents) is a method in which multiple AI agents (different LLMs) collaborate to generate a higher-quality response than a single model could produce alone.

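MoA Chat implements a simple version of this architecture: three models (LLM-A, LLM-B, and LLM-C) each respond to your message, and an aggregator model (LLM-D) combines their responses into the final answer.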

How MoA Works (Visual)

[Figure: MoA Architecture]

Source: Together MoA Architecture Concept

How to Use

  1. Click ⚙️ to open the configuration panel.
  2. Select your preferred models for LLM-A, LLM-B, LLM-C, and Aggregator (LLM-D).
  3. Type your message in the input box.
  4. Press Send.
  5. Watch multiple models collaborate for the best response!
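
The flow behind these steps can be sketched roughly as follows. This is a minimal illustration, not MoA Chat's actual code: it assumes three proposer models are queried concurrently and their drafts are handed to the aggregator in a single prompt. The names `ModelFn`, `make_stub`, `build_aggregator_prompt`, and `moa_round` are hypothetical, and the stub models stand in for real provider API calls.

```python
import asyncio
from typing import Awaitable, Callable

# Hypothetical type: any async function that maps a prompt to a reply.
ModelFn = Callable[[str], Awaitable[str]]


def make_stub(name: str) -> ModelFn:
    """Return a stand-in model that tags its reply with its name.

    A real deployment would call a provider API here instead.
    """
    async def call(prompt: str) -> str:
        return f"[{name}] answer to: {prompt}"
    return call


def build_aggregator_prompt(question: str, drafts: list[str]) -> str:
    """Combine the user's question and the proposer drafts into one prompt."""
    numbered = "\n".join(f"{i}. {d}" for i, d in enumerate(drafts, start=1))
    return (
        f"Question: {question}\n"
        f"Candidate responses:\n{numbered}\n"
        "Write one improved answer."
    )


async def moa_round(question: str, proposers: list[ModelFn], aggregator: ModelFn) -> str:
    """Run one round: fan out to the proposers, then aggregate their drafts."""
    # Query all proposers concurrently; gather returns results in input order.
    drafts = await asyncio.gather(*(p(question) for p in proposers))
    return await aggregator(build_aggregator_prompt(question, list(drafts)))


if __name__ == "__main__":
    proposers = [make_stub(n) for n in ("LLM-A", "LLM-B", "LLM-C")]
    print(asyncio.run(moa_round("What is MoA?", proposers, make_stub("LLM-D"))))
```

Running the proposers concurrently keeps the wait close to that of the slowest model rather than the sum of all three, which is one reason this layout suits a chat interface.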

Features

Deployment

MoA Chat is optimized to run on Hugging Face Spaces or any platform that supports Python 3.11+, Flask, and Docker-based containers.

Set your API keys through Hugging Face's Secrets system, and never expose them to the frontend.
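
Hugging Face Spaces makes secrets available to the running app as environment variables, so the server can read them at startup. The sketch below shows one way to do that and to fail early when a key is missing. The helper `load_api_keys` and the variable name `EXAMPLE_API_KEY` are hypothetical; use whatever secret names your providers require.

```python
import os


def load_api_keys(names: list[str]) -> dict[str, str]:
    """Read the named secrets from the environment.

    Raises RuntimeError listing every missing or empty secret, so a
    misconfigured deployment fails at startup rather than mid-request.
    """
    missing = [n for n in names if not os.environ.get(n)]
    if missing:
        raise RuntimeError(f"Missing secrets: {', '.join(missing)}")
    return {n: os.environ[n] for n in names}


if __name__ == "__main__":
    # EXAMPLE_API_KEY is a placeholder name, set here only for the demo.
    os.environ.setdefault("EXAMPLE_API_KEY", "demo-value")
    keys = load_api_keys(["EXAMPLE_API_KEY"])
    # Log only the names; the values stay on the server.
    print(sorted(keys))
```

Keep the returned dictionary on the server side only: responses sent to the browser should carry model names and generated text, never the key values.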

File Structure

Credits

Made with ❤️ in Panamá by Until Dot. Inspired by Together's MoA architecture.
