In-Browser LLM Chat with RAG
HF Token:
Apply Token + Reload
Models:
Apply Models
Skip RAG:
TF WebGPU
TF WebGL
WebLLM
TF WASM
Reload Models
Trial Models
Diagnostics
backend: -
Loading model...
Send