---
title: browser-use-webui
app_file: webui.py
sdk: gradio
sdk_version: 5.9.1
---

# Browser-Use WebUI
## Background

This project builds on [browser-use](https://github.com/browser-use/browser-use), which is designed to make websites accessible to AI agents. It extends the original capabilities with:

1. **A Brand New WebUI:** A web interface that supports a wide range of `browser-use` functionality and is designed for easy interaction with the browser agent.
2. **Expanded LLM Support:** Integrated support for several Large Language Models (LLMs), including Gemini, OpenAI, Azure OpenAI, Anthropic, and DeepSeek. We plan to add support for more models in the future.
3. **Custom Browser Support:** You can use your own browser with the tool, which avoids logging in to sites again or dealing with other authentication challenges. This feature also supports high-definition screen recording.
4. **Customized Agent:** A custom agent that enhances `browser-use` with optimized prompts.

<video src="https://github.com/user-attachments/assets/58c0f59e-02b4-4413-aba8-6184616bf181" controls="controls" width="500" height="300" >Your browser does not support playing this video!</video>

## Environment Installation

1. **Python Version:** Ensure you have Python 3.11 or higher installed.
2. **Install `browser-use`:**
   ```bash
   pip install browser-use
   ```
3. **Install Playwright:**
   ```bash
   playwright install
   ```
4. **Install Dependencies:**
   ```bash
   pip install -r requirements.txt
   ```
5. **Configure Environment Variables:**
   - Copy `.env.example` to `.env` and set your environment variables, including the API keys for your LLM.
   - **If using your own browser:**
     - Set `CHROME_PATH` to the executable path of your browser (e.g., `C:\Program Files\Google\Chrome\Application\chrome.exe` on Windows).
     - Set `CHROME_USER_DATA` to the user data directory of your browser (e.g., `C:\Users\<YourUsername>\AppData\Local\Google\Chrome\User Data`).
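
Before launching, it can help to confirm that the prerequisites above are in place. The following is a minimal sketch, not part of this project: `check_environment` is a hypothetical helper that checks the Python version and, if `CHROME_PATH` or `CHROME_USER_DATA` is set, that it points to an existing file or directory. It reads the process environment only, so it assumes the values from `.env` have already been loaded or exported.

```python
import os
import sys
from pathlib import Path


def check_environment(env=None, version_info=None):
    """Return a list of human-readable problems; an empty list means OK.

    Hypothetical helper for illustration; the project may validate differently.
    """
    env = os.environ if env is None else env
    version_info = sys.version_info if version_info is None else version_info
    problems = []

    # The README asks for Python 3.11 or higher.
    if tuple(version_info[:2]) < (3, 11):
        problems.append("Python 3.11 or higher is required")

    # Both variables are optional; they only matter when using your own browser.
    chrome_path = env.get("CHROME_PATH", "")
    user_data = env.get("CHROME_USER_DATA", "")

    if chrome_path and not Path(chrome_path).is_file():
        problems.append(f"CHROME_PATH does not point to a file: {chrome_path}")
    if user_data and not Path(user_data).is_dir():
        problems.append(f"CHROME_USER_DATA is not a directory: {user_data}")
    return problems


if __name__ == "__main__":
    for problem in check_environment():
        print("WARNING:", problem)
```

Passing `env` and `version_info` explicitly makes the check easy to exercise without touching the real environment.
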
## Usage

1. **Run the WebUI:**
   ```bash
   python webui.py --ip 127.0.0.1 --port 7788
   ```
2. **Access the WebUI:** Open your web browser and navigate to `http://127.0.0.1:7788`.
3. **Using Your Own Browser:**
   - Close all Chrome windows.
   - Open the WebUI in a non-Chrome browser, such as Firefox or Edge. This is important because the persistent browser context uses your Chrome data when the agent runs.
   - Check the "Use Own Browser" option in the Browser Settings.
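
To illustrate how the `--ip` and `--port` flags relate to the address you open in the browser, here is a small sketch using `argparse`. It is not the project's actual `webui.py`: `parse_args` and `webui_url` are hypothetical names, and the defaults are simply copied from the example command above, so the real defaults may differ.

```python
import argparse


def parse_args(argv=None):
    """Parse the two flags shown in the run command (illustrative only)."""
    parser = argparse.ArgumentParser(description="Launch the WebUI (sketch)")
    # Assumed defaults, mirroring the example command in this README.
    parser.add_argument("--ip", default="127.0.0.1", help="address to bind to")
    parser.add_argument("--port", type=int, default=7788, help="port to listen on")
    return parser.parse_args(argv)


def webui_url(ip, port):
    """Build the URL to open in the browser for a given bind address and port."""
    return f"http://{ip}:{port}"
```

For example, `webui_url("127.0.0.1", 7788)` gives the same address as in step 2. Changing `--port` on the command line means changing the port in the URL accordingly.
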

# browser-use-gemini