Web Voyager Chat Assistant

Web Voyager is an AI agent that autonomously navigates and interacts with web pages using the Playwright browser. It uses GPT-4o (customizable) to interpret web content and make decisions on how to interact with the page to accomplish user-defined tasks.

This is a version where it is included as a chat window inside the browser, to be used as a personal assistant while browsing.

Features

Autonomous web navigation and interaction
Visual element recognition and labeling
Task-oriented decision making
HTML report generation with screenshots

Cost Warning

⚠️ Important: This project uses GPT-4o for analyzing web pages and making decisions. Each interaction typically involves processing screenshots and multiple API calls. Costs can accumulate quickly, especially during extended browsing sessions. Please monitor your OpenAI API usage carefully.

Requirements

Python 3.7+
Playwright
LangChain
OpenAI API key

Usage

Set up environment variables:
- LANGCHAIN_API_KEY
- OPENAI_API_KEY
- LANGCHAIN_TRACING_V2=true
- LANGCHAIN_API_KEY=<your-api-key>
Start the script browse.py. Playwright browser will open with a chat window where you can ask for assistance while browsing the web.
To custimiza the profile picture, replace the "me.jpeg" image.
View the generated web_voyager_results.html for a step-by-step breakdown of the agent's actions and screenshots.

How it Works

The agent takes a screenshot of the current page
It annotates interactive elements with bounding boxes
GPT-4 analyzes the page content and decides on the next action
The agent performs the action (click, type, scroll, etc.)
This process repeats until the task is completed

Limitations

Requires a valid OpenAI API key with GPT-4 access
Performance may vary depending on the complexity of the web pages and tasks
Extended sessions can incur significant API costs due to frequent image processing and GPT-4 calls

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
.gitignore		.gitignore
README.md		README.md
browse.py		browse.py
mark_page.js		mark_page.js
me.jpeg		me.jpeg
requirements.txt		requirements.txt
voyager.py		voyager.py
web_voyager_results.html		web_voyager_results.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Voyager Chat Assistant

Features

Cost Warning

Requirements

Usage

How it Works

Limitations

About

Releases

Packages

Languages

eli6/web-voyager

Folders and files

Latest commit

History

Repository files navigation

Web Voyager Chat Assistant

Features

Cost Warning

Requirements

Usage

How it Works

Limitations

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages