[Feature Request] Web toolkit #1406

Wendong-Fan · 2025-01-07T11:43:06Z

Required prerequisites

I have searched the Issue Tracker and Discussions that this hasn't already been reported. (+1 or comment there if it has.)
Consider asking first in a Discussion.

Motivation

A toolkit that supports a degree of interaction with rendered webpages and can perform web-based tasks (e.g. click elements, scroll pages, open a given URL, take a screenshot, and use an MLLM to understand the webpage content).
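
To make the requested actions concrete, here is a minimal sketch of how they could be exposed as a small action schema. All names here (`BrowserBackend`, `RecordingBackend`, `run_actions`, and the action dict keys) are hypothetical and not part of any existing toolkit; the stand-in backend only records calls so the sketch runs without a browser.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Protocol


class BrowserBackend(Protocol):
    """Hypothetical interface; a real backend might wrap Playwright or Puppeteer."""

    def open_url(self, url: str) -> None: ...
    def click(self, selector: str) -> None: ...
    def scroll(self, delta_y: int) -> None: ...
    def screenshot(self) -> bytes: ...


@dataclass
class RecordingBackend:
    """Stand-in backend that only records calls (no real browser involved)."""

    calls: list = field(default_factory=list)

    def open_url(self, url: str) -> None:
        self.calls.append(("open_url", url))

    def click(self, selector: str) -> None:
        self.calls.append(("click", selector))

    def scroll(self, delta_y: int) -> None:
        self.calls.append(("scroll", delta_y))

    def screenshot(self) -> bytes:
        self.calls.append(("screenshot",))
        return b""  # a real backend would return image bytes here


def run_actions(backend: BrowserBackend, actions: list[dict]) -> list[bytes]:
    """Dispatch action dicts to the backend and collect any screenshots taken."""
    shots: list[bytes] = []
    for action in actions:
        kind = action["type"]
        if kind == "open_url":
            backend.open_url(action["url"])
        elif kind == "click":
            backend.click(action["selector"])
        elif kind == "scroll":
            backend.scroll(action["delta_y"])
        elif kind == "screenshot":
            shots.append(backend.screenshot())
        else:
            raise ValueError(f"unknown action type: {kind}")
    return shots
```

In this sketch the collected screenshots are what would be handed to an MLLM for page understanding; that step is left out because it depends on the model API chosen.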

Example task:

{
    "Question": "Eva Draconis has a personal website which can be accessed on her YouTube page. What is the meaning of the only symbol seen in the top banner that has a curved line that isn't a circle or a portion of a circle? Answer without punctuation.",
    "Final answer": "War is not here this is a land of peace",
    "Annotation Metadata": {
        "Steps": "1. By googling Eva Draconis youtube, you can find her channel.\n2. In her about section, she has written her website URL, orionmindproject.com.\n3. Entering this website, you can see a series of symbols at the top, and the text \"> see what the symbols mean here\" below it.\n4. Reading through the entries, you can see a short description of some of the symbols.\n5. The only symbol with a curved line that isn't a circle or a portion of a circle is the last one.\n6. Note that the symbol supposedly means \"War is not here, this is a land of peace.\"",
        "Number of steps": "6",
        "How long did this take?": "30 minutes.",
        "Tools": "1. A web browser.\n2. A search engine.\n3. Access to YouTube\n4. Image recognition tools",
        "Number of tools": "4"
    }
}

Solution

Study existing solutions such as:
https://www.browserbase.com/ (https://docs.stagehand.dev/get_started/introduction)
https://github.com/steel-dev/steel-browser
https://pptr.dev/guides/getting-started
https://playwright.dev/

Alternatives

No response

Additional context

No response


Aaron617 · 2025-01-08T14:17:52Z

I studied these solutions:

stagehand: compared to other libraries, stagehand provides natural-language APIs (act, extract, and observe) on top of Playwright. Its key feature is a lightweight, model-agnostic framework for executing atomic web tasks from natural-language instructions (e.g., "Click the link to the quickstart").
steel-browser: from my perspective, its key features are 1) post-processing of page data (easily extract page data as cleaned HTML, markdown, PDFs, or screenshots), 2) bypassing anti-bot measures, and 3) optimizing data formats to reduce LLM token usage.
Puppeteer, Selenium, and Playwright are similar to one another.
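
For comparison, the lower-level primitives that these libraries expose can be sketched with Playwright's Python sync API. This is only an illustration under assumptions: the function name `capture_after_click` and its parameters are hypothetical, the scroll distance is arbitrary, and it presumes Playwright is installed (`pip install playwright`, then `playwright install chromium`).

```python
def capture_after_click(url: str, selector: str, out_path: str) -> bytes:
    """Open a page, scroll, click one element, and save a full-page screenshot."""
    # Imported lazily so this module loads even where Playwright is not installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            page.goto(url)
            page.mouse.wheel(0, 600)  # scroll down by 600 pixels (arbitrary)
            page.click(selector)      # waits for the element, then clicks it
            # The returned PNG bytes could then be passed to an MLLM.
            return page.screenshot(path=out_path, full_page=True)
        finally:
            browser.close()
```

A natural-language layer such as stagehand's act/extract/observe would sit above calls like these, choosing the selector and action from an instruction instead of having them hard-coded.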

Wendong-Fan · 2025-01-09T17:32:21Z

Lead: @X-TRON404. Support & review: @koch3092, @Asher-hss, @Aaron617.

Wendong-Fan added the New Feature, call for contribution, and P0 (Task with high level priority) labels Jan 7, 2025

Wendong-Fan added this to Project Camel Jan 7, 2025

Wendong-Fan assigned Asher-hss and koch3092 Jan 7, 2025

Wendong-Fan removed the call for contribution label Jan 7, 2025

Wendong-Fan assigned willshang76 and unassigned willshang76 Jan 8, 2025

Wendong-Fan assigned X-TRON404 Jan 9, 2025

Wendong-Fan added this to the Sprint 21 milestone Jan 9, 2025

Wendong-Fan assigned Aaron617 Jan 9, 2025

X-TRON404 added a commit that referenced this issue Jan 20, 2025

feat:web toolkit with stagehand (#1406)

363d304

X-TRON404 linked a pull request Jan 20, 2025 that will close this issue

feat:web toolkit with stagehand (#1406) #1471

Open

12 tasks

Wendong-Fan linked a pull request Jan 21, 2025 that will close this issue

feat:web toolkit with stagehand (#1406) #1471

Open

12 tasks
