Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/webcanvas integration #294

Open
wants to merge 15 commits into
base: main
Choose a base branch
from

Conversation

han032206
Copy link

Pull Request: Integrate WebCanvas Key Node Evaluation and Mind2web-live Benchmark into BrowserGym

Description

This PR officially integrates the WebCanvas key node evaluation and the Mind2web-live benchmark into BrowserGym.

Core Features

  • Key Node-Based Evaluation:

    • Implements a key node-based evaluation system to provide detailed assessments of web task processes.
  • JavaScript Event Evaluation:

    • Utilizes page JavaScript events to deliver accurate evaluations independent of the action space.
  • Debug Modules and Logging:

    • Includes various debug modules and logging functionalities to clearly display the evaluation process.
  • Mind2web-live Dataset Integration:

    • Integrates the Mind2web-live dataset to support benchmark testing.
  • Community Contributions:

    • Encourages further contributions from the community to expand and refine the evaluation framework and benchmarks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants