Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Optimize computational scheduler for 100s of jobs #4530

Closed
Tracked by #950
sanderegg opened this issue Jul 21, 2023 · 1 comment
Closed
Tracked by #950

Optimize computational scheduler for 100s of jobs #4530

sanderegg opened this issue Jul 21, 2023 · 1 comment
Assignees

Comments

@sanderegg
Copy link
Member

sanderegg commented Jul 21, 2023

The computational scheduler in the director-v2 shall be able to handle at least 1000 jobs concurrently without complaining.
This can be tested by using the osparc API and following the pseudo-code to maximize the load:

  • have a cluster able to run 100 job at the same time (that imply 1 core per job since the dask-workers are finding, could be 10 computers with 10 cores)

  • create 200 jobs

  • start all jobs concurrently

  • await their result

  • monitor the load on the following services:

    • director-v2
    • dask-scheduler
    • dask-worker(s)
    • dask-gateway
    • rabbit
    • wb-db-event-listener

The services should still be responsive (e.g. it should be possible to run some pipelines through the GUI for example)

Tasks

Preview Give feedback
No tasks being tracked yet.
@sanderegg sanderegg transferred this issue from ITISFoundation/osparc-issues Jul 21, 2023
@sanderegg sanderegg self-assigned this Jul 21, 2023
@sanderegg sanderegg changed the title Optimize computational scheduler for 1000s of jobs Optimize computational scheduler for 100s of jobs Jul 21, 2023
@sanderegg
Copy link
Member Author

stale, and is currently being directly tested by users.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant