Performance Improvements for Large Projects #1327

Open
9 of 13 tasks
SCA-ZMT opened this issue Mar 28, 2024 · 4 comments
Assignees
Labels
PO issue Created by Product owners
Milestone

Comments

@SCA-ZMT
Contributor

SCA-ZMT commented Mar 28, 2024

User Story

GOAL

It is very important for the POs that handling large (and potentially many) projects is a smooth user experience.

This covers

  • loading/startup time and user experience within the platform
  • heavy projects, many smash/simulation projects (in the tree)
  • submitting jobs to the computational backend, monitoring jobs
  • receiving computed jobs and performing postprocessing

Singularity

  1. 3 of 7
    PO issue
    matusdrobuliak66 odeimaiz
    sanderegg

Tasks

  1. 1 of 8
    a:autoscaling a:clusters-keeper a:infra+ops
    sanderegg
  2. a:infra+ops
    YuryHrytsuk sanderegg

Event Horizon

  1. a:dynamic-sidecar a:infra+ops
    YuryHrytsuk sanderegg
  2. a:dynamic-sidecar
    GitHK sanderegg
  3. a:autoscaling bug
    sanderegg

MartinKippenberger

  1. 0 of 1
    t:enhancement
    YuryHrytsuk mrnicegyu11
    pcrespov
  2. 2 of 2
    a:dask-service a:simcore-sdk
    GitHK matusdrobuliak66
  3. pcrespov

Eisbock

  1. a:director-v2 a:dynamic-sidecar t:enhancement
    GitHK
  2. odeimaiz sanderegg
  3. 17 of 17
    a:autoscaling a:infra+ops
    sanderegg
@SCA-ZMT SCA-ZMT added the PO issue Created by Product owners label Mar 28, 2024
@SCA-ZMT SCA-ZMT changed the title from "Performace Improvements for Large Projects" to "Performance Improvements for Large Projects" Mar 28, 2024
@SCA-ZMT SCA-ZMT added the Budget label Mar 28, 2024
@SCA-ZMT
Contributor Author

SCA-ZMT commented Apr 4, 2024

@drniiken @eofli Could you please edit the description at the top with more concrete use cases and expectations?

@SCA-ZMT SCA-ZMT added this to the Leeroy Jenkins milestone May 9, 2024
@SCA-ZMT SCA-ZMT added Budget and removed Budget labels May 10, 2024
@SCA-ZMT
Contributor Author

SCA-ZMT commented May 13, 2024

@drniiken could you please update the description above? thanks! :)

@mguidon
Member

mguidon commented Nov 5, 2024

  • Costs: S3: follow up with Dustin where this is enabled
  • Performance: MaG: Need to check how this performs with the public API (we should get metrics for up/download after a fix in the code)
  • Speed tests (for uploads)
  • Optimization of EBS: cheap machines to save costs, double throughput with RAID0 without EFS, look into usage metrics
  • Speed: Why does the dynamic sidecar take so long to start?

@sanderegg
Member

sanderegg commented Jan 7, 2025

Summary for Event Horizon sprint

Done

  • Autoscaling: When hot buffer are in use the warm buffer are not replacing them osparc-simcore#6929: A missing hot buffer machine was replaced by a machine started from scratch instead of by an available warm buffer. The fix improves startup time by roughly 10 minutes when all hot buffers are used up.
  • Startup of dynamic sidecar osparc-simcore#6859: The dynamic sidecar typically takes 8 seconds to start on hot/warm buffers. Considered done until further notice.
  • RClone options tuning osparc-simcore#6858: Affects the transfer of inputs, outputs, and service data, and allows the RClone parameters to be tuned directly. This brought a massive improvement for projects that contain many small files.
  • ⚠️ To reduce costs, the overall performance of the disks was slightly reduced. This should be barely visible to users who get warm/hot machines, as the service images are already pulled. It might affect zipping/unzipping or other workloads where IOPS are important. Please contact support if unusual slowdowns are detected.

Won't be done

Planned for next sprint(s)

@pcrespov pcrespov modified the milestones: Event Horizon, Singularity Jan 24, 2025
@sanderegg sanderegg assigned matusdrobuliak66 and unassigned GitHK Jan 24, 2025

10 participants