Users: A way to limit number of jobs per user #254

Closed
pacomf opened this issue Mar 2, 2023 · 6 comments · Fixed by #570
Labels: enhancement, priority: high, project: gateway

Comments

@pacomf
Member

pacomf commented Mar 2, 2023

What is the expected behavior?

When we have Keycloak integrated, we need a way to configure the maximum number of jobs each user is allowed to run in our Ray pods, so that we have a mechanism to control saturation of the system.

We can set the number of jobs allowed to run at the same time per role, so that, depending on the user, we can provide more flexibility.
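
A minimal sketch of what a per-role cap could look like, assuming the role comes from the identity provider and that the system can list who owns each running job. The role names, the numbers, and `can_start_job` are hypothetical and only illustrate the idea:

```python
from collections import Counter

# Hypothetical per-role caps on concurrently running jobs; the role names
# and numbers are illustrative, not taken from the project.
MAX_RUNNING_JOBS_BY_ROLE = {"basic": 2, "premium": 10}
DEFAULT_MAX_RUNNING_JOBS = 1  # assumed fallback for unknown roles


def can_start_job(user, role, running_jobs):
    """Return True if `user` is below the cap for `role`.

    `running_jobs` is an iterable of usernames, one entry per running job.
    """
    limit = MAX_RUNNING_JOBS_BY_ROLE.get(role, DEFAULT_MAX_RUNNING_JOBS)
    return Counter(running_jobs)[user] < limit
```

With this shape, changing the flexibility given to a role only means editing the mapping, not the check itself.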

pacomf added the enhancement label Mar 2, 2023
pacomf added this to the 0.2 milestone Mar 2, 2023
@IceKhan13
Member

One way to limit jobs per user is to create and assign a cluster per user. Each cluster would have a single head node with at least 1 CPU and 1 GB of memory, and could scale up to, say, 2 workers with 4 CPUs and 4 GB of memory each. If a user exceeds this capacity, their jobs will sit in a queue.

@pacomf
Member Author

pacomf commented Mar 2, 2023

But in this case you need to know the users, or create and assign a new cluster every time a new user registers. For an installation with thousands of users, like IQP, that is a mess and over-architected :S

@pacomf
Member Author

pacomf commented Mar 2, 2023

But it can be a solution to discuss, for sure! We need to analyze it and make a decision :)

@IceKhan13
Member

Then maybe we need a small service that accepts programs and then launches them in the cluster, which might actually be better, as we can enforce additional rules etc. inside this service.
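
A rough sketch of such a service, kept in memory for illustration. `JobGateway`, its method names, and the `launch` callable are all hypothetical; a real version would persist state and talk to the cluster:

```python
from collections import defaultdict, deque


class JobGateway:
    """Toy in-memory stand-in for a service that sits in front of the cluster."""

    def __init__(self, max_running_per_user, launch):
        self.max_running_per_user = max_running_per_user
        self.launch = launch  # hypothetical callable that submits to the cluster
        self.running = defaultdict(int)    # user -> number of running jobs
        self.pending = defaultdict(deque)  # user -> queued programs

    def submit(self, user, program):
        """Launch the program if the user is under the limit, else queue it."""
        if self.running[user] < self.max_running_per_user:
            self.running[user] += 1
            self.launch(user, program)
            return "launched"
        self.pending[user].append(program)
        return "queued"

    def job_finished(self, user):
        """Free a slot and start the user's next queued program, if any."""
        self.running[user] = max(0, self.running[user] - 1)
        if self.pending[user]:
            self.submit(user, self.pending[user].popleft())
```

Because every program passes through `submit`, additional rules (per-role limits, resource checks) could be added there without touching the cluster setup.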

@psschwei
Collaborator

psschwei commented Mar 2, 2023

It might also make sense to consider limiting by total resource usage (memory / CPU requests), which you can do in Kubernetes with resource quotas.
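
For illustration, a sketch that builds a `ResourceQuota` manifest as a Python dict and prints it as JSON, which `kubectl apply -f` also accepts. Resource quotas apply per namespace, so this assumes each user (or group of users) gets a namespace of their own; the namespace name, quota name, and values are hypothetical:

```python
import json


def resource_quota_manifest(namespace, cpu, memory):
    """Build a Kubernetes ResourceQuota capping total CPU/memory requests."""
    return {
        "apiVersion": "v1",
        "kind": "ResourceQuota",
        # "compute-quota" is an illustrative name, not one used by the project.
        "metadata": {"name": "compute-quota", "namespace": namespace},
        "spec": {"hard": {"requests.cpu": cpu, "requests.memory": memory}},
    }


# Hypothetical namespace and limits, chosen only as an example.
print(json.dumps(resource_quota_manifest("user-alice", "8", "8Gi"), indent=2))
```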

IceKhan13 self-assigned this Mar 13, 2023
@IceKhan13
Member

Reopening, as the mechanism is not in place yet.

IceKhan13 reopened this May 9, 2023
IceKhan13 added the priority: high and project: gateway labels May 11, 2023
3 participants