-
Notifications
You must be signed in to change notification settings - Fork 742
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merge Kubeflow/common to training operator #1714
Comments
/cc @alculquicondor |
I agree that we can consider merging it now that all operators are in training-operator repo. MPI operator v1 still uses this separately and we should put into some thoughts as well. |
I agree the common repository should be merged and the scripts in the common repository should be further simplified to make the training-operator easier to understand for new developers. |
@alculquicondor Can you share your thoughts on this wrt mpi operator dependency? |
From past conversations with @alculquicondor, it seems like there's preference to keep v2 MPI controller separate from training-operator due to some use cases to non-ML users. Is that still the case? Perhaps v2 can be migrated to another project under K8s sigs to make it more generic to all users and v1 will stay in training-operator to be suitable for ML use cases only. I believe it should be fine if we keep the copyright notices "Kubeflow Authors". Once Kubeflow is accepted by CNCF, everything belongs to CNCF anyways so I don't have any legal concerns. Any thoughts from others? Also cc @ahg-g |
FWIIW, the only dependencies that mpi-operator has with Yes, I have heard the desire of having a lightweight controller for MPI jobs from HPC users. kubernetes-sigs would be a good fit. SIG Apps already had interest in hosting the project. But we would have to wait until the ownership goes to CNCF. |
I like this idea. |
My team at Ant Group was using it but it's no longer the case. I don't know any other teams that are relying on this repo. @alculquicondor I think MPI operator repo can keep those constants itself so that we can remove the dependency on Kubeflow/common at once. WDYT? |
Would it be too bad to make mpi-operator depend on the constants from training-operator? |
I am not sure if that's a good idea as training-operator might be too heavy and there may be conflicts in dependencies. We only reply on very minimal set of constants so they should be easy to maintain. |
I agree with @terrytangyuan . I think, It is not worth managing overhead in handling dependencies for mpi-operator to use minimal set of constants.(If that is the concern). I leave it to you @alculquicondor Can folks give +1 to merge kubeflow/common to kubeflow/training-operator? We can take this up after creating release branch for KF release 1.7 during late Jan |
+1 |
I guess the common repo would just be archived and we can still depend on it while we don't need new constants (likely to be true for a long time). +1 |
+1 |
LGTM 👍 |
Some enterprise users may use it to write their own operators. But I think they will lock on a specified version. |
Based on the responses, |
Was a release branch opened? |
@alculquicondor Yes, we opened RC.0 branch; https://github.com/kubeflow/training-operator/tree/v1.6-branch. |
are we still on freeze? |
@alculquicondor Final release is not cut yet. I am expecting it to happen in a week |
/assign @johnugeorge |
Closing this as kubeflow/common is merged into training-operator |
|
Kubeflow/common was initially created to have a common library for all frameworks in separate repos. Since we have already merged all operators into a single training operator, do we really need a separate common repository now?
Motivation:
Related: #1703
/cc @terrytangyuan @zw0610 @gaocegege @Jeffwan
The text was updated successfully, but these errors were encountered: