Frontend-Backend separation support for providing multi-cluster joint management. #5605

siaimes · 2021-08-18T02:02:24Z

What would you like to be added:
Frontend-Backend separation support for providing multi-cluster joint management.
Why is this needed:
For large-scale data centers, it is unwise to deploy all nodes into one k8s cluster, so that if the cluster fails, all machines will not be able to provide services normally. However, splitting all nodes into multiple clusters will bring additional burdens to Operations management and end-users.
Without this feature, how does the current module work:
A Frontend mangas a Backend, thus forming a cluster, and a cluster maintains a user database.
Components that may involve changes:
Separate the Frontend from the cluster and provide multiple Backend access capabilities. When a user submits a job, before selecting a virtual cluster, first select a cluster, so there is no significant difference in usage from before. But for Operations management, the probability of SPOF will be reduced. Version upgrades can also be done in batches to minimize uncertainty.

suiguoxin · 2021-08-23T05:07:19Z

@siaimes Thanks for the proposal. Is this what you need ? We have supported job transfer among different clusters since v1.4.

siaimes · 2021-08-23T07:10:31Z

@suiguoxin Thank you for your reply.

Job transfer is not what I wanted.

For clarity, I drew a sketch as an illustration.

We can provide an option that allows users to configure a frontend (a single-node cluster or a node in a cluster) and connect to the back-end of other clusters.

The benefits are as follows:

Split a large k8s cluster into multiple small k8s clusters to reduce the probability that a single point of failure will cause the entire system to become unusable (The frontend node needs high availability);
We can provide a unified user management system to reduce the pressure on Operations management;
As long as the frontend node and the master node can access each other, the deployment of large-scale clusters with nodes distributed in multiple data centers is supported;
The version upgrade can be rolled separately for each k8s cluster to ensure that the frontend is always available.

mydmdm · 2021-11-18T15:20:36Z

seems the Job specialization in #4801 is what you need as the protocol and backend support. However, due to some resource limits, we haven't gotten a chance to implement. Hope this can inspire your ideas.

siaimes closed this as not planned Won't fix, can't repro, duplicate, stale Jun 12, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Frontend-Backend separation support for providing multi-cluster joint management. #5605

Frontend-Backend separation support for providing multi-cluster joint management. #5605

siaimes commented Aug 18, 2021 •

edited

Loading

suiguoxin commented Aug 23, 2021

siaimes commented Aug 23, 2021

mydmdm commented Nov 18, 2021

Frontend-Backend separation support for providing multi-cluster joint management. #5605

Frontend-Backend separation support for providing multi-cluster joint management. #5605

Comments

siaimes commented Aug 18, 2021 • edited Loading

suiguoxin commented Aug 23, 2021

siaimes commented Aug 23, 2021

mydmdm commented Nov 18, 2021

siaimes commented Aug 18, 2021 •

edited

Loading