Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] flink cluster new status tracking #2425

Open
6 of 7 tasks
xujiangfeng001 opened this issue Mar 10, 2023 · 1 comment
Open
6 of 7 tasks

[Feature] flink cluster new status tracking #2425

xujiangfeng001 opened this issue Mar 10, 2023 · 1 comment
Assignees
Labels
feature/accepted This feature request is accepted

Comments

@xujiangfeng001
Copy link
Contributor

xujiangfeng001 commented Mar 10, 2023

Search before asking

  • I had searched in the feature and found no similar feature requirement.

Description

After discussion with the developers, it was decided that the new status function points for Flink cluster are as follows:

  1. Heartbeat detection:
    a. the heartbeat detection capability of the cluster is added. Different modes (yarn-session, remote, k8s-session) have different url acquisition methods, REST polling, and status acquisition.
  2. Status update:
    a. the cluster adds a status field to record the cluster status (running, shutdown, lost).
  3. Failure alarm&failover:
    a. If cluster shutdown or lost is detected, an alarm will be sent.
    b. If the job is running on the cluster, the job will alarm in batches. At this time, it is necessary to prevent the job from alarming.
    c. The cluster does not fail over itself.
    d. The running job on the cluster will trigger the failover mechanism. At this time, you need to interrupt the failover mechanism of the job and set the job status to lost.
  4. The operation logic changes:
    a. Cluster deletion: the cluster is not bound to any job, and the cluster is stopped.
    b. Cluster stop: no jobs are bound on the cluster, or all the bound jobs are not running.
    c. Job start: if the job is in remote, k8s-session, yarn-session mode, you need to check whether the cluster bound to it is running, and then you can start it.
    d. Job addition: if it is remote, k8s-session, yarn-session mode, the cluster selection drop-down box needs to filter the cluster that is not started, leaving only the cluster that is running.
    e. Job modification: If the original mode is remote, k8s-session, yarn-session, and the bound cluster is not running, you cannot save it. You can only select another mode or switch to a running cluster.

The following is task splitting:

Usage Scenario

No response

Related issues

No response

Are you willing to submit a PR?

  • Yes I am willing to submit a PR!

Code of Conduct

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature/accepted This feature request is accepted
Projects
None yet
Development

No branches or pull requests

3 participants