Cluster component repr hierarchy #5216

jacobtomlinson · 2021-08-16T15:09:32Z

When launching a cluster with a cluster manager like SSHCluster, worker information exists in four different places:

On the worker system that I am SSHing into there is an instance of distributed.worker.Worker.
On my local system there is an instance of distributed.deploy.ssh.Worker which manages the SSH subprocess. This is a subclass of ProcessInterface which is being discussed in this PR.
On the scheduler system there is an instance of distributed.scheduler.Scheduler which manages a dictionary of scheduler_info that is kept in sync by the worker heartbeat.
On my local system both the SSHCluster and Client objects have a copy of the scheduler_info dictionary from the Scheduler via the RPC.

Today the HTML repr which shows scheduler and worker info is on that scheduler_info object. This is because the scheduler_info is the simplest way of getting and showing this information to the user.

There are some things to think about here:

Do the Worker and ProcessInterface objects have all the same information about the workers that scheduler_info does? (I think not.)
Should distributed.worker.Worker and distributed.deploy.ssh.Worker have reprs that look like the worker dropdowns in scheduler_info?
Should Cluster and Client reuse the reprs from the Worker objects instead of creating their own representation for them?

Originally posted by @jacobtomlinson in #5181 (comment)

GenevieveBuckley · 2021-08-17T00:00:59Z

Thanks @jacobtomlinson for starting this discussion.

> Today the HTML repr which shows scheduler and worker info is on that scheduler_info object. This is because the scheduler_info is the simplest way of getting and showing this information to the user.

I imagine there are a lot of people for whom scheduler_info is the only thing they look at regularly for information about the scheduler/workers. I think it's good to have this kind of "one stop shop", but it's worthwhile remembering that lots of people won't go digging around in other places.

> Should Cluster and Client reuse the reprs from the Worker objects instead of creating their own representation for them?

I'd say yes, this is probably ideal.
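
As a rough illustration of what "reuse the reprs from the Worker objects" could mean, here is a minimal standard-library sketch. `WorkerSketch` and `ClusterSketch` are hypothetical stand-ins invented for this example, not classes from distributed, and the HTML layout is only an assumption about what a worker dropdown might contain.

```python
from html import escape


class WorkerSketch:
    """Hypothetical stand-in for a worker-like object (not a distributed class)."""

    def __init__(self, name, nthreads):
        self.name = name
        self.nthreads = nthreads

    def _repr_html_(self):
        # A collapsible "dropdown" for a single worker.
        return (
            f"<details><summary>Worker: {escape(self.name)}</summary>"
            f"<p>Threads: {self.nthreads}</p></details>"
        )


class ClusterSketch:
    """Hypothetical container that reuses each worker's own HTML repr."""

    def __init__(self, workers):
        self.workers = list(workers)

    def _repr_html_(self):
        # Delegate to the workers instead of building a separate representation.
        body = "".join(worker._repr_html_() for worker in self.workers)
        return f"<div><p>Workers: {len(self.workers)}</p>{body}</div>"


workers = [WorkerSketch("worker-0", 2), WorkerSketch("worker-1", 4)]
cluster_html = ClusterSketch(workers)._repr_html_()
```

With this arrangement, a change to the worker repr shows up in the container repr automatically. It does assume the local objects hold the same fields that scheduler_info exposes, which is one of the open questions above.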

jacobtomlinson · 2021-08-17T08:55:06Z

Thanks @GenevieveBuckley.

Given you think we should do both, this seems like a good opportunity to make use of jinja2 includes once dask/dask#8019 lands. That way we can create a worker template that is used by the Worker repr and also included in the scheduler_info repr.
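
A minimal sketch of the include idea, assuming jinja2 is installed. The template names (`worker.html.j2`, `scheduler_info.html.j2`), the helper functions, and the sample data are all hypothetical and chosen only for illustration; they are not the templates or APIs from dask/dask#8019.

```python
import jinja2

# Hypothetical templates held in memory so the example is self-contained.
TEMPLATES = {
    "worker.html.j2": (
        "<details><summary>Worker: {{ worker.name }}</summary>"
        "<p>Threads: {{ worker.nthreads }}</p></details>"
    ),
    "scheduler_info.html.j2": (
        "<div><p>Scheduler: {{ address }}</p>"
        # The loop variable `worker` is visible inside the included template.
        '{% for worker in workers %}{% include "worker.html.j2" %}{% endfor %}'
        "</div>"
    ),
}

env = jinja2.Environment(loader=jinja2.DictLoader(TEMPLATES), autoescape=True)


def render_worker(worker):
    """Render one worker; a Worker repr could call something like this."""
    return env.get_template("worker.html.j2").render(worker=worker)


def render_scheduler_info(info):
    """Render the whole dictionary, including the worker template per worker."""
    return env.get_template("scheduler_info.html.j2").render(
        address=info["address"], workers=list(info["workers"].values())
    )


# Made-up sample data with an assumed, simplified scheduler_info-like shape.
sample_info = {
    "address": "tcp://10.0.0.1:8786",
    "workers": {
        "tcp://10.0.0.2:40000": {"name": "worker-0", "nthreads": 2},
        "tcp://10.0.0.3:40000": {"name": "worker-1", "nthreads": 4},
    },
}

worker_html = render_worker(sample_info["workers"]["tcp://10.0.0.2:40000"])
scheduler_html = render_scheduler_info(sample_info)
```

Because both reprs render the same `worker.html.j2`, the fragment produced for a single worker appears verbatim inside the scheduler_info output, so the two views cannot drift apart.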
