
feat: Live resource view #267

Open · wants to merge 17 commits into main from feat/live-resource-view
Conversation

@jannfis (Collaborator) commented Jan 6, 2025

What does this PR do / why we need it:

This PR introduces functionality to provide a live resource view from the agents to the control plane. There are still a few caveats, and some things to sort out.

The general architecture is documented here.

It's also the prerequisite for other upcoming features, such as log view, resource actions, and resource manipulation, which is why I'd like to get the code in early.

The CLI components to manipulate cluster secrets will be a follow-up PR.

Which issue(s) this PR fixes:

Fixes #?

How to test changes / Special notes to the reviewer:

The CLI code for managing cluster secrets will be submitted in a separate PR, so the code in here is potentially difficult to test.

Checklist

  • Documentation update is required by this PR (and has been updated) OR no documentation update is required.

@jannfis jannfis force-pushed the feat/live-resource-view branch from d83a0f9 to a7c58b7 Compare January 6, 2025 15:33
@codecov-commenter commented Jan 6, 2025

Codecov Report

Attention: Patch coverage is 47.33010% with 651 lines in your changes missing coverage. Please review.

Project coverage is 48.41%. Comparing base (d16afb3) to head (dd5d108).
Report is 2 commits behind head on main.

Files with missing lines Patch % Lines
cmd/principal/main.go 0.00% 93 Missing ⚠️
principal/server.go 14.94% 68 Missing and 6 partials ⚠️
agent/inbound.go 10.25% 70 Missing ⚠️
cmd/cmdutil/log.go 0.00% 51 Missing ⚠️
internal/event/event.go 7.54% 49 Missing ⚠️
internal/argocd/cluster/informer.go 58.82% 26 Missing and 9 partials ⚠️
principal/resource.go 70.17% 27 Missing and 7 partials ⚠️
internal/argocd/cluster/conversion.go 42.30% 18 Missing and 12 partials ⚠️
cmd/cmdutil/term.go 0.00% 24 Missing ⚠️
principal/resourceproxy/resourceproxy.go 81.35% 14 Missing and 8 partials ⚠️
... and 17 more
Additional details and impacted files
@@            Coverage Diff             @@
##             main     #267      +/-   ##
==========================================
+ Coverage   48.20%   48.41%   +0.21%     
==========================================
  Files          55       72      +17     
  Lines        5066     6219    +1153     
==========================================
+ Hits         2442     3011     +569     
- Misses       2424     2949     +525     
- Partials      200      259      +59     

☔ View full report in Codecov by Sentry.

@jannfis jannfis force-pushed the feat/live-resource-view branch 2 times, most recently from 262d0e2 to ad9b02d Compare January 8, 2025 17:08
@jannfis jannfis force-pushed the feat/live-resource-view branch from ad9b02d to 1bd19ed Compare January 9, 2025 02:00
Signed-off-by: jannfis <[email protected]>
Signed-off-by: jannfis <[email protected]>
Signed-off-by: jannfis <[email protected]>
Signed-off-by: jannfis <[email protected]>
@jannfis jannfis marked this pull request as ready for review January 14, 2025 01:07
@chetan-rns (Collaborator) left a comment

Added a couple of small questions. Everything else looks good to me.

principal/resource.go Show resolved Hide resolved
principal/server.go Outdated Show resolved Hide resolved
principal/server.go Show resolved Hide resolved
@jgwest (Member) left a comment

Looks great @jannfis, posted my thoughts below on anything that jumped out at me while working through the PR.

And one final question: I didn't see any E2E-style tests for the feature, what are your plans here?

agent/inbound.go Show resolved Hide resolved
agent/inbound.go Show resolved Hide resolved
internal/argocd/cluster/conversion.go Show resolved Hide resolved
internal/argocd/cluster/manager.go Outdated Show resolved Hide resolved
internal/argocd/cluster/manager.go Show resolved Hide resolved
test/testutil/context.go Outdated Show resolved Hide resolved
test/testutil/waitfor.go Outdated Show resolved Hide resolved
internal/resourceproxy/resourceproxy.go Outdated Show resolved Hide resolved
principal/event.go Show resolved Hide resolved
internal/resourceproxy/resourceproxy.go Outdated Show resolved Hide resolved
Comment on lines +118 to +120
if err != nil {
logCtx.Errorf("could not request resource: %v", err)
status = err
Collaborator:
If the err here is not nil, based on my understanding, we were not able to fetch any resources. Should we be returning the error from here itself?

Author (jannfis):
Hm. In my thought process, it is not an error condition for processIncomingResourceRequest itself when we could not fetch the resource from the local cluster. We instead need to let the other side know that resource fetching failed, and the reason (e.g. resource not found or access forbidden).

Collaborator:
That makes sense. Continuing with the same scenario, would the JSON marshaling (line 129 below) be considered more of an internal error, or should we be returning that to the caller as well? My thought is that we might lose the response in scenarios where JSON marshaling fails.

Author (jannfis):
That's a good question, and one that could be asked about any error condition in that particular method.

For now, I think the answer is: we want to transport any status codes from the communication between the agent and the Kubernetes API back to the principal, but nothing else. This is because the principal can then pass that status code on to the consumer of the resource proxy.

We could go ahead and send HTTP 500 status codes back to the principal whenever something goes wrong in processIncomingResourceRequest, then the question is, should we? I have not yet found a good answer to that question. The principal will notice that no reply has been sent from the agent anyway, and will let its consumer know.

cmd/cmdutil/term.go Outdated Show resolved Hide resolved
@jannfis (Collaborator, Author) commented Jan 23, 2025

@jgwest I have not yet thought about e2e tests, tbh. Agreed that we need to have some. I will work on them together with the corresponding CLI PR that's in the queue.
