RFC Resource Tracing #634

scothis · 2022-02-16T23:49:15Z

Changes proposed by this PR

closes #

Release Note

PR Checklist

Note: Please do not remove items. Mark items as done [x] or use ~~strikethrough~~ if you believe they are not relevant

Linked to a relevant issue. Eg: Fixes #123 or Updates #123
Removed non-atomic or wip commits
Filled in the Release Note section above
Modified the docs to match changes

Signed-off-by: Scott Andrews <[email protected]>

netlify · 2022-02-16T23:49:21Z

✔️ Deploy Preview for elated-stonebraker-105904 canceled.

🔨 Explore the source changes: 955261d

🔍 Inspect the deploy log: https://app.netlify.com/sites/elated-stonebraker-105904/deploys/621e562e702a29000816df4f

martyspiewak · 2022-02-17T18:20:49Z

I think this is great!

In terms of observedGeneration, it feels a bit odd for it to refer to the generation of the stamped object, but not be nested under the stampedRef. I also wonder what the intended use of this field is? You mentioned that lastTransitionTime could be useful for seeing that the supply chain is progressing but I'm not sure I see what observedGeneration is useful for.

cirocosta · 2022-02-17T19:45:07Z

rfc/rfc-0000-supply-chain-tracing.md

+- `.status.resources[*].stampedRef` object reference to the Kubernetes resource created by Cartographer for this resource.
+- `.status.resources[*].inputs[*]` inputs model the relationships between resources within the SupplyChain graph. The value of an input can be read from the referenced resource's outputs.
+- `.status.resources[*].inputs[*].name` the name of the resource backing this input. 
+- `.status.resources[*].outputs[*].name` the name of the output. Output names are fixed and defined by the template type.


made me wonder if it'd be more clear if we instead named the field resource: (for inputs)

~~and maybe type: (for output), so that we're very explicit about it 🤔~~ (nvm, for outputs)

~~(or perhaps even resourceName to really leave no room for other interpretations?)~~

or ... what if we even removed the plural here (outputs) and went solely with output? a resource can only export either one (cluster(source|image|config)template) or no output (clustertemplate)), and the type can already be inferred by looking up the templateref (.status.resources[*].templateRef.kind)

nvm, each entry is a field of the output

or change inputs to inputResources?

I waffled a bit while drafting the RFCs whether .inputs[*].name should be .inputs[*].resource since the value is from the resource field within the ResourceReference struct. I ended up going with name here because it's the dominate part of the relationship and I was viewing this as a local object reference like resource.

a resource can only export either one (cluster(source|image|config)template) or no output (clustertemplate))

We certainly could make the output a mux. I didn't because I think it makes it harder for a generic tool to consume as it would need to use a switch to find the active bit of the config. This is a loosely held opinion.

Signed-off-by: Scott Andrews <[email protected]>

martyspiewak · 2022-02-17T19:56:08Z

@emmjohnson and I took a stab at spiking this out to see what it might look like. It's on this branch, if anyone is interested in taking a look.

scothis · 2022-02-17T19:56:18Z

In terms of observedGeneration, it feels a bit odd for it to refer to the generation of the stamped object, but not be nested under the stampedRef. I also wonder what the intended use of this field is? You mentioned that lastTransitionTime could be useful for seeing that the supply chain is progressing but I'm not sure I see what observedGeneration is useful for.

All fair comments. I was trying to preserve a bit of the behavior from RFC 18 that included the ResourceVersion. I don't have a specific use case in mind. If we don't collectively have a use, I'll remove it.

rfc/rfc-0000-supply-chain-tracing.md

waciumawanjohi · 2022-02-17T20:35:16Z

I can imagine uses of the lastTransitionTime, think it's a great addition.

I don't think the observedGeneration is actionable. Even more, I think it would be often misinterpreted. I would expect users to infer far too much, e.g. that when the outputs report observedGeneration X then they are the result of the inputs of generation X (which is not necessarily true).

In practice, the observedGeneration on its own just lets me know that the object's controller is 'on'. Even then, without exposing the .metadata.generation the observedGeneration is simply a monotonic counter. Vote to remove the field.

waciumawanjohi · 2022-02-17T20:36:27Z

rfc/rfc-0000-supply-chain-tracing.md

+- [First draft of RFC 0014](https://github.com/vmware-tanzu/cartographer/pull/274)
+- [Introduce RFC 18 Workload Report Artifact Provenance](https://github.com/vmware-tanzu/cartographer/pull/519)
+
+# Unresolved Questions


To point out a possible risk:

We have another RFC on tracing that would duplicate much of this information if adopted. The use cases for that RFC remain and I haven't heard progress on alternate work that would need to be done at the level of all of the choreographed components of a supply chain if that RFC were to be rejected. E.g. at the moment I would predict that we're going to move forward with some version of that RFC.

An important note about tracing is that it is artifact centric rather than resource centric. E.g. wherein this proposal reports 1 artifact per resource, there is good reason for the tracing RFC to report more than 1 artifact for each resource.

If we adopt this RFC and later adopt the tracing RFC we'll need to decide:

if we want to have two longish fields on the workload that have much of the same information, or

if we want to make a breaking change and alter what's proposed here.

Please add a link to the RFC and ideally the parts of the RFC which you feel are relevant

waciumawanjohi · 2022-02-17T21:06:09Z

Random request:
Can the title of this RFC change? We've been using the term tracing to discuss following artifact A as it propagates through a supply chain and creates downstream changes. This RFC does not attempt to achieve that.

Possible names:

Resource reporting
Exposing stamped resources to workload owners
...

Signed-off-by: Scott Andrews <[email protected]>

rfc/rfc-0000-supply-chain-tracing.md

scothis · 2022-02-22T16:53:27Z

Can the title of this RFC change?

sure

We've been using the term tracing to discuss following artifact A as it propagates through a supply chain and creates downstream changes. This RFC does not attempt to achieve that.

I consider both of these to be tracing, but there's a difference in what is being traced. In this case what we're tracing is the supply chain defined steps for a workload. I agree that it's not artifact level tracing.

Signed-off-by: Scott Andrews <[email protected]> Co-authored-by: Marty Spiewak <[email protected]>

scothis · 2022-02-22T17:03:53Z

rfc/rfc-0000-supply-chain-tracing.md

+    - name: source-provider
+    outputs:
+    - name: image
+      value: registry.example/supply-chain/my-workload@sha256:68f8e8fc6e8ede7a411db9182cd695eac7b3e7e19e4ff9dcb9ba21205c135697


On RFC 18 @evankanderson raised a valid concern about the potential size of outputs causing issues.

#519 (comment)

There is also a 1MB limit on the size of a single resource, and the values might be repeated twice due to the addition of managedFields. I'd be careful about including large other documents directly for that reason. It seems like it would be better to include a resourceVersion or generation field referencing the resource's version, rather than the entire contents.

We could switch including the value to use a hash of the value. Clients would be required to fetch the stamped resource in order to resolve the actual value.

Switched to a digest and path instead of a value for outputs.

I definitely understand the argument about the config being too big, but I think capturing the value here is important. A lot of our modelling assumes that the workload is the only object that a developer needs to care about, now we're saying that they need to go and make these other associations to other objects.

I think if we don't have value anymore then we'd have to remove lastTransitionTime as we won't be storing the old value so we won't know if it's changed. It'd also be misleading as to what lastTransitionTime- it could be taken to mean the last time the output path changed.

@cirocosta how do we know what value needs to be masked?

I can update the output digest/value/preview to be whatever people collectively want.

There are two different dimensions of risk/customer impact to consider with the decision.

[confidentiality] "leaking" outputs that an org views as sensitive

[availability] a supply chain stops working in production, with an unclear path to recovery

field confidentiality risk availability risk

value medium high

preview medium low

digest low low

Can I get some thumbs up/thumbs down on moving forward with the following:

Add a preview field with which holds the first 200 bytes of each value.

@scothis After today's chat, I'm pretty comfortable with adding the preview field with <1000 bytes (200 seems fine to me). If you aren't opposed, would you mind adding it?

For `.status.resources[*].outputs[*]`, replace `value` with `digest` and `path` fields. This addresses two concerns: 1. the values may be too large, and exceed the size limit for a k8s resource 2. the values may be sensitive Now a consumer will need to resolve the raw value from the stamped resource using the output path specified. Signed-off-by: Scott Andrews <[email protected]>

cirocosta · 2022-02-22T20:06:35Z

rfc/rfc-0000-supply-chain-tracing.md

+- `.status.resources[*].inputs[*]` inputs model the relationships between resources within the SupplyChain graph. The value of an input can be read from the stamped resource based on the outputs for the referenced resource.
+- `.status.resources[*].inputs[*].name` the name of the resource backing this input. 
+- `.status.resources[*].outputs[*].name` the name of the output. Output names are fixed and defined by the template type.
+- `.status.resources[*].outputs[*].digest` sha256sum of the raw output value.


I wonder if we need to be more specific here given how "sensitive" hashing is to any bit change, e.g., a javascript based client might end up interpreting the raw value differently from the controller due to a different json decoder? wondering that because we're effectively hashing the result of the jsonpath query (rather than straight byte stream)

I know you mentioned that we should try as much as possible to not expose potentially sensitive information in workload.status, but storage-wise, made me wonder if it'd be too much of a crazy/bad idea considering a custom apiserver implementation to work around the etcd storage limitation

IIRC, because you can back the endpoints you register against with ... anything, we could back it up with """something else""", although it's clearly a large complexity add

somewhat related: tektoncd/community#606

I wonder if we need to be more specific here given how "sensitive" hashing is to any bit change, e.g., a javascript based client might end up interpreting the raw value differently from the controller due to a different json decoder?

That's a good point, the digest will only be stable against a string or byte array value, structs are ambiguous. The intent isn't to be cryptographically secure, but to provide a check of the value so a client can be reasonably sure that it has the "correct" value and that the value hasn't changed between when the c

We can drop the digest field for now if you're not confident in it, and add it back once we have the semantics nailed.

made me wonder if it'd be too much of a crazy/bad idea considering a custom apiserver implementation to work around the etcd storage limitation

yes, crazy and bad :)

But on a serious note, there are other concerns that would be nice to have that will never end up on the Workload status. For example, a history of artifacts and how they interact in the supply chain. This kind of data should be captured into a different system that can support temporal (and other types of) queries. I wouldn't use that front an aggregated api, but to be used as a separate store.

I'm not sure how sensible this is, but it is possible to serialize with sorted keys in the go yaml package (unsure about the k8s one). That would at least produce repeatable digests.

We can also say that the digest is primarily for Cartographer's internal use (to generate the lastTransitionTime corectly).

Signed-off-by: Scott Andrews <[email protected]>

scothis · 2022-02-24T16:45:53Z

renamed to "Resource Tracing"

squeedee

I believe folks still expect something like the digest as a part of this but everything else is fine as is.

Signed-off-by: Scott Andrews <[email protected]>

…racing.md

RFC Supply Chain Tracing

097383a

Signed-off-by: Scott Andrews <[email protected]>

martyspiewak added the rfc Requests For Comment label Feb 17, 2022

cirocosta mentioned this pull request Feb 17, 2022

When a resource with multiple templates select a different template, then there are not orphan objects #592

Closed

cirocosta reviewed Feb 17, 2022

View reviewed changes

whitespace is hard

077bcea

Signed-off-by: Scott Andrews <[email protected]>

cirocosta reviewed Feb 17, 2022

View reviewed changes

rfc/rfc-0000-supply-chain-tracing.md Outdated Show resolved Hide resolved

cirocosta reviewed Feb 17, 2022

View reviewed changes

rfc/rfc-0000-supply-chain-tracing.md Outdated Show resolved Hide resolved

waciumawanjohi reviewed Feb 17, 2022

View reviewed changes

rm observedGeneration

e1b88e0

Signed-off-by: Scott Andrews <[email protected]>

martyspiewak reviewed Feb 18, 2022

View reviewed changes

rfc/rfc-0000-supply-chain-tracing.md Outdated Show resolved Hide resolved

jwntrs approved these changes Feb 19, 2022

View reviewed changes

martyspiewak approved these changes Feb 22, 2022

View reviewed changes

Update rfc/rfc-0000-supply-chain-tracing.md

a58ef21

Signed-off-by: Scott Andrews <[email protected]> Co-authored-by: Marty Spiewak <[email protected]>

scothis commented Feb 22, 2022

View reviewed changes

squeedee approved these changes Feb 22, 2022

View reviewed changes

cirocosta reviewed Feb 22, 2022

View reviewed changes

emmjohnson approved these changes Feb 24, 2022

View reviewed changes

scothis changed the title ~~RFC Supply Chain Tracing~~ RFC Supply Chain Resource Tracing Feb 24, 2022

Rename to Resource Tracing

1b2916e

Signed-off-by: Scott Andrews <[email protected]>

scothis changed the title ~~RFC Supply Chain Resource Tracing~~ RFC Resource Tracing Feb 24, 2022

squeedee approved these changes Feb 25, 2022

View reviewed changes

zrob mentioned this pull request Feb 25, 2022

Include supply chain resources in workload status #664

Closed

Output preview

53dd2a1

Signed-off-by: Scott Andrews <[email protected]>

scothis force-pushed the rfc-supply-chain-tracing branch from 6ae72f0 to 53dd2a1 Compare March 1, 2022 01:37

jwntrs requested review from sclevine, waciumawanjohi, cirocosta and mklanjsek March 1, 2022 14:20

cirocosta approved these changes Mar 1, 2022

View reviewed changes

sclevine approved these changes Mar 1, 2022

View reviewed changes

scothis marked this pull request as ready for review March 1, 2022 15:44

martyspiewak mentioned this pull request Mar 1, 2022

Resources in workload status #675

Merged

4 tasks

idoru approved these changes Mar 1, 2022

View reviewed changes

Update and rename rfc-0000-resource-tracing.md to rfc-0020-resource-t…

955261d

…racing.md

martyspiewak enabled auto-merge (squash) March 1, 2022 17:23

martyspiewak merged commit 4cf4554 into vmware-tanzu:main Mar 1, 2022

scothis deleted the rfc-supply-chain-tracing branch March 21, 2022 17:52

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RFC Resource Tracing #634

RFC Resource Tracing #634

scothis commented Feb 16, 2022 •

edited

Loading

netlify bot commented Feb 16, 2022 •

edited

Loading

martyspiewak commented Feb 17, 2022

cirocosta Feb 17, 2022 •

edited

Loading

cirocosta Feb 17, 2022 •

edited

Loading

cirocosta Feb 17, 2022 •

edited

Loading

scothis Feb 17, 2022

scothis Feb 17, 2022

martyspiewak commented Feb 17, 2022

scothis commented Feb 17, 2022

waciumawanjohi commented Feb 17, 2022

waciumawanjohi Feb 17, 2022

scothis Feb 17, 2022

waciumawanjohi commented Feb 17, 2022

scothis commented Feb 22, 2022 •

edited

Loading

scothis Feb 22, 2022

scothis Feb 22, 2022

scothis Feb 22, 2022

jwntrs Feb 22, 2022

martyspiewak Feb 23, 2022

scothis Feb 28, 2022

scothis Feb 28, 2022

jwntrs Feb 28, 2022

sclevine Feb 28, 2022

scothis Mar 1, 2022

cirocosta Feb 22, 2022 •

edited

Loading

cirocosta Feb 22, 2022 •

edited

Loading

scothis Feb 22, 2022 •

edited

Loading

squeedee Feb 25, 2022

scothis Feb 25, 2022

scothis commented Feb 24, 2022

squeedee left a comment

RFC Resource Tracing #634

RFC Resource Tracing #634

Conversation

scothis commented Feb 16, 2022 • edited Loading

Changes proposed by this PR

Release Note

PR Checklist

netlify bot commented Feb 16, 2022 • edited Loading

martyspiewak commented Feb 17, 2022

cirocosta Feb 17, 2022 • edited Loading

Choose a reason for hiding this comment

cirocosta Feb 17, 2022 • edited Loading

Choose a reason for hiding this comment

cirocosta Feb 17, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

martyspiewak commented Feb 17, 2022

scothis commented Feb 17, 2022

waciumawanjohi commented Feb 17, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

waciumawanjohi commented Feb 17, 2022

scothis commented Feb 22, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cirocosta Feb 22, 2022 • edited Loading

Choose a reason for hiding this comment

cirocosta Feb 22, 2022 • edited Loading

Choose a reason for hiding this comment

scothis Feb 22, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

scothis commented Feb 24, 2022

squeedee left a comment

Choose a reason for hiding this comment

scothis commented Feb 16, 2022 •

edited

Loading

netlify bot commented Feb 16, 2022 •

edited

Loading

cirocosta Feb 17, 2022 •

edited

Loading

cirocosta Feb 17, 2022 •

edited

Loading

cirocosta Feb 17, 2022 •

edited

Loading

scothis commented Feb 22, 2022 •

edited

Loading

cirocosta Feb 22, 2022 •

edited

Loading

cirocosta Feb 22, 2022 •

edited

Loading

scothis Feb 22, 2022 •

edited

Loading