run go generate on CI #49

marten-seemann · 2021-07-12T22:12:14Z

Fixes #48.

marten-seemann · 2021-07-12T22:39:59Z

CI fails because #46 is not yet merged.

.github/workflows/go-generate.yml

mvdan · 2021-07-14T09:35:18Z

I'd be fully on board with this if code generation was deterministic... but it isn't - if just a few days or a week go by, there's a good chance the generated code will change. Plus, re-generation requires manual review, so I'm not sure that automation avoids human interaction either (see #50).

We could make the generator fully deterministic across time by pinning a hash of the CSV, but then we just move the problem elsewhere - now the manual step is to update the SHA every few weeks or months :)

I think there's a manual step every few months no matter what we do. Periodic checks/reminders could be useful, like a cron job - I'd personally prefer that over making commit CI builds start failing by no fault of their own.

masih · 2021-07-14T09:41:41Z

I'd be fully on board with this if code generation was deterministic... but it isn't - if just a few days or a week go by, there's a good chance the generated code will change. Plus, re-generation requires manual review, so I'm not sure that automation avoids human interaction either (see #50).

We could make the generator fully deterministic across time by pinning a hash of the CSV, but then we just move the problem elsewhere - now the manual step is to update the SHA every few weeks or months :)

I think there's a manual step every few months no matter what we do. Periodic checks/reminders could be useful, like a cron job - I'd personally prefer that over making commit CI builds start failing by no fault of their own.

I am curious why the code generation is non-deterministic?

mvdan · 2021-07-14T09:43:46Z

Because it uses the multicodec CSV as input from its HEAD branch, and that CSV table is updated from time to time.

mvdan · 2021-07-14T09:46:10Z

Thinking outloud, in an ideal world this Go code generation would be in the same repo as where the CSV is, then we wouldn't have this complexity with separately-moving HEADs. The output could be a module in a sub-directory in the same root repository, for example.

masih · 2021-07-14T09:46:45Z

Because it uses the multicodec CSV as input from its HEAD branch, and that CSV table is updated from time to time.

OK so when I hear non-deterministic; I think: Given the same CSV file would the generator generate the same code? And I think this is a "Yes". I'd then argue code generator is deterministic and should be automated.

Want to manually release? fine. But I don't think a human looking at a code generated by running go generate adds much tbh.

masih · 2021-07-14T09:48:18Z

Thinking outloud, in an ideal world this Go code generation would be in the same repo as where the CSV is, then we wouldn't have this complexity with separately-moving HEADs. The output could be a module in a sub-directory in the same root repository, for example.

A note on HEAD: If a motivation to do things manually is "control" then the generator could be triggered when there is a tag on the repo that contains the CSV. That way we have two manual control points: first is manual tagging of CSV file, and second manual release of code generated automatically.

marten-seemann · 2021-07-14T14:34:23Z

Because it uses the multicodec CSV as input from its HEAD branch, and that CSV table is updated from time to time.

Would it make sense to use a Git submodule here? That way things would be more predictable.

mvdan · 2021-07-14T16:50:08Z

OK so when I hear non-deterministic; I think: Given the same CSV file would the generator generate the same code?

At least to me, deterministic means "go generate on the same commit will give the same result when I run it many times, be it today or next week or next year".

Would it make sense to use a Git submodule here? That way things would be more predictable.

I mean, sure, it can't hurt - but it's akin to pinning a commit SHA in the URL like I mentioned before, so you simply move "regularly re-run go generate" to "regularly update the SHA and re-run go generate". Makes it more deterministic for sure, but it doesn't remove the manual review element.

mvdan · 2021-07-14T16:55:06Z

Maybe I misunderstood your last comment, actually. If you mean use a submodule to make go generate fully deterministic so CI can enforce it to be up to date, that sounds good to me. Then every few months, or as needed, we can update the submodule to pull in CSV changes and code review how that results in generated code changes.

marten-seemann · 2021-07-14T21:53:20Z

That's what I meant. We won't be able to automatically update the submodule (unless we want to write an Action that does that and then opens a PR, but that's probably overkill), but at least we can enforce that the version of the submodule and the generated code are consistent.

mvdan · 2021-07-14T21:59:59Z

Yup that sounds good. Happy to review a PR if you want to take a crack. If we don't fancy submodules, we could always swap the raw github URL to use a commit hash instead of HEAD.

marten-seemann · 2021-07-15T19:42:54Z

If I remember correctly, other repositories also use submodules to include the multicodec repo. That's why we introduced the recursive submodule checkout in the Unified CI Setup, btw.

I added the submodule and changed the code generator to read the file from there.

mvdan

Nice!

marten-seemann requested a review from Stebalien July 12, 2021 22:12

marten-seemann mentioned this pull request Jul 12, 2021

consider running go generate protocol/.github#123

Closed

Stebalien approved these changes Jul 12, 2021

View reviewed changes

masih reviewed Jul 13, 2021

View reviewed changes

.github/workflows/go-generate.yml Show resolved Hide resolved

mvdan mentioned this pull request Jul 14, 2021

Automate code generation via CI #47

Closed

marten-seemann added 2 commits July 15, 2021 21:38

run go generate on CI

9daa054

add multiformats/multicodec as a git submodule

c274c8d

marten-seemann force-pushed the go-generate-ci branch from 8cb726d to c274c8d Compare July 15, 2021 19:39

marten-seemann requested a review from mvdan July 15, 2021 19:43

mvdan approved these changes Jul 16, 2021

View reviewed changes

mvdan merged commit 14e1238 into master Jul 16, 2021

mvdan mentioned this pull request Aug 9, 2021

Run codec-fixtures tests via Actions ipld/go-ipld-prime#218

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

run go generate on CI #49

run go generate on CI #49

marten-seemann commented Jul 12, 2021

marten-seemann commented Jul 12, 2021

mvdan commented Jul 14, 2021

masih commented Jul 14, 2021

mvdan commented Jul 14, 2021

mvdan commented Jul 14, 2021

masih commented Jul 14, 2021

masih commented Jul 14, 2021 •

edited

Loading

marten-seemann commented Jul 14, 2021

mvdan commented Jul 14, 2021

mvdan commented Jul 14, 2021

marten-seemann commented Jul 14, 2021

mvdan commented Jul 14, 2021

marten-seemann commented Jul 15, 2021

mvdan left a comment

run go generate on CI #49

run go generate on CI #49

Conversation

marten-seemann commented Jul 12, 2021

marten-seemann commented Jul 12, 2021

mvdan commented Jul 14, 2021

masih commented Jul 14, 2021

mvdan commented Jul 14, 2021

mvdan commented Jul 14, 2021

masih commented Jul 14, 2021

masih commented Jul 14, 2021 • edited Loading

marten-seemann commented Jul 14, 2021

mvdan commented Jul 14, 2021

mvdan commented Jul 14, 2021

marten-seemann commented Jul 14, 2021

mvdan commented Jul 14, 2021

marten-seemann commented Jul 15, 2021

mvdan left a comment

Choose a reason for hiding this comment

masih commented Jul 14, 2021 •

edited

Loading