[RFC 0095] Enable doCheck by default #95

gytis-ivaskevicius · 2021-06-25T13:37:55Z

Rendered

rfcs/0095-enable-docheck-by-default.md

nh2 · 2021-06-25T15:00:54Z

Tangential, but maybe worth asking:

Would it be possible/sensible to separate build and test phases into separate derivations, or something of that kind?

Sometimes, running the tests requires some overrides (e.g. LD_LIBRARY_PATH) on the checkPhase only, requiring developer iteration on that phase only. Currently, this requires a full rebuild (e.g. 1-hour rebuild for large software like ceph).

rfcs/0095-enable-docheck-by-default.md

edolstra · 2021-06-25T15:09:05Z

I think this is high cost, low benefit. Most upstream test suites don't test packaging bugs, so as long as upstream runs their own test suite before release, enabling doCheck by default won't reveal a lot of bugs for us. But the cost is very high: many test suites are very slow (sometimes taking hours), pull in lots of additional dependencies, and are brittle (so we'll get a lot of random test failures that we need to debug and patch).

Ericson2314 · 2021-06-25T20:00:10Z

@edolstra I would say all those bad test sweets should just be marked doCheck = false. I don't see this PR as us committing to bailing out upstream packages in new significant ways, just maintaining a blacklist rather than a whitelist.

I think that is nonetheless a big improvement because it encourages maintainers to at least try the test suite, and the opt-outs will come with useful comments saying what's wrong. Blacklists are also better to indicate to upstream devs that something is wrong --- which is different and easier than actually fixing the issue ourselves!

nh2 · 2021-06-25T21:03:13Z

As somebody who uses NixOS as production infrastructructure with user data that must not be lost, I would very welcome a change to tests-by-default, with problematic tests turned off where needed.

Zimmi48 · 2021-06-26T07:40:24Z

I agree that running test-suites by default would help detect new packaging errors, and therefore would be useful. However, a significant drawback of the proposed approach that this RFC doesn't acknowledge is the one that I reported in #55648 and https://discourse.nixos.org/t/behavior-of-undeclared-broken-packages-is-confusing/2123.

The only clear solution to the above problem is an always green hydra (though it doesn't solve the issue of derivations that are not supposed to be built in hydra).

Furthermore, when it is possible to test packages in separate derivations (cf. passthru.tests documented at https://nixos.org/manual/nixpkgs/stable/#var-meta-tests), then we should encourage this over doCheck = true (though it's harder to do in practice since test-suites rarely support this use case).

Co-authored-by: Niklas Hambüchen <[email protected]>

gytis-ivaskevicius · 2021-06-26T14:50:17Z

I was thinking about this - maybe one middle ground that is worth considering is running checks only during PR review stage.

zimbatm · 2021-06-27T14:02:15Z

This proposal appeals to my "let's make everything strict" side. At the same time, I don't remember a single time where unit tests uncovered an issue with the packaging. Usually, the issue is something related to the runtime, which isn't exercised by most tests.

Are there examples out there where doCheck = true would have saved us from runtime issues?

nh2 · 2021-06-27T16:06:37Z

Are there examples out there where doCheck = true would have saved us from runtime issues?

I am quite sure that I've seen such cases in Ceph and GlusterFS builds, where the upstram test suites uncovered missing runtime dependencies (e.g. Python libs) or lack of wrapping like LD_LIBRARY_PATH.

I didn't write them down at the time though, as at the time I wasn't aware that doCheck = true isn't the default.

kevincox · 2021-06-27T16:40:27Z

Missing runtime deps is a common problem that could be caught. I think that default-enable is the right approach. As long as disabling the checks are simple and not frowned upon, lots of software has fast and helpful test suites that wkll catch some common issues.

gytis-ivaskevicius · 2021-06-30T09:27:46Z

I updated the section on doCheck semantics and it seems that everyone has stated their opinions. The majority seems to like the idea but "cost vs value" is still in question.

Do you guys think that we can take this RFC to the next stages?

Ericson2314 · 2021-07-01T19:33:01Z

I nominate myself.

gytis-ivaskevicius · 2021-07-01T19:35:43Z

Nice going @Ericson2314 🚀
Who else would like to join in?? @nh2 @oxij thughts?

michaelpj · 2021-07-08T12:12:48Z

Would it be possible/sensible to separate build and test phases into separate derivations, or something of that kind?

Somewhat tangential, but haskell.nix does this. It's designed to focus on projects under development, though, where you really do want to run your tests in your CI! Having them in separate derivations is however extremely helpful in preventing slow tests from clogging the overall build progress. So I would say that this has proven quite valuable if you do want to run tests.

nh2 · 2021-07-08T13:15:02Z

I'm not very experienced in the RFC discussion / calls process yet, so I don't want to be in a position of leadership for it, but I'd certainly like to participate in such things to get into it.

piegamesde · 2021-07-08T14:13:43Z

Would it be possible/sensible to separate build and test phases into separate derivations, or something of that kind?

As far as I can tell, this would also allow us to better separate the build and the check process. For example, at the moment the only way to build packages without also running the tests is to use overrides, which gets messy quickly.

Mic92 · 2021-07-22T13:14:26Z

We need at least one more shepherd for this RFC.

Co-authored-by: Jörg Thalheim <[email protected]>

spacekookie · 2021-09-09T13:08:26Z

Shepherds for this RFC are @Ericson2314, @nh2, and @edolstra

@Ericson2314 would you mind being the leader?

Mic92 · 2021-09-09T13:18:29Z

@gytis-ivaskevicius please also update the yaml frontmatter to incorporate the shepherds.

rfcs/0095-enable-docheck-by-default.md

AndersonTorres · 2021-09-26T21:26:17Z

A thing about language normalization:

If I remember well, every phase can be turned on or off by setting attributes such as dontConfigure to true or false.

However, some of these attrs have name dontX and others have doX.

How sould we normalize them? Putting all as dontX? and using false as default?

Ericson2314 · 2021-10-14T15:38:51Z

Just so the rest of world knows, I made a matrix channel with the author and shepherds to get the ball rolling.

Mic92 · 2021-11-03T09:19:43Z

@Ericson2314 have you tried scheduling a meeting already?

gytis-ivaskevicius · 2021-11-03T12:43:19Z

Yeah, we tried scheduling a meeting and it was unsuccessful. Will try to align the meeting with everyone later today.

nh2 · 2021-11-11T15:04:58Z

Meeting happening right now

edolstra · 2021-11-11T16:41:22Z

Meeting summary:

While tests are useful, enabling them has downsides:
- Increased build times.
- More non-deterministic build failures.
- Extra dependencies for the test framework.
- Upstream tests don't often reveal downstream packaging/integration issues, because most are functional tests that are unlikely to break.
- Upstream tests typically run against the build tree rather than the installed package, so they don't actually test what we're interested in (namely whether the installed package works).
If enabling doCheck globally is too expensive, there are some ideas for running tests anyway:
- Let ofborg build pkg.override { doCheck = true; }. That way our CI runs tests but users who build from source don't have to.
- Have more .passthru.test derivations to test installed packages.
- Split tests into separate derivations, e.g. by saving the build tree into a separate output and running the test from there. This would be quite expensive for Hydra in terms of storage space, since build trees are large.
Action items:
- Add guidelines to the RFC (for inclusion in the Nixpkgs docs) on when doCheck should be enabled, e.g. 1) the tests shouldn't take "too long"; 2) the tests shouldn't require (big) additional dependencies; 3) the tests shouldn't be flaky.
- Set up a nixpkgs branch to test the cost of enabling doCheck globally.
- Create a sprintable project (like ZHF) for setting explicit doCheck = true|false attributes on as many packages as possible.
There was some parenthetical discussion on whether enableParallelBuilding should be enabled by default, but that's a topic for another RFC.

AndersonTorres · 2021-11-12T00:19:16Z

Split tests into separate derivations, e.g. by saving the build tree into a separate output and running the test from there. This would be quite expensive for Hydra in terms of storage space, since build trees are large.

Isn't it possible to reuse/cache such build trees when enabling tests?

rfcs/0095-enable-docheck-by-default.md

gytis-ivaskevicius · 2021-11-18T17:41:44Z

rfcs/0095-enable-docheck-by-default.md

+- By default `doCheck` option should be enabled as long as `stdenv.hostPlatform == stdenv.buildPlatform`.
+- Non-reproducible test prevention should be implemented.
+- All failing packages should be fixed or updated with `doCheck = false;`
+


Suggested change

Guidelines to refrain from enabling tests:

- If tests are taking _too_ long. (Tests aren't expected to run longer than build time. In case of quick builds - not more than 10min)

- If tests require additional large dependencies.

- If tests are flaky. (If tests randomly fail once in a while)

Guys, sounds good?

Guidelines to refrain from enabling tests

P.S.:

s|If|When|

When tests are flaky, unpredictably failing.

gytis-ivaskevicius · 2021-11-18T17:50:49Z

rfcs/0095-enable-docheck-by-default.md

+**New `doCheck`/`doInstallCheck` semantics:**
+In addition to booleans, `doCheck`/`doInstallCheck` should also accept strings.
+- String value should be considered as `false`
+- It should be used as a place for comment on why the check is disabled. For
+  example: "Requires X11 server" or "Requires network access".


Suggested change

**New `doCheck`/`doInstallCheck` semantics:**

In addition to booleans, `doCheck`/`doInstallCheck` should also accept strings.

- String value should be considered as `false`

- It should be used as a place for comment on why the check is disabled. For

example: "Requires X11 server" or "Requires network access".

**New semantics:**

- `doCheck`/`doInstallCheck` should default to `null` and work exactly the same as `false`

- New options should be introduced: `meta.{checksFlaky,checksLargeDependencies,checksTakeTooLong,checksDisableReason}`

The first point is so we could evaluate checks that are not set. Derivation paths are not expected to change.
And the second point is so we could evaluate the reason why checks were disabled. Maybe defining additional attrset would be nice? like checks.{flaky,largeDependencies,takeTooLong,disableReason}? Any thoughts?

A comment on the source code and a Boolean value shoud suffice. There is no reason to overload this.

During RFC call, we thought that it would be a very simple change which would allow us to actually evaluate the reason why tests were disabled and that is definitely handy. I am still in favor of it

That being said, it is preferrable a (short?) string, without true/false semantics attached to it.
After all, it can be useful to use the string to convey a useful information even in the case the tests are mandatory.

Something like

doCheck = false; checkReason = "I don't wanna fetch X.Org just to verify a blinking box!";

Or even better, a new attrset:

check.enable = true; check.reason = "This package is critical for bootstrap";

gytis-ivaskevicius · 2022-01-14T14:18:11Z

Sorry fellow nix'ers, but these days I don't have time for pretty much anything which is why I am closing this RFC. If anyone wishes to take over this RFC - feel free to do so. Also for the record - it seems that #119 addresses most things that I am interested in

[RFC 0095] Enable doCheck by default

cc112ac

gytis-ivaskevicius mentioned this pull request Jun 25, 2021

Run check phase by default NixOS/nixpkgs#33599

Closed

nh2 reviewed Jun 25, 2021

View reviewed changes

rfcs/0095-enable-docheck-by-default.md Outdated Show resolved Hide resolved

nh2 reviewed Jun 25, 2021

View reviewed changes

rfcs/0095-enable-docheck-by-default.md Outdated Show resolved Hide resolved

nh2 reviewed Jun 25, 2021

View reviewed changes

rfcs/0095-enable-docheck-by-default.md Outdated Show resolved Hide resolved

rfcs/0095-enable-docheck-by-default.md Outdated Show resolved Hide resolved

rfcs/0095-enable-docheck-by-default.md Show resolved Hide resolved

rfcs/0095-enable-docheck-by-default.md Outdated Show resolved Hide resolved

edolstra reviewed Jun 25, 2021

View reviewed changes

rfcs/0095-enable-docheck-by-default.md Outdated Show resolved Hide resolved

gytis-ivaskevicius and others added 5 commits June 26, 2021 17:11

[RFC 0095] Pin commit so that the lines don't move around

8c47ee8

Co-authored-by: Niklas Hambüchen <[email protected]>

[RFC 0095] Remove unecessary space

ad5adb6

Co-authored-by: Niklas Hambüchen <[email protected]>

[RFC 0095] 'if' -> 'If'

864572c

Co-authored-by: Niklas Hambüchen <[email protected]>

[RFC 0095] Add code blocks around values

af3d757

Co-authored-by: Niklas Hambüchen <[email protected]>

[RFC 0095] Add code block around 'checkPhase'

b8924de

Co-authored-by: Niklas Hambüchen <[email protected]>

[RFC 0095] update 'doCheck'/'doInstallCheck' semantics

918edff

spacekookie added the status: open for nominations Open for shepherding team nominations label Jul 8, 2021

spacekookie mentioned this pull request Jul 8, 2021

Meeting 2021-07-08 NixOS/rfc-steering-committee#66

Closed

17 tasks

Update rfcs/0095-enable-docheck-by-default.md

5980692

Co-authored-by: Jörg Thalheim <[email protected]>

spacekookie mentioned this pull request Sep 9, 2021

Meeting 2021-09-09 NixOS/rfc-steering-committee#69

Closed

20 tasks

gytis-ivaskevicius commented Sep 14, 2021

View reviewed changes

rfcs/0095-enable-docheck-by-default.md Outdated Show resolved Hide resolved

Update rfcs/0095-enable-docheck-by-default.md

680344d

This was referenced Oct 7, 2021

Meeting 2021-09-23 NixOS/rfc-steering-committee#73

Closed

Meeting 2021-10-07 NixOS/rfc-steering-committee#75

Closed

Mic92 mentioned this pull request Nov 3, 2021

Meeting 2021-11-03 NixOS/rfc-steering-committee#76

Closed

25 tasks

Ericson2314 mentioned this pull request Nov 11, 2021

[RFC 0092] Computed derivations #92

Merged

gytis-ivaskevicius commented Nov 15, 2021

View reviewed changes

rfcs/0095-enable-docheck-by-default.md Outdated Show resolved Hide resolved

Update drawbacks

4b75420

gytis-ivaskevicius commented Nov 15, 2021

View reviewed changes

rfcs/0095-enable-docheck-by-default.md Outdated Show resolved Hide resolved

Update alternatives

a39020b

spacekookie mentioned this pull request Nov 17, 2021

Meeting 2021-11-17 NixOS/rfc-steering-committee#77

Closed

25 tasks

gytis-ivaskevicius commented Nov 18, 2021

View reviewed changes

lheckemann mentioned this pull request Dec 1, 2021

Meeting 2021-12-01 NixOS/rfc-steering-committee#78

Closed

26 tasks

kloenk mentioned this pull request Dec 15, 2021

Meeting 2021-12-15 NixOS/rfc-steering-committee#79

Closed

25 tasks

edolstra mentioned this pull request Jan 12, 2022

Meeting 2022-01-12 NixOS/rfc-steering-committee#80

Closed

25 tasks

gytis-ivaskevicius closed this Jan 14, 2022

amyipdev mentioned this pull request Jun 4, 2024

pwalarmctl: init at 0.1.0 NixOS/nixpkgs#316279

Merged

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RFC 0095] Enable doCheck by default #95

[RFC 0095] Enable doCheck by default #95

gytis-ivaskevicius commented Jun 25, 2021

nh2 commented Jun 25, 2021

edolstra commented Jun 25, 2021

Ericson2314 commented Jun 25, 2021

nh2 commented Jun 25, 2021

Zimmi48 commented Jun 26, 2021

gytis-ivaskevicius commented Jun 26, 2021 •

edited

Loading

zimbatm commented Jun 27, 2021

nh2 commented Jun 27, 2021

kevincox commented Jun 27, 2021

gytis-ivaskevicius commented Jun 30, 2021

Ericson2314 commented Jul 1, 2021

gytis-ivaskevicius commented Jul 1, 2021

michaelpj commented Jul 8, 2021

nh2 commented Jul 8, 2021

piegamesde commented Jul 8, 2021

Mic92 commented Jul 22, 2021

spacekookie commented Sep 9, 2021

Mic92 commented Sep 9, 2021

AndersonTorres commented Sep 26, 2021

Ericson2314 commented Oct 14, 2021

Mic92 commented Nov 3, 2021

gytis-ivaskevicius commented Nov 3, 2021

nh2 commented Nov 11, 2021

edolstra commented Nov 11, 2021

AndersonTorres commented Nov 12, 2021

gytis-ivaskevicius Nov 18, 2021 •

edited

Loading

gytis-ivaskevicius Nov 18, 2021

AndersonTorres Nov 18, 2021

AndersonTorres Nov 19, 2021

gytis-ivaskevicius Nov 18, 2021

AndersonTorres Nov 18, 2021 •

edited

Loading

gytis-ivaskevicius Nov 19, 2021

AndersonTorres Nov 19, 2021

gytis-ivaskevicius commented Jan 14, 2022

+Guidelines to refrain from enabling tests:
+- If tests are taking _too_ long. (Tests aren't expected to run longer than build time. In case of quick builds - not more than 10min)
+- If tests require additional large dependencies.
+- If tests are flaky. (If tests randomly fail once in a while)

[RFC 0095] Enable doCheck by default #95

[RFC 0095] Enable doCheck by default #95

Conversation

gytis-ivaskevicius commented Jun 25, 2021

nh2 commented Jun 25, 2021

edolstra commented Jun 25, 2021

Ericson2314 commented Jun 25, 2021

nh2 commented Jun 25, 2021

Zimmi48 commented Jun 26, 2021

gytis-ivaskevicius commented Jun 26, 2021 • edited Loading

zimbatm commented Jun 27, 2021

nh2 commented Jun 27, 2021

kevincox commented Jun 27, 2021

gytis-ivaskevicius commented Jun 30, 2021

Ericson2314 commented Jul 1, 2021

gytis-ivaskevicius commented Jul 1, 2021

michaelpj commented Jul 8, 2021

nh2 commented Jul 8, 2021

piegamesde commented Jul 8, 2021

Mic92 commented Jul 22, 2021

spacekookie commented Sep 9, 2021

Mic92 commented Sep 9, 2021

AndersonTorres commented Sep 26, 2021

Ericson2314 commented Oct 14, 2021

Mic92 commented Nov 3, 2021

gytis-ivaskevicius commented Nov 3, 2021

nh2 commented Nov 11, 2021

edolstra commented Nov 11, 2021

AndersonTorres commented Nov 12, 2021

gytis-ivaskevicius Nov 18, 2021 • edited Loading

Choose a reason for hiding this comment

gytis-ivaskevicius Nov 18, 2021

Choose a reason for hiding this comment

AndersonTorres Nov 18, 2021

Choose a reason for hiding this comment

AndersonTorres Nov 19, 2021

Choose a reason for hiding this comment

gytis-ivaskevicius Nov 18, 2021

Choose a reason for hiding this comment

AndersonTorres Nov 18, 2021 • edited Loading

Choose a reason for hiding this comment

gytis-ivaskevicius Nov 19, 2021

Choose a reason for hiding this comment

AndersonTorres Nov 19, 2021

Choose a reason for hiding this comment

gytis-ivaskevicius commented Jan 14, 2022

gytis-ivaskevicius commented Jun 26, 2021 •

edited

Loading

gytis-ivaskevicius Nov 18, 2021 •

edited

Loading

AndersonTorres Nov 18, 2021 •

edited

Loading