Networkd containers #140669
base: master
Conversation
Thanks a lot for your comments! As soon as we start discussing the RFC in January and I know how much of this PR is actually useful, I'll start fixing these :)
Force-pushed from fed3c5e to 2fc5579
How would old things from /nix/var/nix/profiles/per-container/nixos/ and /var/lib/containers/ be migrated? I am not using the nixos-container script anymore because it is buggy, but the container was originally created with it. Hope the comments help you.
I implemented a VM test demonstrating how a migration should work: https://github.com/NixOS/nixpkgs/pull/140669/files#diff-86feebe7d88f2d7c0dd00d87e110566c6e8fcb98cefdc7a06f3478789ef55a79 The same principle applies to imperative containers.
Sometimes it is necessary to build a configuration within a `nix-build` for systemd units. While this is fairly easy for `.service` units (where overrides are simple to define), it's not possible for `systemd-nspawn(1)`. This is mostly a hack to get dedicated bind-mounts of store paths from `pkgs.closureInfo` into the configuration without IFD. In the long term we either want to fix this in systemd or find a better-suited solution.
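As a rough illustration of the kind of bind-mount this enables (a minimal sketch, not code from this PR; the container name `demo` and the mount target are made up):

```nix
{ pkgs, ... }:
let
  # closureInfo computes the runtime closure of the given root paths at
  # build time; reading its output during evaluation would be IFD, which
  # is what the hack described above works around.
  closure = pkgs.closureInfo { rootPaths = [ pkgs.hello ]; };
in {
  # "demo" and the mount target are placeholders.
  systemd.nspawn.demo.filesConfig = {
    BindReadOnly = [ "${closure}:/run/closure-info" ];
  };
}
```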
…s w/networkd

This is the first batch of changes for a new container module which is intended to replace the current `nixos-container` subsystem in the long term. The state in here is still strongly inspired by the `containers`[1] module: declarative nspawn instances are described with NixOS config for both the host and the container itself. For now, this module uses the tentative namespace `nixos.containers`, but that's subject to change.

This new module also contains the following key differences:

* Rather than writing a big abstraction layer on top, we'll rely on `.nspawn` units[2]. This has two benefits. First, we can stop adding options for each new nspawn feature (such as MACVLANs, ephemeral instances, etc.) because it can be written directly into the `.nspawn` unit using the module system, like

      systemd.nspawn.foo.filesConfig = { BindReadOnly = /* ... */ };

  Second, administrators don't need to learn much about our abstractions; they only need to know a few basics about the module system and how to write systemd units.

* This feature strictly enforces `systemd-networkd` on both the container and the host. It can be turned off for containers in the host namespace without a private network, though. The reason for this is that the current `nixos-container` implementation has the long-standing bug that the container's uplink is broken *until* the container has booted, because the host side of the veth pair is configured in `ExecStartPost=`[3]; there is no proper way to take care of it at an earlier stage, since `systemd-nspawn` creates the interface itself. One implication is that services inside the container wrongly assume they can reach e.g. an external database over the network (since `network{,-online}.target` was reached), even though they cannot due to the unconfigured host-side veth interface. When using `systemd-networkd(8)` on both sides this is no longer the case, since systemd will automatically configure the network correctly when an nspawn unit starts and `networkd` is active.

Apart from a basic draft, this also contains support for RFC 1918 IPv4 addresses configured via DHCP and ULA IPv6 addresses configured via SLAAC and `radvd(8)`, including support for ephemeral containers. Further additions such as a better config-activation mechanism and a tool to manage containers imperatively will follow.

[1] https://nixos.org/manual/nixos/stable/options.html#opt-containers
[2] https://www.freedesktop.org/software/systemd/man/systemd.nspawn.html#
[3] https://github.com/NixOS/nixpkgs/blob/8b0f315b7691adcee291b2ff139a1beed7c50d94/nixos/modules/virtualisation/nixos-containers.nix#L189-L240
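To make the networkd point concrete, here is a minimal sketch (not taken from this PR) of host-side configuration in the spirit of systemd's bundled `80-container-ve.network`: with networkd running, the `ve-*` interface that `systemd-nspawn` creates is picked up and configured as soon as the unit starts, instead of in an `ExecStartPost=` hook. The attribute name and exact settings are illustrative assumptions.

```nix
{
  # Run systemd-networkd on the host (this module enforces it anyway).
  networking.useNetworkd = true;

  # Match the host side of nspawn veth pairs and configure it immediately,
  # mirroring what systemd's stock 80-container-ve.network does.
  systemd.network.networks."80-container-ve" = {
    matchConfig = {
      Name = "ve-*";
      Driver = "veth";
    };
    networkConfig = {
      # Hand out an address to the container and NAT its traffic.
      DHCPServer = true;
      IPMasquerade = "both"; # use "yes" on older systemd versions
    };
  };
}
```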
This exposes a given `containerPort` on the host address. For example, if port 80 from the container is forwarded to the host's port 8080, the container uses `2001:DB8::42`, and the host side of the veth interface uses `2001:DB8::23`, then `[2001:DB8::42]:80` becomes reachable on the host as `[2001:DB8::23]:8080`.
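For comparison, `systemd-nspawn` ships its own port-forwarding mechanism via `Port=` in the `[Network]` section of an `.nspawn` unit; since this PR's forwarding covers IPv6 (which, as far as I know, nspawn's built-in forwarding does not), the PR presumably implements something different. The sketch below only shows the stock facility, with a made-up container name:

```nix
{
  # "web" is a placeholder container name. Port= only applies when private
  # networking is used; nspawn's built-in forwarding covers IPv4 only.
  systemd.nspawn.web.networkConfig = {
    Port = "tcp:8080:80"; # host port 8080 -> container port 80
  };
}
```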
This change tests various combinations of static & dynamic addressing and also fixes a bug where `radvd(8)` was erroneously configured for veth pairs where it's actually not needed. This test is also supposed to show how to use `systemd` configuration to implement most of the features (for instance, there's no custom set of options to implement MACVLANs) and serves as a regression test for future `systemd` updates in NixOS. Please note that the `ndppd` hack is only here because QEMU doesn't do proper IPv6 neighbour resolution. In fact, I left comments wherever workarounds were needed for the testing facility.
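As an example of the "no custom options" approach mentioned above, a MACVLAN can be requested directly through the `.nspawn` unit's `[Network]` section (a minimal sketch; the container name `mycontainer` and host interface `eno1` are placeholders):

```nix
{
  # Create a MACVLAN interface on top of the host's eno1 and move it into
  # the container; no dedicated NixOS option is needed for this.
  systemd.nspawn.mycontainer.networkConfig = {
    MACVLAN = "eno1";
  };
}
```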
This test is supposed to demonstrate how to migrate a single container to the new subsystem. Of course, documentation on how to rewrite the config hasn't been written yet; this is mainly a POC showing that a migration is generally possible by

* deploying a new configuration (using `nixos.containers`) equivalent to the old one,
* moving the state from `/var/lib/containers` to `/var/lib/machines`, and
* rebooting the host (unfortunately), because otherwise `systemd-networkd` reaches an inconsistent state, at least with v247.

For the reboot part I also had to change the QEMU VM builder a bit to actually support persistent boot disks.
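A rough sketch of those steps as a NixOS VM-test fragment follows. Assumptions, not copied from the PR's actual test: the test machine is called `host`, the container `mycontainer`, and the legacy unit follows the `container@.service` naming.

```nix
{
  testScript = ''
    # Stop the legacy container and move its state to where
    # systemd-nspawn/machinectl expect it.
    host.succeed("systemctl stop container@mycontainer.service")
    host.succeed("mv /var/lib/containers/mycontainer /var/lib/machines/mycontainer")

    # After switching the host to the new `nixos.containers` configuration,
    # reboot so systemd-networkd (v247) comes up in a consistent state.
    host.shutdown()
    host.start()

    host.wait_for_unit("systemd-nspawn@mycontainer.service")
  '';
}
```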
Applied the diff from NixOS#140669 at revision cd533c3.
I've been using this PR successfully for a while now and made a few changes. The current NixOS container module allows using an existing nixosConfiguration or path, so I've implemented a similar feature in my branch. I also added an option to allow using a specialisation of the container's nixosConfiguration (see `containers-next/default.nix` in my branch). A clever use case for these features is that you can easily deploy a production container and then a development/test specialisation of that same container. I still have a bit to clean up, but I thought I'd comment on my experience with this PR, which has been great.
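Purely as an illustration of that production/development workflow, and with entirely hypothetical option names (the actual interface lives in the linked branch), such a setup might look roughly like:

```nix
{ inputs, ... }:
{
  # Hypothetical sketch: reuse an existing flake nixosConfiguration for a
  # container, and deploy a second instance running one of its
  # specialisations. All option names here are invented for illustration;
  # `inputs` is assumed to be passed in via specialArgs.
  nixos.containers.instances = {
    myapp.config = inputs.self.nixosConfigurations.myapp;
    myapp-dev = {
      config = inputs.self.nixosConfigurations.myapp;
      specialisation = "development";
    };
  };
}
```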
(also posted in #nix-rfc-108:matrix.org) Hi!
Thanks for the update ma27. I am indeed eager to contribute/take over and continue this work. It's a shame you no longer have time to contribute, but I hope that myself and whoever else comes on board can continue the great work you've done here 🙂

For those reading this who are maybe not aware, I had developed my own version of the nixos-nspawn declarative container tooling here: https://github.com/m1cr0man/python-nixos-nspawn. Short of what is in #216025 (which I do need to update to reflect further work done here), it is standalone and can be imported as a flake. I have been running declarative containers now for over a year at least and it's been working great.

What I'd like to suggest as an action plan right now is getting the minimal viable changes into nixpkgs master as soon as possible, and then creating a flake to iterate on the imperative and declarative container modules/components before merging that into master too (if it makes sense). Right now it's a bit inconvenient to have to run a forked nixpkgs for any of RFC 108 to work correctly, and I think this will ease adoption over time too.

I still need to familiarize myself more with the imperative container management suite and also with the migration path/solution for legacy containers. These are just my thoughts of course, and I'm eager to hear what others think we should do. 😄
Small update: I'm currently seeing if it's possible to consolidate the generation of systemd units (nspawn + networkd) between imperative and declarative containers. Right now, declarative container units are generated in Nix whilst the imperative container units are generated in Python, the reason being that it's really awkward to make the module work correctly in both scenarios. I have a working (read: buildable but untested) POC here, so I'm hopeful it will be possible. My main motive is to reduce the chance of the two deployment methods diverging and to cut down on duplicate code.
This pull request has been mentioned on NixOS Discourse. There might be relevant details there: https://discourse.nixos.org/t/nixcon-governance-workshop/32705/9
Motivation for this change
POC for NixOS/rfcs#108
Tests are regularly built at https://hydra.ist.nicht-so.sexy/jobset/nixpkgs/networkd-containers.
(This is also the reason for the temporary `jobset.nix` in the project's root.)

Things done
- Built with `sandbox = true` set in `nix.conf`? (See Nix manual)
- Tested via `nix-shell -p nixpkgs-review --run "nixpkgs-review wip"`
- Tested execution of all binary files (usually in `./result/bin/`)