Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HOME is overridden for containers #863

Open
andreineculau opened this issue Dec 15, 2020 · 32 comments
Open

HOME is overridden for containers #863

andreineculau opened this issue Dec 15, 2020 · 32 comments
Labels
enhancement New feature or request Runner Feature Feature scope to the runner

Comments

@andreineculau
Copy link

Describe the bug
If passing the HOME environment variable to a container configuration,
it will get ignored and set to /github/home anyway as per

container.ContainerEnvironmentVariables["HOME"] = container.TranslateToContainerPath(tempHomeDirectory);

To Reproduce

A workflow with

jobs:
  main:
    runs-on: ubuntu-latest
    container:
      image: 'rokmoln/sf-ubuntu-bionic-common:0.2.80'
      options: '--user sf:sf' # this user's home is /home/sf
      env:
        HOME: /home/sf # tried to change HOME from /github/home to /home/sf. doesn't work
    steps:
      - shell: bash
        run: |
          echo $HOME

will echo /github/home instead of /home/sf.

Expected behavior
Expected HOME to be left as is, as sf user's HOME,
or at the very least to be allowed to enforce it via the env dictionary.

Runner Version and Platform

2.274.2

What's not working?

N/A

Job Log Output

N/A

Runner and Worker's Diagnostic Logs

N/A

@andreineculau andreineculau added the bug Something isn't working label Dec 15, 2020
@andreineculau
Copy link
Author

If I may suggest a possible solution:

  • set HOME only if not already set
  • set GITHUB_HOME or GITHUB_RUNNER_HOME in order to allow devs not to hardcode /github/home in case there's a use-case to access the runner's home from inside a container.

@robin-rpr
Copy link

As we are also having trouble with this issue in our Build Actions, we used this workaround:

Note: We are expecting the /github/home folder to not exist yet in the container.

  1. Create the /github directory
  2. Create a Symlink with $HOME

See:

mkdir -p /github
ln -s /root $HOME # Create Symlink /github/home

@hoxu
Copy link

hoxu commented Jun 11, 2021

I created a bit similar issue about issues caused by setting HOME: #1146.

@andreineculau
Copy link
Author

@hoxu you touched on the most important bit:

On a related note, it would be really nice if the GitHub runner's parameters to docker containers were documented clearly somewhere.

@JJ
Copy link

JJ commented Dec 19, 2021

Overriding $HOME causes all kinds of trouble, to no obvious benefit...

@vamper424
Copy link

Hola gracias salientes desde Los Mochis Sinaloa México. Soy mexicano y de familia numerosa saludos again asta luego.bestamosven línea.

@Taknok
Copy link

Taknok commented Dec 26, 2021

Also, at build the HOME is defined by image/distribution to /root for example. Then when running, the HOME is overwritten by GitHub to /home/github. This is causing issues with some tools installed in image.

@andreineculau
Copy link
Author

I can update this thread for future readers with my current workaround, that doesn't require messing with the filesystem as in #863 (comment) , but does require sudo, maybe tee.

Add a first step to the job like

- shell: bash
  run: 'echo HOME=/home/sf | sudo tee -a $GITHUB_ENV'

FWIW the sudo tee is needed because $GITHUB_ENV will be owned by another user. If you want to skip tee, you can run sudo sh -c "echo HOME=/home/sf >> $GITHUB_ENV"

Following steps will start with the expected HOME, as well as source resource files correctly, like ~.profile ~/.bash_profile etc

@andreineculau
Copy link
Author

andreineculau commented Feb 2, 2022

On another topic, driven by #863 (comment) "no obvious benefit", I have started thinking what can such a benefit be and I've come with these possible benefits:

  1. persist state between container startups/shutdowns
  2. persist state with host
  3. persist state between containers
  4. control the environment via .profile, .bash_profile, .curlrc, etc

Let's start from the bottom, with 4. /github/home is empty. So the only way to reason around that is if Github wants NO environment, in case they found cases where settings in resource files create unexpected behaviour. Sounds far fetched.

Regarding 3 - there's a specific volumes setting. "You can use volumes to share data between services or other steps in a job." Interesting formulation. First of all, did anyone see if Github overrides $HOME for services as well? I didn't. Secondly "other steps in a job" ?! I have just ran a test, and Github doesn't shuts down the container after each step. State is maintained between steps, and furthermore, action steps (i.e. steps with uses:some/action@ref) also run in the same container, having access to the same state.

Regarding 2. Which host? Either you run the job in a container or in a real/virtual machine.

Regarding 1. As mentioned for 3., state seems to be maintained. There's one particular case that I haven't tested, because I don't know where to look to understand how it works, how it should be invoked - docker hub action https://docs.github.com/en/actions/using-workflows/workflow-syntax-for-github-actions#example-using-a-docker-hub-action . Like ok, you reference an image hosted by hub.docker.com, what then? The docs reference an alpine image. How do you run shell commands? Or do they mean that you can package a docker image with a action.yml file in the WORKDIR, and they would run that. I really don't know how that works. But it could be that they start another container based on that image, set HOME to the same /github/home mount, same with GITHUB_WORKSPACE and maybe others, and thus share state though your job and the action step run isolated otherwise.

Summary:

  • ❓ couldn't find a benefit
  • 🙏 can someone enlighten me if a service also has $HOME mounted as /github/home ?
  • 🙏 can someone enlighten me how a docker hub action step is intended to be used, how it works?

** 🚨 THE BEST thing would be if someone at Github can spend some minutes just to clarify WHY? I'm starting from the premise that there is a benefit. Something that evades some of us. If we understand why, then maybe we can even start using Github in different/better ways.**

@andreineculau
Copy link
Author

ping @TingluoHuang

@andreineculau
Copy link
Author

after some googling, I found https://stackoverflow.com/a/56685281/465684

GitHub Actions support confirmed that all of /github persists throughout a workflow run.

As the comment says, there's no citation, no documentation around that, AND it also doesn't hold in reality. I ran a test here https://github.com/ysoftwareab/yplatform-test/actions/runs/1783086868 . test3 jobs depends on test1 and test2, and test3 doesn't have access to the test1 and test2 files, neither in the real $HOME, neither in /github/home, neither in $GITHUB_WORKSPACE. Afaik, one uses upload-artifacts and download-artifacts actions for sharing files between jobs.

Just wanted to be exhaustive.

@andreineculau
Copy link
Author

Looking at the runner's sourcecode, there's another mention of this mount https://github.com/actions/runner/blame/9a829995e02d2db64efb939dc2f283002595d4d9/src/Runner.Worker/Handlers/ContainerActionHandler.cs#L174

I ran out of time so I cannot test this to be sure, but I guess the whole idea of overriding $HOME is to share state with a "docker container action" https://docs.github.com/en/actions/creating-actions/creating-a-docker-container-action 😞

@nikola-jokic
Copy link
Contributor

Hi @andreineculau,

I will mark this as an enhancement since this is working as expected, but obviously, this change could add some value. I am putting it in the enhancement backlog so it is considered in the future enhancements 😊

@nikola-jokic nikola-jokic added enhancement New feature or request Runner Feature Feature scope to the runner and removed bug Something isn't working labels Mar 30, 2022
@andreineculau
Copy link
Author

andreineculau commented Mar 30, 2022

@nikola-jokic why do you say it is working as expected? Where is it documented that "variables CANNOT be overridden" is normal behaviour?

As far as I, and others on this thread, can see - this is a BUG . A step can set new, but cannot override previous variables. That's some peculiar "expectation".


EDIT: ignore this comment, I thought I'm replying in the #789 thread

@nikola-jokic
Copy link
Contributor

Hi @andreineculau,

We are experiencing similar problems with the Issue 434, which is related to how we mount the fs to have those files re-used. After the discussion with the team, we marked it as a feature that we should work on in the near future. That problem is also very important to us. This issue is related to the HOME environment, which is used to share the state between the host system where the runner is executing, and the containerized application.

Since issue 434 is considered a feature, I don't see why this issue would not fall under the same umbrella and should be picked up together.

@andreineculau
Copy link
Author

@nikola-jokic Apologies. My comment above was wrong, because I thought we were talking about #789 (not an excuse but I have a cold and I wasn't thinking right).

I will read up on #434 and come back if needed. Thanks and once again sorry for the noise.

@nikola-jokic
Copy link
Contributor

@andreineculau, no problem 😊. I misunderstood the issue 789, so I re-opened it once I got it. There is a third party PR now related to that issue so it should be fixed as soon as it is merged.

@klepiz
Copy link

klepiz commented Jun 23, 2022

Im running a script on the CI in order to set new environment variables (such as JAVA_HOME) but its seems like it doesn't work

- run: sh /usr/local/sbin/entrypoint.sh 

is that part of this issue? or is there a way to set env variable by a bash script?

Note: seems like the container is not overridden the variable if it was set as ENV statement in the dockerfiles, so that could be a possible workaround of this issue

simonmichael added a commit to simonmichael/hledger that referenced this issue Apr 18, 2024
This is for actions/runner#863 which is causing
"Preventing creation of Stack root '/github/home/.stack/'. Parent directory '/github/home/' is owned by someone else."
simonmichael added a commit to simonmichael/hledger that referenced this issue Apr 18, 2024
This is for actions/runner#863 which is causing
"Preventing creation of Stack root '/github/home/.stack/'. Parent directory '/github/home/' is owned by someone else."
simonmichael added a commit to simonmichael/hledger that referenced this issue Apr 18, 2024
This is for actions/runner#863 which is causing
"Preventing creation of Stack root '/github/home/.stack/'. Parent directory '/github/home/' is owned by someone else."
@gabyx
Copy link

gabyx commented Jul 17, 2024

Can this be fixed in 2024, this should not be any feature or something, this is just totally non-sense and is a bug!
The container defines the HOME variable, full-stop.

@rlindner81
Copy link

After reading this whole thread, I'm really disappointed that this got re-labeled as an enhancement. This behavior is not only unintuitive and pointlessly disruptive to most non-trivial docker images, it's also something that thousands of Github Action users will run into and be completely confused by. That's real suffering.

lukegriffith added a commit to lukegriffith/nvim-scripts that referenced this issue Aug 17, 2024
geedo0 added a commit to geedo0/openssh that referenced this issue Aug 21, 2024
We are hitting the 1 hour time limit of Circle CI (Issue open-quantum-safe#166). This migrates the existing CircleCI job completely to Github Actions which has a 5 hour time limit.

For the most part, this is pretty much a one-to-one migration. Since upstream OpenSSH provided its own set of Github Actions, I simply moved those to the `upstream-github` directory to avoid conflicts and preserve the source. I did run into two issues with getting the integration tests to pass. Beyond that, I ran into two issues that arose from migrating to Github Actions which need to be partched around.

The combination of Github Actions' host with the OQS CI container results in a lazier reaping of zombie processes which breaks this test. In this test, ssh-agent is run as a subprocess to some arbitrary user command. This enables exclusive access to ssh-agent to that specific process. The way this works under the hood is that ssh-agent forks into a child process and the parent process exec's into the arbitrary command ([code ref](https://github.com/open-quantum-safe/openssh/blob/OQS-v9/ssh-agent.c#L2384)) which runs to completion. The child process than polls its parent process until it detects its own orphaned status and terminates itself. This, by design, results in a zombie process which must be reaped. The test's assertion uses `kill -0` to check for liveness, but that counts zombies as "alive". The workaround for this then is to add an additional check to assert that zombies are in fact "dead".

The `percent` test tests % expansions inside SSH config files (e.g. home directory, username, port number). The assertion for the home directory uses the `HOME` environmental variable. Unfortunately, when running a container on a Github Runner, they unconditionally override the value of `HOME` with `/github/home` ([issue ref](actions/runner#863)) and this breaks the test assertion. The fix here is to get a more reliable reference for the home directory and use that for the assertion.

Signed-off-by: gcr <[email protected]>
geedo0 added a commit to geedo0/openssh that referenced this issue Aug 21, 2024
We are hitting the 1 hour time limit of Circle CI (Issue open-quantum-safe#166). This migrates the existing CircleCI job completely to Github Actions which has a 5 hour time limit.

For the most part, this is pretty much a one-to-one migration. Since upstream OpenSSH provided its own set of Github Actions, I simply moved those to the `upstream-github` directory to avoid conflicts and preserve the source. I did run into two issues with getting the integration tests to pass. Beyond that, I ran into two issues that arose from migrating to Github Actions which need to be partched around.

agent-subprocess zombie process reaping

The combination of Github Actions' host with the OQS CI container results in a lazier reaping of zombie processes which breaks this test. In this test, ssh-agent is run as a subprocess to some arbitrary user command. This enables exclusive access to ssh-agent to that specific process. The way this works under the hood is that ssh-agent forks into a child process and the parent process exec's into the arbitrary command ([code ref](https://github.com/open-quantum-safe/openssh/blob/OQS-v9/ssh-agent.c#L2384)) which runs to completion. The child process than polls its parent process until it detects its own orphaned status and terminates itself. This, by design, results in a zombie process which must be reaped. The test's assertion uses `kill -0` to check for liveness, but that counts zombies as "alive". The workaround for this then is to add an additional check to assert that zombies are in fact "dead".

percent expansion is broken due to Github's HOME override

The `percent` test tests % expansions inside SSH config files (e.g. home directory, username, port number). The assertion for the home directory uses the `HOME` environmental variable. Unfortunately, when running a container on a Github Runner, they unconditionally override the value of `HOME` with `/github/home` ([issue ref](actions/runner#863)) and this breaks the test assertion. The fix here is to get a more reliable reference for the home directory and use that for the assertion.

Signed-off-by: gcr <[email protected]>
geedo0 added a commit to geedo0/openssh that referenced this issue Aug 21, 2024
We are hitting the 1 hour time limit of Circle CI (Issue open-quantum-safe#166). This migrates the existing CircleCI job completely to Github Actions which has a 5 hour time limit.

For the most part, this is pretty much a one-to-one migration. Since upstream OpenSSH provided its own set of Github Actions, I simply moved those to the `upstream-github` directory to avoid conflicts and preserve the source. I did run into two issues with getting the integration tests to pass. Beyond that, I ran into two issues that arose from migrating to Github Actions which need to be partched around.

agent-subprocess zombie process reaping

The combination of Github Actions' host with the OQS CI container results in a lazier reaping of zombie processes which breaks this test. In this test, ssh-agent is run as a subprocess to some arbitrary user command. This enables exclusive access to ssh-agent to that specific process. The way this works under the hood is that ssh-agent forks into a child process and the parent process exec's into the arbitrary command ([code ref](https://github.com/open-quantum-safe/openssh/blob/OQS-v9/ssh-agent.c#L2384)) which runs to completion. The child process than polls its parent process until it detects its own orphaned status and terminates itself. This, by design, results in a zombie process which must be reaped. The test's assertion uses `kill -0` to check for liveness, but that counts zombies as "alive". The workaround for this then is to add an additional check to assert that zombies are in fact "dead".

percent expansion is broken due to Github's HOME override

The `percent` test tests % expansions inside SSH config files (e.g. home directory, username, port number). The assertion for the home directory uses the `HOME` environmental variable. Unfortunately, when running a container on a Github Runner, they unconditionally override the value of `HOME` with `/github/home` ([issue ref](actions/runner#863)) and this breaks the test assertion. The fix here is to get a more reliable reference for the home directory and use that for the assertion.

Signed-off-by: Gerardo Ravago <[email protected]>
@kdvolder
Copy link

kdvolder commented Sep 3, 2024

Here's a concrete example of problems caused by the 'unexpected' value of $HOME.

Code like this doesn't do what you would expect:

ssh-keyscan -t rsa some.example.domain >> ~/.ssh/known_hosts

You'd expect this will avoid 'unknown host key' for ssh to the host in question. Sadly it doesn't and the reason appears to be that ~/.ssh resolves to /github/home/.ssh but ssh itself is looking instead (in our case) in /home/ghrunner/.ssh (which is where the actual home directory of the 'ghrunner' user is).

@kyledecot
Copy link

kyledecot commented Sep 3, 2024

@kdvolder Exactly my experience. Anywhere where tilde is used you need to "expand" it manually to be the "real" home, not the hijacked one.

My Dockerfile is full of things like:

ENV RUSTUP_HOME=/workflow/.rustup \
    NPM_CONFIG_PREFIX=/workflow/.npm \
    DOCKER_CONFIG=/workflow/.docker \
    CARGO_HOME=/workflow/.cargo
…

This is a sub-optimal “solution” at best. In some cases the tool internally uses tilde and there is no way to “re-hijack” is back to the “real” HOME.

@AlexSkrypnyk
Copy link

I've added this as a very first step for each of my jobs in the workflow:

    steps:
      - name: Preserve $HOME set in the container
        run: echo HOME=/root >> "$GITHUB_ENV"

/root is a $HOME in my container.

This has to be the very first step in the job and it should not be a symlink as suggested above or the contents of your $HOME in the container will be overwritten by steps in the actions that precede this step.

@horror-proton
Copy link

# these gives /root
getent passwd | grep $USER
echo ~root

# while these gives /github/home
echo ~
echo $HOME

That's hilarious 😞.
Not all applications use environment variables to determine home directory (e.g. openssh).
One must be very patient to find out what's going on with them.

ThrawnCA added a commit to qld-gov-au/ckanext-qgov that referenced this issue Sep 27, 2024
- GitHub Actions overrides HOME for unclear reasons, which breaks container expectations,
actions/runner#863
ThrawnCA added a commit to qld-gov-au/ckanext-qgov that referenced this issue Sep 27, 2024
- GitHub Actions overrides HOME for unclear reasons, which breaks container expectations,
actions/runner#863
@Yurii-huang
Copy link

I don't think it's appropriate to redirect to the root directory, some builds can specify user in options, for example, archlinux does not allow root to build, for the default home directory should determine whether there is a --user designation in options as a judgment condition

@Yurii-huang
Copy link

@andreineculau

@pcolladosoto
Copy link

# these gives /root
getent passwd | grep $USER
echo ~root

# while these gives /github/home
echo ~
echo $HOME

That's hilarious 😞. Not all applications use environment variables to determine home directory (e.g. openssh). One must be very patient to find out what's going on with them.

Yeah... I went mad because $HOME/.gitconfig wasn't being honoured, but the SSH configuration was! It took some digging to find out that OpenSSH actually grabs the home directory based on /etc/passwd as seen here.

Anyway, without considering whether it makes 'sense' or not to overwrite the value of $HOME, I do believe this decision should be made explicit somewhere in the documentation to avoid people chasing ghosts in the future...

@guettli
Copy link

guettli commented Jan 15, 2025

This snippet helped me:

      - name: Fix HOME Directory
        shell: bash
        run: |
          # Issue [HOME is overridden for containers](https://github.com/actions/runner/issues/863)
          h=$(getent passwd $(id -un) | cut -d: -f6)
          if [ "$h" = "$HOME" ]; then
            echo "HOME fine: $HOME"
            exit 0
          fi
          echo "HOME=$HOME was broken. Setting it to $h"
          ls -ld $HOME
          ls -ld $h
          echo "USER: $USER"
          echo "id: $(id)"
          echo "HOME=$h" >> $GITHUB_ENV

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request Runner Feature Feature scope to the runner
Projects
None yet
Development

No branches or pull requests