[mce-2.5] Hive 2485/mce 2.6: Backport AssumeRole, credential_process, and kubeconfig exec fixes #2391

openshift-cherrypick-robot commented Aug 1, 2024

This is an automated cherry-pick of #2389

/assign 2uasimojo

HIVE-2485

Simplify the AssumeRole flow: Rather than doing it via
`credential_process` as a callback from within the creds file used by
the provision pod, flatten this out so the AssumeRole is done implicitly
by the AWS SDK.

This flow remains unchanged:

The clusterdeployment controller:
- Copies the service provider secret into the CD namespace
- Creates an AWS credentials secret
- Creates the provision pod

The provision pod:
- Loads the credentials secret
- Projects the AWS config therein onto the file system
- Invokes the installer

The installer:
- Creates an AWS client using that config file
- Proceeds with installation

Before this commit:
The AWS config contained a `credential_process` which invoked
`hiveutil install-manager aws-credentials` which...
- Loaded the service provider secret
- Created an AWS client
- Used the client to AssumeRole and generate credentials with a 15m
expiration
- Printed the credentials to stdout in the format expected by AWS.

Per AWS docs[1], the SDK will automatically rerun the
`credential_process` before the expiration time to refresh the creds.
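For reference, a minimal sketch of the stdout document a `credential_process` helper emits, using the JSON keys from the AWS docs cited as [1]. This is not the `hiveutil` implementation; the type name `processCredentials`, the function `renderProcessCredentials`, and the sample values are hypothetical.

```go
package main

import (
	"encoding/json"
	"fmt"
	"time"
)

// processCredentials mirrors the JSON document AWS expects on stdout from a
// credential_process helper. The Go type name is hypothetical.
type processCredentials struct {
	Version         int    `json:"Version"`
	AccessKeyID     string `json:"AccessKeyId"`
	SecretAccessKey string `json:"SecretAccessKey"`
	SessionToken    string `json:"SessionToken,omitempty"`
	Expiration      string `json:"Expiration,omitempty"`
}

// renderProcessCredentials formats temporary credentials as that JSON
// document. expiresUnix is the expiration time in seconds since the epoch.
func renderProcessCredentials(keyID, secret, token string, expiresUnix int64) (string, error) {
	doc := processCredentials{
		Version:         1,
		AccessKeyID:     keyID,
		SecretAccessKey: secret,
		SessionToken:    token,
		Expiration:      time.Unix(expiresUnix, 0).UTC().Format(time.RFC3339),
	}
	out, err := json.Marshal(doc)
	if err != nil {
		return "", err
	}
	return string(out), nil
}

func main() {
	// Placeholder credentials; 15 minutes matches the expiration described above.
	expires := time.Now().Add(15 * time.Minute).Unix()
	doc, err := renderProcessCredentials("AKIDEXAMPLE", "example-secret", "example-token", expires)
	if err != nil {
		panic(err)
	}
	fmt.Println(doc)
}
```

The `Expiration` field is what lets the SDK decide when to rerun the process.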

With this commit:
The clusterdeployment controller loads the service provider secret and
folds it into the AWS config as a separate profile, referenced from the
default via `source_profile`:

```
[default]
source_profile = source
role_arn = arn:aws:iam::123456789012:role/assume-role-customer

[profile source]
aws_access_key_id: ABCDEFGHIJKLMNOPQRST
aws_secret_access_key: 1234567890abcdefghijklmnopqrstuvwxyz0123
role_arn = arn:aws:iam::210987654321:role/assume-role-provider
```

Per AWS docs[2], the SDK will use the source creds to AssumeRole to
generate temporary creds, which it will automatically refresh as they
expire -- i.e. natively performing the same function as `hiveutil
install-manager aws-credentials`.
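A minimal sketch of how a controller might render a config of this shape from the service provider credentials. The function `buildAssumeRoleConfig` and its parameters are hypothetical and are not taken from the Hive code; the sketch writes every key with `=` for uniformity.

```go
package main

import (
	"fmt"
	"strings"
)

// buildAssumeRoleConfig renders an AWS config in which the default profile
// assumes customerRoleARN using the credentials in a "source" profile.
// The function and parameter names are hypothetical.
func buildAssumeRoleConfig(accessKeyID, secretAccessKey, providerRoleARN, customerRoleARN string) string {
	var b strings.Builder
	b.WriteString("[default]\n")
	b.WriteString("source_profile = source\n")
	fmt.Fprintf(&b, "role_arn = %s\n", customerRoleARN)
	b.WriteString("\n[profile source]\n")
	fmt.Fprintf(&b, "aws_access_key_id = %s\n", accessKeyID)
	fmt.Fprintf(&b, "aws_secret_access_key = %s\n", secretAccessKey)
	fmt.Fprintf(&b, "role_arn = %s\n", providerRoleARN)
	return b.String()
}

func main() {
	// Placeholder values copied from the example config above.
	fmt.Print(buildAssumeRoleConfig(
		"ABCDEFGHIJKLMNOPQRST",
		"1234567890abcdefghijklmnopqrstuvwxyz0123",
		"arn:aws:iam::210987654321:role/assume-role-provider",
		"arn:aws:iam::123456789012:role/assume-role-customer",
	))
}
```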

[1] https://docs.aws.amazon.com/cli/v1/userguide/cli-configure-sourcing-external.html
[2] https://docs.aws.amazon.com/cli/latest/userguide/cli-configure-role.html

HIVE-2485
HIVE-2529

(cherry picked from commit 8f11ce3)

Conflicts:
        pkg/install/generate.go (hiveutil binary path changed, no longer
        relevant.)

As a security measure, check AWS config/credential files for
`credential_process`, and fail with an error if found.

We used to use `credential_process` deliberately to AssumeRole for STS
clusters. A prior commit switched this over to use a different
mechanism, but existing clusters in the field may still be configured
with the old mechanism in the relevant Secrets. Convert such Secrets to
use the new mechanism.

HIVE-2485

(cherry picked from commit 13ea4f4)

A previous commit (openshift#2306 / 13ea4f4) put in checks to forbid the use of
`credential_process` in AWS config/credentials files. It turns out that
AWS accepts this key case-insensitively, so this commit updates our
checks accordingly.
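A minimal sketch of a case-insensitive check of this kind. The function `containsCredentialProcess` is hypothetical and the actual Hive check may be stricter, for example by rejecting any occurrence of the string anywhere in the file; this version only inspects the key of each `key = value` or `key: value` line.

```go
package main

import (
	"bufio"
	"fmt"
	"strings"
)

// containsCredentialProcess reports whether any line of an AWS config or
// credentials file sets the credential_process key, in any letter case.
// The function name is hypothetical.
func containsCredentialProcess(config string) bool {
	scanner := bufio.NewScanner(strings.NewReader(config))
	for scanner.Scan() {
		line := strings.TrimSpace(scanner.Text())
		// The key is everything before the first "=" or ":" separator.
		idx := strings.IndexAny(line, "=:")
		if idx < 0 {
			continue
		}
		key := strings.TrimSpace(line[:idx])
		if strings.EqualFold(key, "credential_process") {
			return true
		}
	}
	return false
}

func main() {
	cfg := "[default]\nCredential_Process = /usr/bin/something\n"
	if containsCredentialProcess(cfg) {
		fmt.Println("rejected: credential_process is not allowed")
	}
}
```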

HIVE-2485

(cherry picked from commit 229f705)

Users with write access to the admin kubeconfig Secret for a given
ClusterDeployment should not be able to execute arbitrary code in the
privileged environment in which we run the controllers that use those
Secrets. Funnel all code paths that load such Secrets through a
validator to ensure that the AuthInfos[].Exec path is not used.
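A minimal sketch of such a validator. To stay self-contained it uses simplified stand-in types (`kubeconfig`, `authInfo`, `execConfig`) rather than the client-go types; in real code one would presumably parse the Secret with `clientcmd.Load` and inspect the `Exec` field of each entry in `Config.AuthInfos`. The function name `validateNoExec` is hypothetical.

```go
package main

import (
	"fmt"
	"sort"
)

// Simplified stand-ins for the kubeconfig types; only the fields needed for
// the check are modeled. All three type names are hypothetical.
type execConfig struct {
	Command string
}

type authInfo struct {
	Token string
	Exec  *execConfig
}

type kubeconfig struct {
	AuthInfos map[string]*authInfo
}

// validateNoExec returns an error if any user entry configures an exec
// credential plugin, which would run a command in the caller's environment.
func validateNoExec(cfg *kubeconfig) error {
	var offenders []string
	for name, ai := range cfg.AuthInfos {
		if ai != nil && ai.Exec != nil {
			offenders = append(offenders, name)
		}
	}
	if len(offenders) == 0 {
		return nil
	}
	sort.Strings(offenders) // deterministic error text
	return fmt.Errorf("kubeconfig users %v use exec, which is not allowed", offenders)
}

func main() {
	cfg := &kubeconfig{AuthInfos: map[string]*authInfo{
		"admin": {Exec: &execConfig{Command: "/bin/sh"}},
	}}
	if err := validateNoExec(cfg); err != nil {
		fmt.Println("rejected:", err)
	}
}
```

Running every load of the Secret through one function like this keeps the rule in a single place instead of relying on each controller to remember it.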

HIVE-2485

(cherry picked from commit df1ea18)

@2uasimojo (Member) commented:

/lgtm

...if CI is happy. Except for

/override "Red Hat Konflux / hive-mce-25-on-pull-request"

...which will never be happy.

openshift-ci bot added the lgtm label Aug 1, 2024

openshift-ci bot (Contributor) commented Aug 1, 2024

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: 2uasimojo, openshift-cherrypick-robot

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

openshift-ci bot (Contributor) commented Aug 1, 2024

@2uasimojo: Overrode contexts on behalf of 2uasimojo: Red Hat Konflux / hive-mce-25-on-pull-request

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

openshift-ci bot added the approved label Aug 1, 2024

@2uasimojo (Member) commented:

/override ci/prow/security

#2387

openshift-ci bot (Contributor) commented Aug 1, 2024

@2uasimojo: Overrode contexts on behalf of 2uasimojo: ci/prow/security

openshift-ci bot (Contributor) commented Aug 1, 2024

@openshift-cherrypick-robot: The following test failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:

| Test name | Commit | Details | Required | Rerun command |
| --- | --- | --- | --- | --- |
| ci/prow/security | b9d2ed9 | link | true | `/test security` |

Full PR test history. Your PR dashboard.

codecov bot commented Aug 1, 2024

Codecov Report

Attention: Patch coverage is 27.42857% with 127 lines in your changes missing coverage. Please review.

Project coverage is 57.92%. Comparing base (beababc) to head (b9d2ed9).

| Files | Patch % | Lines |
| --- | --- | --- |
| pkg/install/generate.go | 2.08% | 47 Missing ⚠️ |
| contrib/pkg/utils/generic.go | 0.00% | 18 Missing ⚠️ |
| pkg/controller/utils/secrets.go | 61.76% | 8 Missing and 5 partials ⚠️ |
| pkg/awsclient/client.go | 0.00% | 12 Missing ⚠️ |
| pkg/installmanager/installmanager.go | 0.00% | 9 Missing ⚠️ |
| contrib/pkg/utils/aws/aws.go | 0.00% | 6 Missing ⚠️ |
| .../controller/clusterdeployment/clusterprovisions.go | 33.33% | 4 Missing ⚠️ |
| ...lusterdeprovision/clusterdeprovision_controller.go | 20.00% | 4 Missing ⚠️ |
| contrib/pkg/utils/openstack/openstack.go | 0.00% | 2 Missing ⚠️ |
| contrib/pkg/utils/ovirt/ovirt.go | 0.00% | 2 Missing ⚠️ |

... and 9 more
Additional details and impacted files

@@             Coverage Diff             @@
##           mce-2.5    #2391      +/-   ##
===========================================
+ Coverage    57.88%   57.92%   +0.04%     
===========================================
  Files          187      186       -1     
  Lines        26088    26075      -13     
===========================================
+ Hits         15101    15105       +4     
+ Misses        9721     9707      -14     
+ Partials      1266     1263       -3     
| Files | Coverage Δ |
| --- | --- |
| ...roller/argocdregister/argocdregister_controller.go | 50.50% <100.00%> (-0.22%) ⬇️ |
| ...roller/awsprivatelink/awsprivatelink_controller.go | 68.09% <100.00%> (+0.27%) ⬆️ |
| ...controller/clusterclaim/clusterclaim_controller.go | 63.59% <100.00%> (ø) |
| .../clusterdeployment/clusterdeployment_controller.go | 66.66% <100.00%> (+0.12%) ⬆️ |
| ...kg/controller/clusterdeployment/clusterinstalls.go | 75.23% <100.00%> (ø) |
| pkg/remoteclient/remoteclient.go | 73.18% <100.00%> (+1.07%) ⬆️ |
| contrib/pkg/utils/azure/azure.go | 0.00% <0.00%> (ø) |
| contrib/pkg/utils/gcp/gcp.go | 0.00% <0.00%> (ø) |
| pkg/controller/awsprivatelink/cleanup.go | 46.10% <50.00%> (ø) |
| pkg/controller/clusterdeprovision/awsactuator.go | 34.48% <0.00%> (ø) |

... and 15 more

openshift-merge-bot bot merged commit 7bc174f into openshift:mce-2.5 Aug 1, 2024
10 of 12 checks passed

@2uasimojo (Member) commented:

/cherry-pick mce-2.4

@openshift-cherrypick-robot (Author) commented:

@2uasimojo: #2391 failed to apply on top of branch "mce-2.4":

Applying: Replumb AssumeRole (AWS)
Applying: Forbid and convert credential_process
Using index info to reconstruct a base tree...
M	pkg/controller/clusterdeployment/clusterprovisions.go
M	pkg/install/generate.go
M	pkg/installmanager/installmanager.go
Falling back to patching base and 3-way merge...
Auto-merging pkg/installmanager/installmanager.go
Auto-merging pkg/install/generate.go
CONFLICT (content): Merge conflict in pkg/install/generate.go
Auto-merging pkg/controller/clusterdeployment/clusterprovisions.go
error: Failed to merge in the changes.
hint: Use 'git am --show-current-patch=diff' to see the failed patch
Patch failed at 0002 Forbid and convert credential_process
When you have resolved this problem, run "git am --continue".
If you prefer to skip this patch, run "git am --skip" instead.
To restore the original branch and stop patching, run "git am --abort".
