fix: improve change detection for GHAs #27904

mistercrunch · 2024-04-04T15:58:02Z

SUMMARY

Recently with #27867 (upgrading) pylint, we discovered that many python-related checks that should have run didn't run. The conditional execution system was uneven and showing different issues.

This PR streamline the whole conditional execution of github actions across the board.

The problem:

many, imperfect ways to do file change detection
- native GHA paths triggers that are incompatible with "required checks" (and require no-op.yml, to the script)
- a shell script that's a bit cryptic and a bit verbose and error-prone
actions that should be conditional but aren't set up that way

The solution:

creation of a single reusable action for all file-based conditional execution
introducing scripts/change_detector.py, a simple script inspired by scripts/ci_check_no_file_changes.sh, but more advanced and readable, handling both the pull_request and push context
moving all actions with file-related triggers to using this
getting rid of the no-op.yml hack altogether

TESTING

A fair amount of manual testing here, triggering with/without python/frontend/docker changes and making sure that the right steps get executed or skippped

.github/workflows/codeql-analysis.yml

superset/__init__.py

john-bodley · 2024-04-04T18:00:04Z

.github/workflows/superset-python-integrationtest.yml

  pull_request:
    types: [synchronize, opened, reopened, ready_for_review]
+    paths:
+      - "superset/**"


@mistercrunch I wonder if we would need to include .pre-commit-config.yaml or would the premise be that if pre-commit needed to touch any files the superset-python-misc.yml workflow would fail. That would then force the user run tox -e pre-commit locally which would then touch files in the superset/** and tests/** paths?

pre-commit has its own [required] check and runs on all files (not just the last commit as it does locally) and will non-zero exit if there's anything. So that prevents people from commit --no-verify or simply not setting up pre-commit

mistercrunch · 2024-04-04T19:31:31Z

Now I'm realizing that some of the files I touched here use ./scripts/ci_check_no_file_changes.sh and others the "if/no-op" pattern, which is not super elegant but pretty functional and DRY. Let me standardize one way or another here.

rusackas · 2024-04-05T04:34:21Z

.github/workflows/codeql-analysis.yml

+        id: check
+        uses: ./.github/actions/change-detector/
+        with:
+          token: ${{ secrets.GITHUB_TOKEN }}


Is there a use case where you might use different github tokens for different actions? If not, you could DRY it up further and not pass a token from every action to the change detector, but rather use the secret directly there.

this seemed like a limitation / requirement give how GHA works, I was able to make everything else DRY, but not this. We really just need read access to a public repo here (looking up either the PR's file touched, or the diff since last commit for push events), so maybe there's a way this can work without passing a token explicitely. Let me test whether it works without passing it.

Tested and seems it's just the way reusable actions work - they don't have access to secrets for security reason. The previous shell script had to do something similar, required also the PR_NUMBER to be passed (now I pick it up from env var), and I think it didn't support push events properly. So still a step forward.

mistercrunch · 2024-04-05T17:17:13Z

scripts/change_detector.py

+from urllib.request import Request, urlopen
+
+# Define patterns for each group of files you're interested in
+PATTERNS = {


This is the nice part that makes things evolutive. Here patterns for all actions can be managed centrally and be extended to support more rules.

eschutho · 2024-04-05T23:08:03Z

.github/workflows/superset-e2e.yml

@@ -109,7 +108,7 @@ jobs:
          run: cypress-run-all
      - name: Upload Artifacts
        uses: actions/upload-artifact@v4
-        if: failure()
+        if: steps.check.outputs.python || steps.check.outputs.frontend


isn't this statement different than failure() which will catch if any of the above run steps fail?

from my understanding if: failure() is similar to if: steps.check.outcome == 'failure'. The previous bash script would receive params, like python or frontend and exit 1 if it discovered a file matched the pattern, then subsequent jobs used that failure as a trigger. Now if failure() is broader than if: steps.check.outcome == 'failure', and will trigger the job if ANY of the previous step failed. It feels like this is right though

geido

LGMT but I'd like to have some more eyes on this. CC @dpgaspar and @craig-rueda

mistercrunch · 2024-04-08T23:20:11Z

I'll merge since we had 3 reviewers looking into this one so far. I will be monitoring master and upcoming PRs to make sure the right jobs are ran or skipped.

Shoki52 · 2024-04-09T09:12:18Z

Hello there, can anyone help me? I want to add new currency by overwriting config file, where I can see the list of all currencies or locales

mistercrunch · 2024-04-09T15:51:39Z

Please open a new issue. You start by searching the codebase for currency-related terms or symbol.

mistercrunch requested review from villebro, geido, eschutho, rusackas, betodealmeida, nytai, craig-rueda, john-bodley, kgabryje and dpgaspar as code owners April 4, 2024 15:58

pull-request-size bot added the size/S label Apr 4, 2024

github-actions bot added the github_actions Pull requests that update GitHub Actions code label Apr 4, 2024

mistercrunch mentioned this pull request Apr 4, 2024

fix(pylint): Address errors/warnings introduced by #27867 #27889

Merged

9 tasks

mistercrunch commented Apr 4, 2024

View reviewed changes

.github/workflows/codeql-analysis.yml Show resolved Hide resolved

mistercrunch commented Apr 4, 2024

View reviewed changes

superset/__init__.py Show resolved Hide resolved

john-bodley reviewed Apr 4, 2024

View reviewed changes

pull-request-size bot added size/XL and removed size/S labels Apr 4, 2024

mistercrunch changed the title ~~fix: add more path triggers for python GHAs~~ fix: improve change detection for GHAs Apr 4, 2024

github-actions bot added the preset-io label Apr 4, 2024

mistercrunch force-pushed the trigger_ci branch from 6f96108 to d20555a Compare April 4, 2024 23:53

fix: add more path triggers for python GHAs

c14606d

mistercrunch force-pushed the trigger_ci branch from 624e9b7 to c14606d Compare April 5, 2024 01:00

mistercrunch added 3 commits April 4, 2024 18:07

print tweaks

e102487

uncommenting

136ae7c

remove artifact

ddc9c96

rusackas reviewed Apr 5, 2024

View reviewed changes

mistercrunch commented Apr 5, 2024

View reviewed changes

trying without token

a5fa49b

rollback

b354ea7

eschutho reviewed Apr 5, 2024

View reviewed changes

geido approved these changes Apr 8, 2024

View reviewed changes

mistercrunch merged commit e80d194 into master Apr 8, 2024
28 checks passed

mistercrunch deleted the trigger_ci branch April 8, 2024 23:20

EnxDev pushed a commit to EnxDev/superset that referenced this pull request Apr 15, 2024

fix: improve change detection for GHAs (apache#27904)

a0c61bc

qleroy pushed a commit to qleroy/superset that referenced this pull request Apr 28, 2024

fix: improve change detection for GHAs (apache#27904)

ae70f2b

jzhao62 pushed a commit to jzhao62/superset that referenced this pull request May 16, 2024

fix: improve change detection for GHAs (apache#27904)

f2e4880

vinothkumar66 pushed a commit to vinothkumar66/superset that referenced this pull request Nov 11, 2024

fix: improve change detection for GHAs (apache#27904)

b179301

mistercrunch added 🏷️ bot A label used by `supersetbot` to keep track of which PR where auto-tagged with release labels 🚢 4.1.0 labels Nov 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: improve change detection for GHAs #27904

fix: improve change detection for GHAs #27904

mistercrunch commented Apr 4, 2024 •

edited

Loading

john-bodley Apr 4, 2024 •

edited

Loading

mistercrunch Apr 4, 2024

mistercrunch commented Apr 4, 2024

rusackas Apr 5, 2024

mistercrunch Apr 5, 2024

mistercrunch Apr 5, 2024

mistercrunch Apr 5, 2024

eschutho Apr 5, 2024

mistercrunch Apr 6, 2024

geido left a comment

mistercrunch commented Apr 8, 2024

Shoki52 commented Apr 9, 2024

mistercrunch commented Apr 9, 2024

fix: improve change detection for GHAs #27904

fix: improve change detection for GHAs #27904

Conversation

mistercrunch commented Apr 4, 2024 • edited Loading

SUMMARY

TESTING

john-bodley Apr 4, 2024 • edited Loading

Choose a reason for hiding this comment

mistercrunch Apr 4, 2024

Choose a reason for hiding this comment

mistercrunch commented Apr 4, 2024

rusackas Apr 5, 2024

Choose a reason for hiding this comment

mistercrunch Apr 5, 2024

Choose a reason for hiding this comment

mistercrunch Apr 5, 2024

Choose a reason for hiding this comment

mistercrunch Apr 5, 2024

Choose a reason for hiding this comment

eschutho Apr 5, 2024

Choose a reason for hiding this comment

mistercrunch Apr 6, 2024

Choose a reason for hiding this comment

geido left a comment

Choose a reason for hiding this comment

mistercrunch commented Apr 8, 2024

Shoki52 commented Apr 9, 2024

mistercrunch commented Apr 9, 2024

mistercrunch commented Apr 4, 2024 •

edited

Loading

john-bodley Apr 4, 2024 •

edited

Loading