-
Notifications
You must be signed in to change notification settings - Fork 82
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: 🎸 add a /backfill admin endpoint #708
Conversation
It only updates all the datasets of the Hub, with a low priority. Note that most of the jobs will be skipped, because the response will already be in the cache. Also thiscommit fixes the creation of children jobs: the priority is preserved (ie low priority jobs created children low priority jobs)
Codecov ReportBase: 83.46% // Head: 87.12% // Increases project coverage by
Additional details and impacted files@@ Coverage Diff @@
## main #708 +/- ##
==========================================
+ Coverage 83.46% 87.12% +3.66%
==========================================
Files 14 20 +6
Lines 526 660 +134
==========================================
+ Hits 439 575 +136
+ Misses 87 85 -2
Flags with carried forward coverage won't be shown. Click here to find out more. Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. ☔ View full report at Codecov. |
Merging... |
Followup: it does not work because the loop that creates the jobs one by one takes too long and the nginx reverse-proxy returns a gateway timeout error. Only part of the datasets are refreshed. |
Alternatives:
I think the first option is the simplest one. Trying here: #720 |
OK. It's still a lot too slow, and it still timeouts:
Note that in #720 we were still doing a loop, it was not a batch operation |
The logic is very basic: it updates all the datasets of the Hub, with a low priority. Note that most of the jobs will be skipped, because the response will already be in the cache.
We might want to take a more detailed approach later to reduce the number of unnecessary jobs by specifically creating jobs for the missing data only.
Apart of this, the PR also fixes the creation of children jobs: the priority is preserved (ie low priority jobs created low priority children jobs)