-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
berkeley
: Dagster job ensure_alldocs
fails with AssertionError
#690
Comments
This issue is causing the following downstream issue: #689 |
There is no longer a 1:1 with collection names and class names. Since type is universally enforced we reduced the total number of collections. This is high priority b/c this backs production endpoints and the is needed for the ncbi export code. |
I think @PeopleMakeCulture, @sujaypatil96, and I have a solid plan for fixing this. It will make the generation of alldocs take longer (by several minutes) because it involves processing documents one-by-one instead of treating every document in the collection as though they have identical class hierarchies. I plan to prototype the fix later today. |
A fix is ready for review in #694. |
Today, I visited the Dagit instance in the Berkeley environment (
nmdc-berkeley
namespace on Spin) and tried running theensure_alldocs
job.While the
materialize_alldocs
op was running, an error occurred. Here's a screenshot of the error message, followed by a copy/paste of the same error message:Show/hide copy/pasted error message
I think this was the first time that job had been run in the Berkeley environment. That is based upon what I see here, on the "Runs" page of Dagit:
Task
The task here is to make it so the
ensure_alldocs
job runs and an alldocs collection exists in the Berkeley Mongo database.The text was updated successfully, but these errors were encountered: