-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(ingest): include instance in container dataPlatform when provided #6083
feat(ingest): include instance in container dataPlatform when provided #6083
Conversation
When I raised this topic:
This PR addresses the missing platform instance. What's your thought about the environment? It's likely that the eg |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm
@sgomezvillamor to distinguish in that case, I'd recommend setting platform instance to Unfortunately we can't add an |
From my point of view that would be even worse because filters would become totally unaligned.
Quite complex IMO and hard to understand for the users. What about defining “env” as optional for containers so it keeps backward compatibility? Ingestion recipes may have a flag to enable “env” field in the containers, otherwise it remains as null and backwards compatible. Alternatively it could be a datahub deployment setting. Just anyway to opt-in and enable the feature. |
Agree that it's not great. I chatted with the team internally and wanted to explain some of the thinking here. Our long-term intention is that Towards this goal, our plan is to add an env attribute to platform instances soon. Basically, you'd name your platform instance "dev_customers" or whatever, and also set env=DEV. The UI would enable search / filtering on this attribute. In the short term I realize that it's not a great user experience. A minor correction though: because we currently include the platform instance name as part of the dataset urn, searching for "prod_customers" should surface both containers and datasets. The piece that won't work is that setting the env "prod" filter won't show any containers. |
Thanks for sharing the long-term plans. The reasoning about dataset identity and so moving env to the platform instance makes sense. It's unfortunate that we break the alignment during the transition and the cost is the user experience, as you said. The use case my organization is facing now is about glue databases and tables and it's weird not having them aligned and supporting the filters in a homogeneous way. In order to mitigate that cost, what about adding env to the platform instance as part of the work done in this PR? Wit that,
With this approach, the env is duplicated in the dataset entity and so the cost is to ensure the consistency and likely more complex graphql queries. So the cost impacts developer experience, not user experience. And also, this would mean moving a step closer to the long-term vision. Coincidentally, I also have another PR for extending the features of the Platform Instance entity. #5728 Opportunistically sharing it here 🙄 We may easily add Factory (dev, pre, pro, ...) too 😄 |
We could do that, but I worry about making this PR huge in scope. Adding an env attribute to platform instance and/or containers isn't quite enough, since we'd also need to add support for filtering using that on the backend and frontend. I'm not sure I have bandwidth for taking that whole thing on right now, but I'd be happy to hand this PR over if you're interested. |
Supersedes #4665.
Checklist