Add validations when materializing from file sources #1615
Signed-off-by: Achal Shah <[email protected]>
Codecov Report
| Coverage Diff | master | #1615 | +/- |
|---|---|---|---|
| Coverage | 83.57% | 83.64% | +0.07% |
| Files | 65 | 67 | +2 |
| Lines | 5771 | 5816 | +45 |
| Hits | 4823 | 4865 | +42 |
| Misses | 948 | 951 | +3 |
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: achals, woop
* Validate join keys when materializing from file sources
  Signed-off-by: Achal Shah <[email protected]>
* Dedupe columns when extracting from the dataframe
  Signed-off-by: Achal Shah <[email protected]>
Signed-off-by: Achal Shah [email protected]
What this PR does / why we need it:
This PR validates join keys when materializing from file sources, to make sure that the join keys are present in the dataframe columns. Additionally, we dedupe column names when extracting columns from the pandas dataframe. This should specifically catch the case of `event_timestamp_column` and `created_timestamp_column` pointing to the same column name.

Additionally, this fixes a bug where the `EntityNotFoundException` was instantiated with the entity name and project name switched.

Which issue(s) this PR fixes:
Fixes #
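
For readers skimming the description, the join-key validation and column deduplication summarized above can be sketched roughly as follows. This is a minimal illustration, not the code from this PR: the helper name `columns_to_extract`, its parameters, and the use of `ValueError` are all assumptions made for the example, and it works on plain column-name lists so that it runs without pandas.

```python
from typing import List, Optional, Sequence


def columns_to_extract(
    available_columns: Sequence[str],
    join_keys: Sequence[str],
    feature_names: Sequence[str],
    event_timestamp_column: str,
    created_timestamp_column: Optional[str] = None,
) -> List[str]:
    """Hypothetical helper: validate join keys, then build a deduplicated column list."""
    # Fail early if any join key is absent from the source columns.
    missing = [key for key in join_keys if key not in available_columns]
    if missing:
        raise ValueError(f"Join keys not found in dataframe columns: {missing}")

    wanted = [*join_keys, *feature_names, event_timestamp_column]
    if created_timestamp_column is not None:
        wanted.append(created_timestamp_column)

    # dict.fromkeys keeps the first occurrence of each name and preserves order,
    # so two timestamp settings that name the same column collapse into one entry.
    return list(dict.fromkeys(wanted))


columns = columns_to_extract(
    available_columns=["driver_id", "conv_rate", "ts"],
    join_keys=["driver_id"],
    feature_names=["conv_rate"],
    event_timestamp_column="ts",
    created_timestamp_column="ts",  # same column as the event timestamp
)
print(columns)  # ['driver_id', 'conv_rate', 'ts']
```

With a pandas dataframe, the returned list could be passed to `df[columns]`. Selecting with a list that repeats a label would produce a frame with duplicate column names, which is the situation the deduplication is meant to avoid.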
Does this PR introduce a user-facing change?: