fix(ingest): correctly handle transformer patch semantics #6505

hsheth2 · 2022-11-22T02:22:22Z

The ownership transformer contains a clever optimization, where if it is set to PATCH and does not need to make any changes, no aspect is generated. The surrounding code did not handle this logic correctly, and would issue an upsert with the original ownership object, hence overwriting anything that lived on the server. This bug would only be triggered if the server already contained a superset of what the transformer was trying to add.
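The failure mode described above can be sketched roughly as follows. This is a simplified illustration, not the actual DataHub code: `write_aspect`, the `fixed` flag, and the dict-backed `server` are hypothetical stand-ins, and an aspect is reduced to a plain list of owner names.

```python
from typing import Callable, Dict, List, Optional

# Hypothetical simplification: an aspect is just a list of owner names.
Aspect = List[str]
Transform = Callable[[Aspect], Optional[Aspect]]


def write_aspect(
    server: Dict[str, Aspect],
    urn: str,
    original: Aspect,
    transform: Transform,
    fixed: bool = True,
) -> bool:
    """Apply `transform` and upsert the result; return True if a write happened."""
    result = transform(original)
    if result is None:
        if fixed:
            # PATCH found nothing to add: skip the write and leave the server alone.
            return False
        # Pre-fix behaviour as described: fall back to the original aspect.
        result = original
    # An upsert replaces the whole aspect stored for this URN.
    server[urn] = list(result)
    return True
```

With `fixed=False`, a transformer that returns `None` still causes the source's original aspect to replace whatever the server held. With `fixed=True`, the `None` is treated as "nothing to write" and the server's copy is left untouched.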

Many other dataset transformers had a similar, subtle bug. For example, when using the pattern_add_dataset_tags transformer, if no new tags applied to a given dataset, the server's tags would get overwritten even if PATCH was specified. This PR should also resolve that class of bugs.
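On the transformer side, the tags case might look like the sketch below. `patch_add_tags` is a hypothetical helper, not part of DataHub's API. It assumes the server's current tags have already been fetched and that tags are represented as plain strings.

```python
from typing import List, Optional


def patch_add_tags(server_tags: List[str], tags_to_add: List[str]) -> Optional[List[str]]:
    """Return the merged tag list, or None when the server already has every tag."""
    # dict.fromkeys de-duplicates tags_to_add while preserving order.
    new_tags = [t for t in dict.fromkeys(tags_to_add) if t not in server_tags]
    if not new_tags:
        # Nothing to add under PATCH: signal "no aspect" instead of echoing input.
        return None
    return server_tags + new_tags
```

The `None` return only helps if the caller treats it as "emit nothing"; a caller that substitutes the source's original tags aspect would reintroduce the overwrite.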

Tested manually with the ownership, props, and tags transformers.

Checklist

The PR conforms to DataHub's Contributing Guideline (particularly Commit Message Format)
Links to related issues (if applicable)
Tests for the changes have been added/updated (if applicable)
Docs related to the changes have been added/updated (if applicable). If a new feature has been added a Usage Guide has been added for the same.
For any breaking change/potential downtime/deprecation/big changes an entry has been made in Updating DataHub

github-actions · 2022-11-22T02:42:47Z

Unit Test Results (metadata ingestion)

      8 files ±0       8 suites ±0 58m 6s ⏱️ + 2m 7s
  761 tests ±0   758 ✔️ +6 3 💤 ±0 0 ❌ - 1
1 524 runs ±0 1 517 ✔️ +7 7 💤 ±0 0 ❌ - 2

Results for commit 959eedb. ± Comparison against base commit 490097e.

github-actions · 2022-11-22T02:52:46Z

Unit Test Results (build & test)

621 tests ±0 617 ✔️ ±0 15m 49s ⏱️ +6s
157 suites ±0     4 💤 ±0
157 files ±0     0 ❌ ±0

Results for commit 959eedb. ± Comparison against base commit 490097e.

mayurinehate

LGTM

hsheth2 added 2 commits November 21, 2022 21:21

fix(ingest): correctly handle transformer patch semantics

10cea63

port to other transformers

959eedb

github-actions bot added the ingestion label (PR or Issue related to the ingestion of metadata) Nov 22, 2022

mayurinehate approved these changes Nov 22, 2022

jjoyce0510 approved these changes Nov 22, 2022

jjoyce0510 merged commit 74cc88f into datahub-project:master Nov 22, 2022

hsheth2 deleted the transformer-fixes branch November 22, 2022 19:24

cccs-Dustin pushed a commit to CybercentreCanada/datahub that referenced this pull request Feb 1, 2023

fix(ingest): correctly handle transformer patch semantics (datahub-pr…

e5b11d3

…oject#6505)
