feat(ingest): add ability to preserve dbt table identifier casing #7854

viplazylmht · 2023-04-19T04:04:43Z

Summary

Resolve the issue #7853.

Checklist

The PR conforms to DataHub's Contributing Guideline (particularly Commit Message Format)
Links to related issues (if applicable)
Tests for the changes have been added/updated (if applicable)
Docs related to the changes have been added/updated (if applicable). If a new feature has been added a Usage Guide has been added for the same.
For any breaking change/potential downtime/deprecation/big changes an entry has been made in Updating DataHub

hsheth2 · 2023-05-24T04:12:26Z

@viplazylmht the code overall looks good here

However, I've generally found that the approach of setting convert_dataset_urns to True everywhere more reliably produced correct lineage. As such, I'm curious to understand the motivation behind this PR

laulpogan · 2023-06-07T17:21:56Z

Hi @viplazylmht - we haven't seen activity on this PR for a little bit, are you still interested in contributing? If not we'll go ahead and close it if we haven't heard back from you in a week!

viplazylmht · 2023-06-11T02:41:54Z

@hsheth2 @laulpogan I'm here. Well, convert_dataset_urns_to_lowercase currently has the default value as True, so it will not break any lineages.

In my case, I use datahub with dbt and Bigquery, and the Bigquery adapter said that they have a convert_urns_to_lowercase configuration, but default to False. So the urns they produced are completely different (because our bigquery tables are in UPPERCASE).

I am planning to integrate dbt to the existing datahub x bigquery production environment, so dbt should have the above config, instead of dropping all current metadata and ingesting all again.

feat(ingest): add ability to preserve dbt table identifier casing

9b2f44c

github-actions bot added the ingestion PR or Issue related to the ingestion of metadata label Apr 19, 2023

vercel bot had a problem deploying to Preview April 19, 2023 04:14 Failure

Merge branch 'datahub-project:master' into dbt-preverse-dataset-urns

efe2d6e

vercel bot deployed to Preview April 27, 2023 13:44 View deployment

Merge branch 'master' into dbt-preverse-dataset-urns

789fe53

vercel bot had a problem deploying to Preview May 23, 2023 01:01 Failure

hsheth2 self-requested a review May 23, 2023 15:25

Merge branch 'master' into dbt-preverse-dataset-urns

81460e0

vercel bot deployed to Preview May 24, 2023 04:29 View deployment

anshbansal added the community-contribution PR or Issue raised by member(s) of DataHub Community label Jun 23, 2023

asikowitz assigned hsheth2 Aug 14, 2023

shirshanka added the on-deck PR or Issue that will be reviewed and/or addressed by the DataHub Maintainers in future cycles label Jun 28, 2024

viplazylmht closed this by deleting the head repository Oct 13, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ingest): add ability to preserve dbt table identifier casing #7854

feat(ingest): add ability to preserve dbt table identifier casing #7854

viplazylmht commented Apr 19, 2023

hsheth2 commented May 24, 2023

laulpogan commented Jun 7, 2023

viplazylmht commented Jun 11, 2023

feat(ingest): add ability to preserve dbt table identifier casing #7854

feat(ingest): add ability to preserve dbt table identifier casing #7854

Conversation

viplazylmht commented Apr 19, 2023

Summary

Checklist

hsheth2 commented May 24, 2023

laulpogan commented Jun 7, 2023

viplazylmht commented Jun 11, 2023