Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Epic] Next Steps for DBT / normalization #2566

Closed
4 of 12 tasks
cgardens opened this issue Mar 22, 2021 · 7 comments
Closed
4 of 12 tasks

[Epic] Next Steps for DBT / normalization #2566

cgardens opened this issue Mar 22, 2021 · 7 comments
Assignees
Labels
Epic team/destinations Destinations team's backlog type/enhancement New feature or request

Comments

@cgardens
Copy link
Contributor

cgardens commented Mar 22, 2021

Tell us about the problem you're trying to solve

Here is a compiled list of features for normalization from discussions/feedbacks with users

Describe the solution you’d like

Google document to discuss and comment on these ideas in more details which would serve to prioritize roadmap around transformations and re-order the items

┆Issue is synchronized with this Asana task by Unito

@cgardens cgardens added the type/enhancement New feature or request label Mar 22, 2021
@cgardens cgardens added this to the Core - 2021-03-26 milestone Mar 22, 2021
@ChristopheDuong
Copy link
Contributor

ChristopheDuong commented Mar 30, 2021

All discussions are written in this doc with more details, please feel free to comment!!

Anyone should be able to access:
https://docs.google.com/document/d/1_eii32qoznzQ7_7aSzI5S0b-tajCeHbgRyQvYwNjK40/edit?usp=sharing

@ChristopheDuong ChristopheDuong removed this from the Core - 2021-03-26 milestone Mar 30, 2021
@leecheeaun
Copy link

leecheeaun commented Mar 31, 2021

Upvote for Incremental batch normalization - this was what I was looking for in 2683.

The rationale for this as a user is to offload querying the OLTP DB to querying data warehouses, or to avoid using OLTP DBs as warehouse.

Also, with regards to incremental batch normalization, looking for low latency querying difference between source and destination, so this would mean the time taken for the data to be normalized in the warehouse.

My use case can best be illustrated below, where I have a (relatively) large amount of historical data, and a few rows of new data added per day to an OLTP DB

data_patterns

@hoanghapham
Copy link

Upvote for better Handling Source Schema Changes. Currently, when I want to add a new table to be synced, Airbyte cleans all of the existing destination tables, which is not a desired behavior.

@ChristopheDuong
Copy link
Contributor

Upvote for better Handling Source Schema Changes. Currently, when I want to add a new table to be synced, Airbyte cleans all of the existing destination tables, which is not a desired behavior.

It seems like the related issue to track this is here: #3520

@fixico-abdel
Copy link

Upvote for better Handling Source Schema Changes. Currently, when I want to add a new table to be synced, Airbyte cleans all of the existing destination tables, which is not a desired behavior.

@erichartono
Copy link

Upvote for this improvement. I had a sync with incremental dedupe, and it took 48 minutes on normalisation process.

image

@ChristopheDuong
Copy link
Contributor

The dedicated ticket for incremental normalization support is #4286

@grishick grishick added the Epic label Apr 21, 2022
@grishick grishick changed the title Next Steps for DBT / normalization [Epic] Next Steps for DBT / normalization Apr 21, 2022
@grishick grishick added the team/destinations Destinations team's backlog label Sep 27, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Epic team/destinations Destinations team's backlog type/enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

8 participants