[Question] Add time constraints when a middle table has no timestamp #11

Open
DanielMao1 opened this issue Aug 25, 2024 · 1 comment

DanielMao1 commented Aug 25, 2024

In a snowflake schema, how do I set time constraints if an intermediate table has no timestamp?
Hello, I am a getML user. I am trying to switch from featuretools to your tool, and I appreciate the great job you have done. But I have run into a problem:
Suppose a dataset has a snowflake schema in which T0 is the target table and there is a join path T0->T1<-T2.
If T0, T1, and T2 all have timestamps, that's fine: we can add time constraints on the join like this:

schema.join(
    T1,
    on="FK", 
    time_stamps=("date1",  "date2"),
)

A more concrete example is the schema below: customers and calls have timestamps, but complaints does not. How can I add time constraints between customers and calls?
snowflake_schema

But if T0 and T2 have timestamps and T1 does not, I cannot set time constraints, although I would like to, because T1 has no timestamp. This works fine in featuretools but does not work in getML. Do you plan to support this, or is there a workaround?

Collaborator

Jogala commented Aug 26, 2024

Hi @DanielMao1, thanks for your question.

As a quick fix, I would suggest adding the customer_id to the calls table and then joining the calls table directly onto the customers table.
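A minimal sketch of that denormalization step in plain Python, assuming complaints holds both a complaint key and customer_id, and that calls references complaints through that key. The column names complaint_id, call_id and call_date are hypothetical, as are the sample rows; only customer_id comes from the suggestion above.

```python
# Hypothetical rows; every column name except customer_id is an assumption.
complaints = [
    {"complaint_id": 10, "customer_id": 1},
    {"complaint_id": 11, "customer_id": 2},
]
calls = [
    {"call_id": 100, "complaint_id": 10, "call_date": "2024-01-05"},
    {"call_id": 101, "complaint_id": 11, "call_date": "2024-02-01"},
    {"call_id": 102, "complaint_id": 10, "call_date": "2024-03-09"},
]


def add_customer_id(calls, complaints):
    """Return copies of the call rows with customer_id looked up via complaints."""
    # Map each complaint to the customer who filed it.
    customer_of = {c["complaint_id"]: c["customer_id"] for c in complaints}
    return [
        {**call, "customer_id": customer_of[call["complaint_id"]]}
        for call in calls
        if call["complaint_id"] in customer_of  # drop calls with no matching complaint
    ]


calls_with_customer = add_customer_id(calls, complaints)
```

Each call row then carries both customer_id and its own timestamp, so calls can be joined onto customers with on="customer_id" and a time_stamps pair, in the same form as the schema.join call in the question. With pandas data frames, the same lookup could be done with DataFrame.merge on the complaint key.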

Could you please share the featuretools code?

Is that a publicly available dataset?
