-
Notifications
You must be signed in to change notification settings - Fork 229
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Python SQL Registry #311
Python SQL Registry #311
Conversation
I have a few high level comemnts:
|
for those DerivedFeatureDef or AnchorFeatureDef classes, can we rename it so that we know it's specific to registry? Currently it's a bit confusing as there might be duplications on the existing feature definition that is defined here, vs the ones that are available in the "Feathr Core" code. |
The Dockerfile is not intent to be used by users directly, it is used by the building process, and we'll setup CI actions to publish ready-to-use docker images to some public repo like DockerHub, @jainr is working on this task. |
How would the end users use that pre-built container in this case? |
That's why I suggested to split the registry from the feathr client repo. |
Again, this part is covered by the manual that @jainr is currently working on, so let's put the comments about the deploying/running questions in that upcoming user-manual PR. |
Correct me if I'm wrong here, but seems like the registry in this PR is only to be consumed by UI (and the REST API layer). I.e. it's not ready to be consumed by the Python client (say, users cannot call For this specific PR, it should be self contained (that's why you have a dockerfile) and it's equivalent of building another smaller Purview service, I think we should move it under a folder NOT in a separate repo, but rather in the same repo but not in the python client folder. maybe create something under root folder and put it there. Or put it under the Also - this folder (https://github.com/linkedin/feathr/tree/main/feathr_project/feathr/api) should be also moved tin a similar way (i.e. not using registry service) |
Agree. Please raise an issue and let's make decision on directory structure and do a major restructuring soon, not only for this part, but also other components. |
Can we do a few things here?
|
1 similar comment
Can we do a few things here?
|
raise ValueError("Too many nested levels") | ||
if isinstance(d, str): | ||
d = d[:100] | ||
return re.sub(r'([A-Z]\w+$)', r'_\1', d).lower() |
Check warning
Code scanning / CodeQL
Polynomial regular expression used on uncontrolled data
* main: Fixing purview test issues and improve performance (#350) [feathr] Add product_recommendation advanced sample (#348) obejectId query cmd update (#360) add license, release, docs, python api ref badges with shields img (#357) quick fix the 404 not found in read me link (#355) Python SQL Registry (#311) enable JWT token param in frontend API calls (#337) Optimize environment variable behavior (#333) Adding better warning message to let user know that config file is missing and they need to set env parameters. (#347) Feature Monitoring (#330) Windoze/211 maven submission (#334) Windoze/211 maven submission (#334) Windoze/211 maven submission (#334) Fix Synapse quickstart link (#346) Show feature details when click feature in lineage graph (#339) Update pull_request_push_test.yml Update UI README for how to create overrides for local development (#335) Update databricks quick start experience (#217)
This is the SQL-based Feathr implemented in Python.
The API spec is updated to include some previously missing pieces.
The corresponding UI changes is in another PR #303 submitted by @blrchen
There are still some TODOs:
script/schema.sql
.We can talk about these tasks later, right now, please focus on the implementation itself.