-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(ingest): Glue jobs #2687
feat(ingest): Glue jobs #2687
Changes from 41 commits
334a485
8580c54
499d308
520b099
1941e93
5270fac
67c0807
3ffdb1a
bdae7c1
4acc825
68ed8e5
2fd692b
a0bc357
e4b4d64
7f0eb42
fe63ce6
218338e
5c1c9f2
5e4873a
4739367
ce58f7b
a37b1db
5050b8f
ff682a8
5144ffc
ab419a4
ecd89a6
06d6c8a
51ae22a
108a203
d9d8bc5
e4a96b4
db9d8ca
7e6333b
4026917
c436149
8619918
33dd0ea
e0eaf0c
6f7b74a
c738a74
1d89927
fe5b087
948dd24
8a681cd
95efa7e
55e6619
51bdb88
59bbf15
2a4501b
8feb874
d173d6c
1b44baf
d80f529
db1c638
8eb3d50
5b66074
9955e06
6335f95
09b9b60
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -378,7 +378,7 @@ | |
"type" : "record", | ||
"name" : "DataJob", | ||
"namespace" : "com.linkedin.datajob", | ||
"doc" : "Metadata bout DataJob", | ||
"doc" : "Metadata about DataJob", | ||
"include" : [ { | ||
"type" : "record", | ||
"name" : "DataJobKey", | ||
|
@@ -438,9 +438,10 @@ | |
"name" : "AzkabanJobType", | ||
"namespace" : "com.linkedin.datajob.azkaban", | ||
"doc" : "The various types of support azkaban jobs", | ||
"symbols" : [ "COMMAND", "HADOOP_JAVA", "HADOOP_SHELL", "HIVE", "PIG", "SQL" ], | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Let's not keep piling onto the cc @jjoyce0510 is the plan still to add a free form string? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Yeah. This can either be a freeform string or another enum with a better name. I don't have a strong preference for adding a freeform string vs a better enum |
||
"symbols" : [ "COMMAND", "HADOOP_JAVA", "HADOOP_SHELL", "HIVE", "PIG", "SQL", "GLUE" ], | ||
"symbolDocs" : { | ||
"COMMAND" : "The command job type is one of the basic built-in types. It runs multiple UNIX commands using java processbuilder.\nUpon execution, Azkaban spawns off a process to run the command.", | ||
"GLUE" : "Glue type is for running AWS Glue job transforms.", | ||
"HADOOP_JAVA" : "Runs a java program with ability to access Hadoop cluster.\nhttps://azkaban.readthedocs.io/en/latest/jobTypes.html#java-job-type", | ||
"HADOOP_SHELL" : "In large part, this is the same Command type. The difference is its ability to talk to a Hadoop cluster\nsecurely, via Hadoop tokens.", | ||
"HIVE" : "Hive type is for running Hive jobs.", | ||
|
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,9 +1,10 @@ | ||
source: | ||
type: glue | ||
config: | ||
aws_region: "us-east-1" | ||
aws_region: "us-west-2" | ||
extract_transforms: true | ||
|
||
sink: | ||
type: "datahub-rest" | ||
config: | ||
server: 'http://localhost:8080' | ||
server: "http://localhost:8080" |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What's the rationale for this change?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The previous implementation was causing a bug where nodes only had urns and types specified as they were from upstreamLineage – @gabe-lyons can elaborate!