-
Notifications
You must be signed in to change notification settings - Fork 123
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
data and scripts for amazon-google dataset #131
Conversation
Run the pyspark scripts from dirctory - examples/amazon-google/scripts training file has cluster_id and label at the end right now. trying to move them in the beginning...selection column list needs to be adjusted accordingly. |
examples/amazon-google/config.json
Outdated
@@ -0,0 +1,82 @@ | |||
{ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
why do we need two configs - dont we need just one for linking?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Removed.
It was added if somebody wants to generate own training/labelled data.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
File removal will be part of next checkin. Version from first checkin is still there.
f14e405
to
933fb69
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
what is the need for config.json?
examples/amazon-google/config.json
Outdated
}" | ||
}, | ||
{ | ||
"name": "test2", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
call it aws?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ok
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes. I've made the changes.
For other datasets, these fields may be updated accordingly.
The code was updated yesterday with some changes. Some improvements were made and some mistakes were rectified. |
Some queries/notes about changes:
|
No description provided.