-
Notifications
You must be signed in to change notification settings - Fork 251
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Example training code over oxford pet dataset #144
Conversation
e22cce3
to
fdedf66
Compare
This PR is built on top of |
Rough Benchmarks: Configuration
The Dataset / Scanner does not work well with partition pruning tho. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
couple of nits
@@ -173,6 +173,15 @@ class PlainDecoderImpl : public Decoder { | |||
return fmt::format("PlainEncoder({})", type_->ToString()); | |||
} | |||
|
|||
protected: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
just curious why protected instead of private?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This one is used by BooleanDecoder
as well.
@@ -78,8 +78,8 @@ def read_metadata(self, check_quality=False) -> pd.DataFrame: | |||
no_index = pd.Index(names.values).difference(df.filename) | |||
self._data_quality_issues["missing_index"] = no_index | |||
|
|||
# TODO lance doesn't support writing booleans yet | |||
with_xmls['segmented'] = with_xmls.segmented.astype(pd.Int8Dtype()) | |||
with_xmls['segmented'] = with_xmls.segmented.apply( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
with_xmls.segments.astype('boolean')
should work (https://pandas.pydata.org/docs/user_guide/boolean.html) no?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yup I tried, but it does not work. So the segmented is mixed with string and float NaN.
28a0c02
to
0d1b2a7
Compare
A Pytorch lighting model to train classification models over Oxford Pet dataset.