
DOC Add intelex inference example #303

Merged — 1 commit merged into skops-dev:main on Mar 3, 2023

Conversation

@ahuber21 (Contributor) commented on Feb 20, 2023:

I want to give a self-contained example of how scikit-learn-intelex speeds up model inference times. This effort was kicked off in #251, and the PR in this form is basically just meant to align the requirements.

I'm new to the discussion, so I'm including just the most basic things. Please let me know if this example is a step in the right direction.

Output from running on my machine:

$ python examples/use_intelex.py
Intel(R) Extension for Scikit-learn* enabled (https://github.com/intel/scikit-learn-intelex)
[skl] Inference took t_stock = 3.7e+00s and achieved 21.7% accuracy
[skl-ex] Inference took t_opt = 2.3e-01s and achieved 21.7% accuracy
t_stock / t_opt = 16.1
# ... more output about pushing and downloading ...
[skl] Inference took t_stock = 3.8e+00s and achieved 21.7% accuracy
[skl-ex] Inference took t_opt = 2.5e-01s and achieved 21.7% accuracy
t_stock / t_opt = 15.3
Intel(R) Extension for Scikit-learn* enabled (https://github.com/intel/scikit-learn-intelex)
[skl] Inference took t_stock = 3.8e+00s and achieved 21.7% accuracy
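The `t_stock / t_opt` ratio in the output above comes from a simple wall-clock timing pattern. A minimal, self-contained sketch of that pattern, using stand-in workloads rather than the actual `predict` calls (which are not reproduced here):

```python
from time import perf_counter


def timed(fn, *args):
    """Return (elapsed seconds, result) for a single call to fn."""
    start = perf_counter()
    result = fn(*args)
    return perf_counter() - start, result


# Stand-in workloads; in the example these would be clf.predict(X_test)
# for the stock model and clf_opt.predict(X_test) for the patched one.
t_stock, _ = timed(sum, range(2_000_000))
t_opt, _ = timed(sum, range(200_000))

print(f"t_stock / t_opt = {t_stock / t_opt:.1f}")
```

Single-run wall-clock ratios like this are noisy; the example's numbers should be read as rough orders of magnitude, not benchmarks.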

@BenjaminBossan (Collaborator) left a comment:

Thanks for adding this example. This is not an in depth review yet, just what I found on first look:

The example is not actually being run but just included as is into the docs, as can be seen here:

https://skops--303.org.readthedocs.build/en/303/auto_examples/use_intelex.html#sphx-glr-auto-examples-use-intelex-py

I'm not completely sure why that happens here but not with the other examples. Possibly this example needs to be linked explicitly somewhere in the docs? A good place would be:

https://github.com/skops-dev/skops/blob/3e1f138c5a7c12863ca263beb2804ceede07d35a/docs/examples.rst

(which would be a good addition regardless of whether that is the root of the issue or not)

In general, I think this guide would be more useful if it were made a bit more accessible, e.g. by explaining in a few words what intelex does, linking to its docs, and avoiding too many abbreviations like "skl" and "sklex".
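For context on what intelex does: the sklearnex package speeds up supported estimators by monkey-patching scikit-learn's module namespaces, so that names imported after `patch_sklearn()` resolve to optimized classes. A rough, stdlib-only illustration of that mechanism, using toy classes rather than the real sklearnex code:

```python
import types

# A fake module standing in for a sklearn submodule.
sklearn_like = types.ModuleType("sklearn_like")


class StockKNN:
    backend = "stock"


class OptimizedKNN:
    backend = "optimized"


sklearn_like.KNeighborsClassifier = StockKNN


def patch(module):
    """Stand-in for patch_sklearn(): rebind the name to the optimized class."""
    module.KNeighborsClassifier = OptimizedKNN


before = sklearn_like.KNeighborsClassifier().backend
patch(sklearn_like)
after = sklearn_like.KNeighborsClassifier().backend
print(before, "->", after)
```

This is also why the order of operations matters in the example: objects constructed before the patch keep the stock class.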

@ahuber21 (Contributor, Author) commented on Feb 22, 2023:

Hi @BenjaminBossan, thanks for the feedback! I'm happy to hear that the original suggestion was already a step in the right direction.
I made the changes you requested and also added some content to the introduction, in fixup commits that we can squash later.
Let's see if the code actually runs this time.

Let me know what else you want to see!

@BenjaminBossan (Collaborator) commented:

The example is still not building correctly, but I believe I have figured out the issue: it seems the file name needs to start with plot_. Could you please check whether that fixes it?

By the way, you can build the docs locally by going into docs/ and running make html, then opening the HTML files in _build/html.

@ahuber21 (Contributor, Author) commented:

Hi @BenjaminBossan, thanks for the tip. The latest commit worked locally, so I hope this time the example runs.

I had an interesting thought. At the bottom of my example I show that loading a pickled, non-optimized model gets no speedup, even when sklearnex is loaded. I'm curious whether that is also true for the skops.io methods. Essentially, if a new object is created, i.e. the constructor is called, the Intel optimizations should apply. It would be interesting to see if there are any compatibility issues.
I think @adrinjalali already looked into that, maybe he can comment.
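One way to see why a pickled model keeps the stock implementation: pickle records the concrete class of the object, so unpickling restores that exact class regardless of any namespace patching applied afterwards. A small stdlib-only demonstration with a toy class (not an actual estimator):

```python
import pickle


class StockModel:
    """Toy stand-in for an estimator class instantiated before patching."""

    backend = "stock"


payload = pickle.dumps(StockModel())

# Even if the defining module's namespace were patched at this point,
# the pickle stream already names StockModel, so the restored object
# comes back with the stock class.
restored = pickle.loads(payload)
print(type(restored).__name__, restored.backend)
```

Whether skops.io behaves differently would depend on whether it re-resolves class names through the (possibly patched) module at load time, which is exactly the question raised above.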

@BenjaminBossan (Collaborator) commented:

The latest commit worked locally so I hope this time the example runs.

The doc build is failing, but it's still progress :) The trouble now is that building the docs has a dependency on sklearnex, which is not installed. Could you please add it to the dependencies as a "doc" dependency?

I'm curious if that is also true for skops.io methods. Essentially, if a new object is created, i.e. the constructor is called, Intel optimizations should apply.

I think this will depend on how exactly the patching works, which I don't know. Ideally, there would be an easy way to check for each estimator whether it was patched, something like hasattr(estimator, "_uses_intelex") or sklearnex.check_is_patched(estimator). I believe this would be really useful.
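Neither `hasattr(estimator, "_uses_intelex")` nor `sklearnex.check_is_patched` existed at the time of this comment; both names are hypothetical. One heuristic that needs only plain introspection is to look at the module that defines the instance's concrete class, on the assumption that patched classes live under sklearnex/daal4py module paths. A sketch with fake classes, not real estimators:

```python
def looks_patched(estimator) -> bool:
    """Heuristic: does the instance's concrete class come from sklearnex?

    Assumes patched classes are defined in sklearnex or daal4py modules,
    which may not hold for every release.
    """
    module = type(estimator).__module__
    return module.startswith(("sklearnex", "daal4py"))


# Fake classes whose __module__ mimics stock vs. patched estimators.
class FakeStock:
    pass


FakeStock.__module__ = "sklearn.neighbors"


class FakePatched:
    pass


FakePatched.__module__ = "sklearnex.neighbors"

print(looks_patched(FakeStock()), looks_patched(FakePatched()))
```

As noted later in the thread, sklearnex eventually shipped a proper API for this (`is_patched_instance`), which should be preferred over any such heuristic.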

@ahuber21 (Contributor, Author) commented on Feb 23, 2023:

Progress indeed. The sklearnex.check_is_patched(estimator) suggestion is very good. I'm wondering if we already have something like this. Let me ping @napetrov, maybe he knows more.

Edit: There is an unreleased function that allows you to see the global patching status in the latest main branch, but nothing to investigate model instances. I've created a feature request.

@ahuber21 (Contributor, Author) commented:

By the way, the pipeline now complains about the missing HF token. I followed the approach that I found in plot_hf_hub.py, so I think the error is nothing new. How is this expected to work?

@BenjaminBossan (Collaborator) left a comment:
Thanks Andreas for adding this example. I have now given it a more thorough review. Overall it looks good, but I think a few things could be improved further. Please take a look at my comments.

By the way, the pipeline now complains about the missing HF token. I followed the approach that I found in plot_hf_hub.py, so I think the error is nothing new. How is this expected to work?

Yes, that's unfortunately expected, see #47. As soon as this is merged, it should work, as it would use the HF token set up for this repo. When I tested this locally with my token, it worked.

@ahuber21 (Contributor, Author) commented:

Hi Benjamin. Thanks! I hope all your comments are addressed with the latest commit.

@adrinjalali (Member) left a comment:

Thanks for the PR @ahuber21 , it's in a pretty good shape.

I left a few minor comments. I haven't checked that the line lengths are correct. You don't need to "accept suggestion" here; you can take the texts and apply them the way you like.

Comment on lines 84 to 103
from sklearn.neighbors import KNeighborsClassifier

clf = KNeighborsClassifier(3)
clf.fit(X_train, y_train)

# %%
# Training the optimized model
# ============================
# We apply ``patch_sklearn()`` and reimport the model to load the patched
# version. A message is shown, telling us that Intel(R) Extension for
# Scikit-learn* has been enabled.
patch_sklearn()
from sklearn.neighbors import KNeighborsClassifier

clf_opt = KNeighborsClassifier(3)
clf_opt.fit(X_train, y_train)
@adrinjalali (Member) commented:
I think in a script this is very confusing. It'd be nicer to explicitly import the class from sklearnex so it's clear which one is being used, and to leave a comment that users don't have to change their imports: they only need to add the patch call at the top of the file, before the imports run.

We can also here show training times and compare them.

@ahuber21 (Contributor, Author) replied:

I changed the imports. Training for kNN takes <1 s, and in many cases sklearnex comes with a few percent of overhead. We're working on that, but for now I don't think adding training times would help.

@adrinjalali (Member) replied:

In that case it does help to add it, since right now we're only selectively showing data, not the whole truth, and that's not what we wanna do here.

@ahuber21 (Contributor, Author) replied on Mar 1, 2023:

I have added a perf_counter for the fit stage for completeness. On my machine both fit times were ~0.05 s. The numbers are printed, but doing anything else with them (like calculating ratios) won't help unless we do multiple runs. I hope you're fine with how it is now. Let me know :)

@ahuber21 ahuber21 force-pushed the feat-sklearnex-example branch from 2dce17e to 0dd8640 Compare February 28, 2023 12:38
@adrinjalali (Member) commented:

The build is failing; once you're done with the edits, please ping me for a second review.

@ahuber21 (Contributor, Author) commented on Mar 1, 2023:

I don't quite understand what happened with the Ubuntu tests; to me the failure looked unrelated. Anyway, I just pushed an update without the "Download model and re-evaluate" section, so there will be a new run.
From my side, I've added everything. Let me know what you think @adrinjalali

@ahuber21 ahuber21 force-pushed the feat-sklearnex-example branch from 9471ca6 to 1eb5d5f Compare March 1, 2023 08:50
@adrinjalali (Member) left a comment:

Other than the nits, LGTM. WDYT @BenjaminBossan

@ahuber21 ahuber21 force-pushed the feat-sklearnex-example branch from 8fc0c67 to bad02d2 Compare March 1, 2023 17:08
@adrinjalali changed the title from "FEAT: Add intelex inference example" to "DOC Add intelex inference example" on Mar 2, 2023
@ahuber21 ahuber21 force-pushed the feat-sklearnex-example branch from bad02d2 to e73cfb9 Compare March 2, 2023 11:47
@ahuber21 ahuber21 force-pushed the feat-sklearnex-example branch from e73cfb9 to e047a1d Compare March 2, 2023 17:29
@ahuber21 (Contributor, Author) commented on Mar 2, 2023:

Force-pushed again because the intro still said that we download the models again. Nothing else changed.

@adrinjalali (Member) commented:

@ahuber21 we squash and merge anyway, please avoid force pushing since it makes reviewing changes harder :)

@BenjaminBossan (Collaborator) left a comment:

Yes, looks good, thanks Andreas

@BenjaminBossan BenjaminBossan merged commit ffe3ea7 into skops-dev:main Mar 3, 2023
@ahuber21 ahuber21 deleted the feat-sklearnex-example branch March 3, 2023 09:48
@ahuber21 (Contributor, Author) commented:

I think this will depend on how exactly the patching works, which I don't know. Ideally, there would be an easy way to check on each estimator if it was patched, something like hasattr(estimator, "_uses_intelex") or sklearnex.check_is_patched(estimator), I believe this would be really useful.

@BenjaminBossan this feature is now added in the latest release.

import pickle

from sklearnex import is_patched_instance

my_instance = pickle.load(open("some_file.pkl", "rb"))
is_patched_instance(my_instance)  # True if it was trained using sklearnex, False otherwise

@BenjaminBossan (Collaborator) commented:

Nice addition, thx for letting us know.
