[FEA] Simplify process to train cuml KMeans on GPU and save the model and later load on a CPU machine for inference #3626
This code seems to be working:
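(The exact snippet is not reproduced here; below is a minimal sketch of the approach discussed in this thread, assuming the model is fitted on NumPy input so cuML's fitted attributes come back as NumPy arrays. The hyperparameters, the data, and the kmeans_cpu.pkl file name are placeholders.)

```python
# Sketch: fit KMeans with cuML on the GPU, copy the fitted attributes into a
# scikit-learn KMeans, then pickle that estimator so it can be used on CPU.
import pickle

import numpy as np
from cuml.cluster import KMeans as cuKMeans
from sklearn.cluster import KMeans as skKMeans

X = np.random.rand(10_000, 16).astype(np.float32)  # placeholder training data

# Train on the GPU with cuML
cu_model = cuKMeans(n_clusters=8, random_state=42)
cu_model.fit(X)

# Copy the fitted state into an (unfitted) scikit-learn estimator
sk_model = skKMeans(n_clusters=8)
sk_model.cluster_centers_ = np.asarray(cu_model.cluster_centers_, dtype=np.float64)
sk_model.labels_ = np.asarray(cu_model.labels_)
sk_model.n_features_in_ = sk_model.cluster_centers_.shape[1]
sk_model._n_threads = 1  # scikit-learn's predict() crashes if this is unset

# Persist a model that only needs scikit-learn to load
with open("kmeans_cpu.pkl", "wb") as f:
    pickle.dump(sk_model, f)
```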
Also see sklearn's Model persistence page.
@viclafargue Thank you, this seems to be a really interesting trick. Is there any disadvantage to doing this? Also, why do we need to set `_n_threads`?
I don't see any disadvantage apart from the fact that this method may not work with every estimator. Note that if you're only interested in storing your trained cuML estimator, it is possible to persist it with pickling. It will then be redeployed to the GPU, allowing faster predictions/transformations.
This is something specific to Scikit-Learn's KMeans code. It needs to be set to avoid a crash during prediction. In my understanding, it is used to set the number of threads used by OpenMP.
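(To illustrate the pickling option mentioned above, a minimal sketch of persisting the cuML estimator itself; the kmeans_gpu.pkl file name and the data are placeholders, and the machine that loads the file needs a GPU with cuML installed.)

```python
# Sketch: pickle the trained cuML estimator directly; prediction stays on the GPU.
import pickle

import numpy as np
from cuml.cluster import KMeans as cuKMeans

X = np.random.rand(10_000, 16).astype(np.float32)  # placeholder data
cu_model = cuKMeans(n_clusters=8).fit(X)

with open("kmeans_gpu.pkl", "wb") as f:
    pickle.dump(cu_model, f)

# Later, on a machine that also has a GPU and cuML available:
with open("kmeans_gpu.pkl", "rb") as f:
    restored = pickle.load(f)

labels = restored.predict(X)  # runs on the GPU
```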
@viclafargue Thanks for clarifying. This saved hours of re-training with scikit-learn's KMeans implementation. I think there should be a way to do this directly in cuML, since not everyone uses GPUs in their production environment for inference. Is there a way I could turn this post into a feature request?
@John-8704 turning it into a feature request would be very welcome
@dantegd I have edited the post, I hope that would suffice. I guess someone should change the labels attached to this post. |
We will consider this a feature request for simplifying this process in a future release (and documenting it better). Thank you for filing!
I am trying to use KMeans in cuML for fitting the data, but I want to do inference/prediction on a CPU. Is that possible somehow? I really need a way to predict on CPU. Please help.
EDIT:
I feel like this would be a useful feature for the community: since training and tuning are the more resource-intensive steps, using a GPU there makes sense, but for inference a CPU machine should do a decent job in production.
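(For completeness, a minimal sketch of the CPU side, assuming a model was converted and saved as kmeans_cpu.pkl as in the snippet above; inference then needs only scikit-learn, no GPU or cuML.)

```python
# Sketch: load the converted scikit-learn model on a CPU-only machine and predict.
import pickle

import numpy as np

with open("kmeans_cpu.pkl", "rb") as f:
    model = pickle.load(f)  # a plain scikit-learn KMeans

X_new = np.random.rand(10, 16)      # placeholder inference data
labels = model.predict(X_new)       # cluster index per row
distances = model.transform(X_new)  # distance to each cluster center
```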