multi label support in Scala xgboost. #10121

Closed
aniruddhghatpande opened this issue Mar 13, 2024 · 8 comments

Comments

aniruddhghatpande commented Mar 13, 2024

I see support for multi-label objective functions in Python: https://xgboost.readthedocs.io/en/release_2.0.0/python/examples/custom_softmax.html. Is there any support for this in the Scala library, or is it currently available only in Python? I am exploring multi-label objective support for the Scala package of XGBoost.

Also, if we want to implement this in the XGBoost JVM wrapper, would updating the LabeledPoint class be the only touch point in the wrapper, or would I also need to make changes to the core XGBoost engine?

trivialfis (Member) commented

cc @wbo4958

wbo4958 (Contributor) commented Mar 14, 2024

Yes, it seems doable, but it may be somewhat complicated, since it could touch many parts of the JVM packages. Anyway, please give it a try.

aniruddhghatpande (Author) commented

Could you confirm whether this would require any changes to the core XGBoost library? If I add more than one label in the JVM package, would that work smoothly, or would it require further changes in the JVM packages?

wbo4958 (Contributor) commented Mar 15, 2024

I think it requires more changes in the JVM packages.

aniruddhghatpande (Author) commented

How can I find the Python changes so that I can mirror them in Scala? Also, where can I find documentation on the internal implementation of multi-label support in XGBoost? How does XGBoost optimize for all labels in a target?

wbo4958 (Contributor) commented Mar 15, 2024

At present, neither the XGBoost PySpark package nor the XGBoost JVM package supports multi-label training, so there is no reference implementation for this functionality.

trivialfis (Member) commented

We have a native Python implementation that is not yet available for distributed systems. Feel free to look into the sklearn interface for the multi-label classifier.
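
As a rough illustration of what optimizing "all labels in a target" can look like, the sketch below computes the logistic-loss gradient and Hessian independently for every (sample, label) pair of a 2D label indicator matrix. This is a pure-Python sketch under stated assumptions, not XGBoost's actual code: the helper names `sigmoid` and `multilabel_logistic_grad_hess` are hypothetical, and the inputs are toy values.

```python
import math


def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def multilabel_logistic_grad_hess(margins, labels):
    """Elementwise logistic gradient and Hessian (hypothetical helper).

    margins, labels: lists of rows, one row per sample and one column per
    label. Every (sample, label) cell is treated as its own binary problem.
    """
    grad, hess = [], []
    for margin_row, label_row in zip(margins, labels):
        probs = [sigmoid(m) for m in margin_row]
        grad.append([p - y for p, y in zip(probs, label_row)])
        hess.append([p * (1.0 - p) for p in probs])
    return grad, hess


# Toy data: 2 samples, 3 labels, all raw scores (margins) start at 0.
margins = [[0.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
labels = [[1, 0, 1], [0, 0, 1]]
grad, hess = multilabel_logistic_grad_hess(margins, labels)
```

Each column of `grad` and `hess` could then drive its own tree in a boosting round, which is one way a library can reuse the same feature data across labels. Whether the JVM packages could pass such a label matrix through `LabeledPoint` is exactly the open question in this thread.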

trivialfis (Member) commented

Closing in favor of #9043.
