Replies: 1 comment
https://saturncloud.io/blog/python-machine-learning-labels-and-features-a-guide-for-data-scientists/
This cleared things up.
(I am just starting to learn ML in general, and this library in particular, so this question may well be moot once you're more "in the know" about the material, but I just can't seem to find anything that helps me.)
I don't quite understand how DataSets work here. I understand they represent the training or test data and, since in ML we operate with numbers and calculations, all data is effectively just numbers.
However, reading the source and documentation, I always come across this notion of "labels" in DataSets, most notably in the `DataSet` class itself, where methods like `getY(idx: Int): Float` exist. But to me, `label` as a word has always meant something like a tag attached to something. What "number" is this supposed to be?

If I have training data, like rows in a CSV file, and wanted to build a model for it, what is the "label" here? And what should `fun getX(idx: Int): FloatArray` return?

Again, maybe this is entirely obvious when you have used other ML libraries before, but to me none of this is self-explanatory enough. Or did I miss some documentation?
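
For what it's worth, in most ML material a "label" is the target value the model should learn to predict for a row, and the "features" are the input numbers used to predict it. Here is a minimal Kotlin sketch of that split. The CSV columns, the `TinyDataset` class and the `datasetFromCsv` helper are all made up for illustration; only the `getX`/`getY` signatures come from the question, and this is not the library's actual `DataSet` implementation.

```kotlin
// Hypothetical CSV content: the column names and values are invented for illustration.
// The last column is treated as the label (what the model should predict).
val csv = """
    height_cm,weight_kg,is_adult
    120.0,25.0,0
    175.0,70.0,1
    160.0,55.0,1
""".trimIndent()

// Stand-alone sketch, NOT the library's DataSet class.
// It only mirrors the getX/getY signatures quoted in the question.
class TinyDataset(
    private val features: Array<FloatArray>,
    private val labels: FloatArray
) {
    // Number of rows (examples) in the dataset.
    fun xSize(): Int = features.size

    // Features of row idx: the input numbers the model sees.
    fun getX(idx: Int): FloatArray = features[idx]

    // Label of row idx: the single target number the model should output.
    fun getY(idx: Int): Float = labels[idx]
}

// Hypothetical helper: splits each CSV row into features (all but the last
// column) and a label (the last column).
fun datasetFromCsv(csvText: String): TinyDataset {
    val rows = csvText.lines()
        .drop(1)                          // skip the header line
        .filter { it.isNotBlank() }
        .map { line -> line.split(",").map { it.trim().toFloat() } }
    val features = rows.map { it.dropLast(1).toFloatArray() }.toTypedArray()
    val labels = rows.map { it.last() }.toFloatArray()
    return TinyDataset(features, labels)
}

fun main() {
    val data = datasetFromCsv(csv)
    println(data.getX(1).toList()) // [175.0, 70.0]
    println(data.getY(1))          // 1.0
}
```

For a classification task the label is usually a class index stored as a number (here 0 or 1); for regression it would be the numeric value to predict.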