
Implementing KNN classifier with Inference #583

Closed
sapan opened this issue Jan 31, 2023 · 3 comments
Labels
question A general question about the library

Comments

sapan commented Jan 31, 2023

Excellent library, Kevin - thanks a lot.

In my hyperparameter search, I am trying to perform KNN classification with query labels in my 'test' set and reference labels in my 'train' set. I thought of doing the following:

  1. Add a custom accuracy metric by subclassing AccuracyCalculator
  2. Use splits_to_eval to include {'test': 'train'}
    I thought that with a custom function and splits_to_eval I would effectively get a KNN classifier. However, this is not going to work, because the accuracy calculator class compares the class labels of query and reference embeddings -- it does not predict a class label for each query. Is this correct?

The InferenceModel class has all the required ingredients for doing KNN, I believe. It would really help if you could provide some idea of how to go about this. (I think having a KNNClassifier in a future version would be a really good feature.)

@KevinMusgrave (Owner)

I'll come back to this tomorrow, but I just wanted to say that it might be worth trying scikit-learn's KNeighborsClassifier if you haven't already. You just need to convert your tensors to numpy.
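For example, something along these lines (train_embeddings, train_labels, and test_embeddings are placeholders for tensors you have already computed):

```python
from sklearn.neighbors import KNeighborsClassifier

# Convert the embedding tensors to numpy arrays for scikit-learn.
# The train/test embedding and label tensors are placeholders for
# whatever you have already computed.
train_X = train_embeddings.cpu().numpy()
train_y = train_labels.cpu().numpy().ravel()
test_X = test_embeddings.cpu().numpy()

# Fit on the reference (train) embeddings, then predict labels
# for the query (test) embeddings.
knn = KNeighborsClassifier(n_neighbors=5)
knn.fit(train_X, train_y)
predicted = knn.predict(test_X)
```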

sapan commented Jan 31, 2023

Thanks for the suggestion. Let me try that.
I am thinking of first calling base_tester.get_all_embeddings(dataset, trunk, embedder, collate_fn, ...) to compute embeddings for both the train and test sets, and then using those in sklearn.

sapan commented Jan 31, 2023

This is working for me. I am adding the overall flow here in case it helps others (a consolidated sketch follows the list).

  1. trainer.train()  # the trainer class in pytorch-metric-learning
  2. best_trunk_model = load the best trunk model from 'example_saved_models' (the checkpoint whose filename matches the 'best' pattern)
  3. tester = GlobalEmbeddingSpaceTester()
  4. KNN classification:
    4.1 Compute embeddings for the train and test data by calling, for each split,
    embeddings, labels = tester.get_all_embeddings(dataset, best_trunk_model, embedder, collate_fn, result_as_numpy=True)
    labels = labels.reshape(labels.shape[0])
    4.2 from sklearn.neighbors import KNeighborsClassifier
    from sklearn.metrics import accuracy_score
    knnmod = KNeighborsClassifier(n_neighbors=20, weights='distance')
    knnmod.fit(train_embs, train_labels)
    predicted = knnmod.predict(test_embs)
    accuracy_score(test_labels, predicted)
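Putting the steps above together, here is a consolidated sketch. The train/test dataset variables, the trunk and embedder models, and the collate_fn are placeholders for whatever you already have from training, and the get_all_embeddings call follows the usage above -- double-check the exact signature against your installed version.

```python
from sklearn.neighbors import KNeighborsClassifier
from sklearn.metrics import accuracy_score
from pytorch_metric_learning import testers

tester = testers.GlobalEmbeddingSpaceTester()

# Compute embeddings for the reference (train) and query (test) splits.
# train_dataset, test_dataset, best_trunk_model, embedder, and collate_fn
# are placeholders; the argument order follows the call used above.
train_embs, train_labels = tester.get_all_embeddings(
    train_dataset, best_trunk_model, embedder, collate_fn, result_as_numpy=True
)
test_embs, test_labels = tester.get_all_embeddings(
    test_dataset, best_trunk_model, embedder, collate_fn, result_as_numpy=True
)

# Flatten the label arrays from shape (N, 1) to (N,).
train_labels = train_labels.reshape(train_labels.shape[0])
test_labels = test_labels.reshape(test_labels.shape[0])

# Fit a KNN classifier on the train embeddings and score it on the test split.
knnmod = KNeighborsClassifier(n_neighbors=20, weights='distance')
knnmod.fit(train_embs, train_labels)
predicted = knnmod.predict(test_embs)
print(accuracy_score(test_labels, predicted))
```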
