Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sparse matrix support #12

Closed
lmcinnes opened this issue Nov 13, 2017 · 2 comments
Closed

Sparse matrix support #12

lmcinnes opened this issue Nov 13, 2017 · 2 comments

Comments

@lmcinnes
Copy link
Owner

In principle a distance function could take sparse vectors and thus allow UMAP to take sparse matrices as input. This would allow for much higher dimensional data (NLP related data for example) to be handled by UMAP.

@ensonario
Copy link

This would be an amazing feature!

@lmcinnes
Copy link
Owner Author

It should now be working. Some distance measures won't work for sparse data (Haversine and Yule are the major ones). You will also need to write different custom metrics for the sparse case, and I haven't enabled that as an option just yet (although it should be an easy change -- I just want to be able to verify the function is valid, and I need to know how to get that info out of numba).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants