A Java implementation of doc2vec (Le and Mikolov, ICML 2014), based on https://github.com/NLPchina/Word2VEC_java.
For a usage example, see src/test/Doc2VecTest.java.
Requires Java 7 or above; this project was developed with Java 8. The input file should follow the format of file/amazon_docs.txt: one document per line.
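To illustrate the one-document-per-line format, here is a minimal sketch of how such a file could be turned into token lists. `DocumentLoader` and `parse` are hypothetical names and not part of this project, and splitting on whitespace is an assumption; the project's own tokenization may differ.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper (not part of this project): treats each line as one document.
class DocumentLoader {

    // Converts lines into documents, where a document is a list of tokens.
    // Assumption: tokens are separated by whitespace; blank lines are skipped.
    static List<List<String>> parse(Iterable<String> lines) {
        List<List<String>> docs = new ArrayList<>();
        for (String line : lines) {
            String trimmed = line.trim();
            if (trimmed.isEmpty()) {
                continue; // a blank line carries no document
            }
            docs.add(Arrays.asList(trimmed.split("\\s+")));
        }
        return docs;
    }

    public static void main(String[] args) {
        // Sample lines standing in for the contents of an input file.
        List<String> lines = Arrays.asList(
                "great phone with long battery life",
                "",
                "the book arrived late");
        List<List<String>> docs = parse(lines);
        for (int i = 0; i < docs.size(); i++) {
            System.out.println("doc " + i + ": " + docs.get(i));
        }
    }
}
```

With a real file, the lines could come from `java.nio.file.Files.readAllLines(path, StandardCharsets.UTF_8)`, which is available from Java 7 onward. Note that this sketch numbers documents by their position after blank lines are dropped, which may not match the line numbers in the file.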
- Le, Quoc V., and Tomas Mikolov. "Distributed representations of sentences and documents." In International Conference on Machine Learning, pp. 1188-1196. 2014.
- Mikolov, Tomas, Ilya Sutskever, Kai Chen, Greg S. Corrado, and Jeff Dean. "Distributed representations of words and phrases and their compositionality." In Advances in Neural Information Processing Systems, pp. 3111-3119. 2013.