-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Semisup example uses 1000 instead of 100 samples #3
Comments
I ran the code with the default value in the README just now, and I confirmed that it achieves ~2.0 % test error rate with 100 labeled examples, which is the almost same as the published results in https://arxiv.org/abs/1507.00677. If you mean the results published in the recent paper on arXiv https://arxiv.org/abs/1704.03976, we use different network architecture (784-1200-600-300-150-10), and also add gaussian noise( mean=0, stddev=0.5) on each hidden layer following https://arxiv.org/abs/1606.03498. The optimal epsilon on this setting is epsilon=5.0. Please see Appendix C in https://arxiv.org/abs/1704.03976. I thought that many people are not interested in the results on MNIST and I did not upload the code for MNIST at https://github.com/takerum/vat_tf, but it is OK to upload the code for MNIST if you cannot reproduce by yourself. |
Thank you so much for your reply! And thanks for providing code examples in different frameworks! It really helps understanding your method. As I understand the code, this happens: In the README you don't specify the number of labeled samples. The script uses the default value of 1000 as defined in train_mnist_semisup.py. If I add --num_labeled_samples=100 I get much different and worse results. Could you take a look? |
Ah, that's right, I will take a look at the code again and I will let you know the update here. |
Hi, I ran with 100 labeled examples and I got 2.08% test error rate. |
I run this: Here is what I get. What does your test error look like after 20 epochs?
(also on theano 0.9.0) |
I am reasonably confident that the epsilon is wrong. |
I am sorry for the late reply. |
I did indeed run the code on CPU. Maybe there is some bug or strange behaviour in Theano? |
I am not sure... anyway, thanks for reporting the results on CPU! |
The semisupervised MNIST example uses the default value of 1000 labeled samples instead of 100 labeled samples (as stated in the README). The given hyperparameters also only work for the 1000 sample case.
Could you give details on hyperparameters for the 100 sample setting?
The text was updated successfully, but these errors were encountered: