Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Outputs constant: _EOS #15

Open
Lldenaurois opened this issue Jun 8, 2017 · 3 comments
Open

Outputs constant: _EOS #15

Lldenaurois opened this issue Jun 8, 2017 · 3 comments

Comments

@Lldenaurois
Copy link

Hi there,

I implemented your code in Tensroflow r1.1 and I am able to train the entire model.

When I then attempt to get a sample, I simply get a constant output [ 2, 2, 2, 2, 2 ]

This means that the trained model outputs _EOS every time.

Any ideas as to why this is happening?

@domerin0
Copy link
Owner

domerin0 commented Jun 8, 2017

Hello,

How long did you train it for? This is something I had before. I found it occurs during early training stages. I found the EOS token to disappear (mostly) after about 24 hours on a Titan X. Though I suspect to train a 'good' model requires significantly more training.

@Lldenaurois
Copy link
Author

Hey man,

Thanks for the response. I trained for less than 12 hours.

I wonder if maybe there's a tiny bug in your code, where you maybe end each conversation with a newline. This would lead the model to have these examples where outputting the empty sentence was most likely. I can look into it!

Great work on the repo though!

@domerin0
Copy link
Owner

domerin0 commented Jun 9, 2017

That's very possible there is a bug like that. If you want to look into it and find a bug, feel free to submit a pr.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants