NaN problem fix by aregic · Pull Request #3 · camigord/Neural-Turing-Machine

aregic · 2017-08-14T20:37:27Z

The problem was the sharpening function in memory.py which had a division with no guarantee that the denominator is not zero.

I have tested it by starting to learn from a checkpoint which was very close to outputting NaNs (it always happened in a couple of hundred iterations) and with this fix it worked fine for 300k more iterations.

Of course the result of the loss function changed a little at first, so this is not a 100% guarantee, but I'm confident this patch removes one possibility for the NaN problem.

The problem was the sharpening function in memory.py which had a division with no guarantee that the denominator is not zero. I have tested it by starting to learn from a checkpoint which was very close to outputting NaNs (it always happened in a couple of hundred iterations) and with this fix it worked fine for 300k more iterations. Of course the result of the loss function changed a little at first, so this is not a 100% guarantee, but I'm confident this patch removes one possibility for the NaN problem.

camigord

Thanks a lot for the fix and sorry for the huge delay... I would like to approve the pull request, but I would like to keep the current README given that the problem may still not be solved. Could you send a new pull request with only the changes in memory.py?

camigord requested changes Nov 17, 2017

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NaN problem fix#3

NaN problem fix#3
aregic wants to merge 1 commit into
camigord:masterfrom
aregic:NaN_fix

aregic commented Aug 14, 2017

Uh oh!

camigord left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

aregic commented Aug 14, 2017

Uh oh!

camigord left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants