v2 colab: Monitoring runs when no logs or tensorboard working? #202
Replies: 3 comments 2 replies
-
Here's a picture of the It seems that TF events are instead being written inside the checkpoints directory instead of the logs direcotory, but there's no way to point Tensorboard to this new directory once it's started. |
Beta Was this translation helpful? Give feedback.
-
Ok, so @hexorcismos answered my tweet question, saying
Which means you can't specify any directory for So, I'm going to modify the notebook so that you specify Will post a link to my changed notebook after I verify that it works. This has a couple lines specific to my dataset, but more generally:
|
Beta Was this translation helpful? Give feedback.
-
...also wondering when to expect the first checkpoint to be written? I'm now at 200 epochs and still no checkpoints. Edit: Ok, apparently the default is every 10k steps. Now that I've passed that threshold, I see a checkpoint file. |
Beta Was this translation helpful? Give feedback.
-
Hi, I'm using @hexorcismos' RAVE v2 training notebook. I created a directory called
/content/logs
and specified that as my logs directory.But during training, nothing ever seems to get written to that directory. Consequently tensorboard never shows anything.
How are we to monitor our runs, and/or decide when to stop training? Thanks.
Beta Was this translation helpful? Give feedback.
All reactions