-
Notifications
You must be signed in to change notification settings - Fork 394
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
start: use get-started-experiments
for GS:Experiments tutorial
#2497
Conversation
There are some conflicts, also could we now close #2491 ? |
@jorgeorpinel please note, that we probably should be reviewing this - it's already a branched version. |
47c9cb7
to
ed386eb
Compare
get-started-experiments
for GS:Experiments tutorialget-started-experiments
for GS:Experiments tutorial
I think it may look better to use a terminal with the identical background-color when taking the screenshots. Is there a predefined color scheme for code blocks? (And also the fonts.) Could you point me the exact color/font styles for the code blocks? @julieg18 @rogermparent |
…links, selected experiment changed
@iesahin Looks nice! from this screenshots I like the one that is not bold. No strong opinion here :) |
@@ -192,29 +271,26 @@ Storage, HTTP, HDFS, etc.). The Git remote is often a central Git server | |||
|
|||
</details> | |||
|
|||
Experiments that have not been made persistent will not be stored or shared |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
^^^^ a few lines about - fix code block, lines should start with $
I had to locate away from my Mac on the weekend. I'm taking the screenshots with Gnome Terminal but it doesn't look as good. This is probably about horizontal spacing config in iTerm. I used 0.95 or 0.90, but Gnome Terminal accepts only >1. Let's keep the current screenshots for the time being and I'd update theem in a later iteration. |
results" at the end.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good, and most of my comments are minor grammatical suggestions.
I have two broad questions:
- Is the new baseline experiment necessary? I think the way you introduce this is better than the current approach, especially because it introduces
dvc exp show
earlier, but it doesn't seem well integrated with the other sections now (creates lots of duplication). I'd suggest either leaving this out for a separate PR or else making bigger changes to remove content from the existing doc. - Can we reduce the repo complexity? The repo looks great, but if we now have the flexibility of having a repo specifically for this doc, I'd prefer we keep it as simple as possible because it makes the example commands, tables, etc. easier to read, and it's easier for user to look through the contents of the repo.
I feel introducing You may be right about repeating
At first I wanted to set up many parameters to make a wiggle room for us and for the user. I think it's now safe to remove most of the parameters from The code still will read the params from |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I feel introducing dvc exp run at first with -S is too much. IMHO we need to tell that dvc exp run is a replacement to dvc repro, but doing this with --set-param is a bit distracting. I can remove that section but it looks like we're delving too early into parameters, before telling exp run in general.
You may be right about repeating exp run and exp show, over and over, but we are telling experimentation here, and experimenting usually involves such repeats with some minor changes.
I put a couple more comments on sections where I think the duplication doesn't make sense.
The code still will read the params from params.yaml, but they won't be considered as a dependency. Is this OK?
You mean you will change dvc.yaml
but leave the rest of the repo the same?
Thank you. I began before that link was there and probably lost it in a rebase. Of course it should be there. |
``` | ||
|
||
Then you can `dvc pull` to get the dataset and run the commands in this | ||
document. For detailed information on parameters and the project structure | ||
please refer to the | ||
[project repository](https://github.com/iterative/get-started-experiments) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
``` | |
Then you can `dvc pull` to get the dataset and run the commands in this | |
document. For detailed information on parameters and the project structure | |
please refer to the | |
[project repository](https://github.com/iterative/get-started-experiments) | |
$ dvc pull | |
``` | |
> See `dvc pull` for more details. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Review based on the intro: I think we need to simplify the explanations, avoid low-level details and too many references to the example repo. It looks a bit like this doc is being rewritten to showcase the example repo but it should be the other way around: docs come first, make up command output if needed, and later update the example repo to match (enough). I don't even think it has to match 100%
Sorry if some of this overlaps with feedback from @dberenbaum. There's lots of unresolved comments, which may mean a) need to mark them resolved or b) the changes are too much. Maybe some are out of scope?
Some specific feedback and suggestions 👇
In order to run a baseline experiment with the default parameters defined in | ||
`params.yaml`: | ||
|
||
```dvc | ||
$ dvc exp run | ||
'data/fashion-mnist/raw.dvc' didn't change, skipping | ||
Stage 'prepare' didn't change, skipping | ||
Stage 'preprocess' didn't change, skipping | ||
Stage 'train' didn't change, skipping | ||
Stage 'evaluate' didn't change, skipping | ||
``` | ||
|
||
This resembles `dvc repro`. However, when using `dvc repro` we need to update | ||
`params.yaml` manually, run the pipeline, if the results are worth it commit | ||
them to DVC and Git. `dvc exp` automates this process through its subcommands. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nothing substantial seems to happen here? I'd either skip dvc pull
in the preparation <details>
and have some result here, or just don't add all this. Probably the latter (in agreement with #2497 (review)), as it seems out of scope for this PR. Also, GS != e2e tutorial. Can't cover everything
In order to run a baseline experiment with the default parameters defined in | |
`params.yaml`: | |
```dvc | |
$ dvc exp run | |
'data/fashion-mnist/raw.dvc' didn't change, skipping | |
Stage 'prepare' didn't change, skipping | |
Stage 'preprocess' didn't change, skipping | |
Stage 'train' didn't change, skipping | |
Stage 'evaluate' didn't change, skipping | |
``` | |
This resembles `dvc repro`. However, when using `dvc repro` we need to update | |
`params.yaml` manually, run the pipeline, if the results are worth it commit | |
them to DVC and Git. `dvc exp` automates this process through its subcommands. |
That said, some of the last paragraph, summarized as a single sentence, could potentially be preserved in the previous paragraph, or in the one after the code block below. Or maybe as a note (md block quote).
$ dvc exp show --include-metrics categorical_accuracy \ | ||
--include-params model.name,model.cnn.conv_units |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
make the commands and tables easier to read
Yeah. I'd just use plain exp show
here and modify the sample output manually as needed. Later update the example repo. Again, docs writing should drive this process I think.
But actually, I don't think exp show
would display anything here? No experiments have been run. @iesahin this change in the story also seems out of scope TBH (from line 75 to 86).
Co-authored-by: Dave Berenbaum <[email protected]>
Co-authored-by: Restyled.io <[email protected]>
Logistics. Please:
Thanks |
My take:
|
I think we will want to show some significant experiment results in an experimentation document. With |
yes, but the first priority should be to show the possibility of this thing. We might btw rewrite the text a bit and when we do bigrams we can them first with |
We can close this in favor of #2574 now. |
This one is the branched version of #2491
Closes #2479
It doesn't contain any content changes but updates the commands with the new project in github.com/iterative/get-started-experiments .
The tables are currently
code blocks
but they will be updated with the screenshots instead.