-
Notifications
You must be signed in to change notification settings - Fork 394
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
guide: expand Experiments guide #2654
Changes from all commits
5008c55
ff85352
923040f
3ae85e5
29b17b2
e21fef4
c21dbe3
d8f2d7c
1337453
8d93521
90f3042
fb4663c
d1422b1
532df56
0581991
e6d4eca
ec2ac41
2e4a512
e4f4024
581a9a9
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,11 +1,12 @@ | ||
--- | ||
name: Experiment | ||
match: [experiment, experiments] | ||
match: [experiment, experiments, 'DVC experiments'] | ||
tooltip: >- | ||
An attempt to reach desired/better/interesting results during data pipelining | ||
or ML model development. DVC is designed to help [manage | ||
experiments](/doc/start/experiments), having [built-in | ||
mechanisms](/doc/user-guide/experiment-management) like the | ||
[run-cache](/doc/user-guide/project-structure/internal-files#run-cache) and | ||
the `dvc experiments` commands (available on DVC 2.0 and above). | ||
the [`dvc experiments`](/doc/command-reference/exp) commands (available on DVC | ||
2.0 and above). | ||
--- |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,36 @@ | ||
## DVC Experiments | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. it should be clear that those are DVC experiments? also it's clear that this is about experiments what is the actual intention behind this page? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. also, renaming page + copy editing makes it almost impossible to review There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Currently the section in the index is just called Experiments and I'm happy to rename it back to that. I just thought it would be clearer to "brand them" since they're special. We also say things like DVC Project, DVC Cache, and DVC Remote.
"...dedicated DVC Experiments page to explain what experiments are and potentially go into implementation details (in the future)." 🙂
This is a brand new page. No file was renamed in this PR. Some of it's info was extracted from the index.md. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. p.s. I realize it's a huge PR. That's because we wanted to try nesting PRs, but it resulted in this one being too big and losing track of which changes we already approved... But the PR description has a list of changes done with links to the files updated (4 major ones). Everything else is small copy edits and link updates. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
In fact we already call it "DVC Experiments" in some places e.g. there's a few instances in https://github.com/iterative/dvc.org/pull/2901/files#diff-8bac2dee1e13766aefa536c3d2fa38296dc6b09ba750fd71c9eedde61df8d1b9 |
||
|
||
_New in DVC 2.0_ | ||
|
||
`dvc exp` commands let you automatically track a variation to an established | ||
[data pipeline](/doc/command-reference/dag) baseline. You can create multiple | ||
isolated experiments this way, as well as review, compare, and restore them | ||
later, or roll back to the baseline. The basic workflow goes like this: | ||
|
||
- Modify stage <abbr>parameters</abbr> or other dependencies (e.g. input data, | ||
source code) of committed stages. | ||
- [Run experiments] with `dvc exp run` (instead of `repro`) to execute the | ||
pipeline. The results are reflected in your <abbr>workspace</abbr>, and | ||
tracked automatically. | ||
- Use `dvc metrics` to identify the best experiment(s). | ||
- Visualize, compare experiments with `dvc exp show` or `dvc exp diff`. Repeat | ||
🔄 | ||
- Use `dvc exp apply` to roll back to the best one. | ||
- Make the selected experiment persistent by committing its results to Git. This | ||
cleans the slate so you can repeat the process. | ||
|
||
[run experiments]: /doc/user-guide/experiment-management/running-experiments | ||
|
||
## Persistent Experiments | ||
|
||
When your experiments are good enough to save or share, you may want to store | ||
them persistently as Git commits in your <abbr>repository</abbr>. | ||
|
||
Whether the results were produced with `dvc repro` directly, or after a | ||
`dvc exp` workflow, `dvc.yaml` and `dvc.lock` will define the experiment as a | ||
new project version. The right <abbr>outputs</abbr> (including | ||
[metrics](/doc/command-reference/metrics)) should also be present, or available | ||
via `dvc checkout`. | ||
|
||
Use `dvc exp apply` and `dvc exp branch` to persist experiments in your Git | ||
history. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Not this PR: below: Setting up the project ... should we do
##
->###
?This comment was marked as resolved.
Sorry, something went wrong.