Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

guide: external cache info (extracted from Use Cases) #2475

Merged
merged 13 commits into from
May 26, 2021

Conversation

jorgeorpinel
Copy link
Contributor

@jorgeorpinel jorgeorpinel commented May 14, 2021

Rel. #654 (comment)

  • Make external cache setup section in external data guide
  • Simplify the shared cache setup section in the use case
  • Add some usage notes and warnings
  • Review usage of term "external cache" and link to guide
  • Other small copy edits

May overlap with #2471

not sure when it was changed?
@shcheklein shcheklein temporarily deployed to dvc-org-cases-shared-de-vohp7s May 14, 2021 00:13 Inactive
in regards to external caches
@jorgeorpinel jorgeorpinel temporarily deployed to dvc-org-cases-shared-de-vohp7s May 14, 2021 00:41 Inactive
@jorgeorpinel jorgeorpinel temporarily deployed to dvc-org-cases-shared-de-vohp7s May 14, 2021 00:42 Inactive
@jorgeorpinel jorgeorpinel temporarily deployed to dvc-org-cases-shared-de-vohp7s May 14, 2021 00:43 Inactive
@jorgeorpinel jorgeorpinel temporarily deployed to dvc-org-cases-shared-de-vohp7s May 14, 2021 18:00 Inactive
@jorgeorpinel jorgeorpinel marked this pull request as ready for review May 14, 2021 18:00
@jorgeorpinel jorgeorpinel temporarily deployed to dvc-org-cases-shared-de-vohp7s May 14, 2021 18:00 Inactive
@jorgeorpinel jorgeorpinel temporarily deployed to dvc-org-cases-shared-de-vohp7s May 14, 2021 18:39 Inactive
@jorgeorpinel jorgeorpinel temporarily deployed to dvc-org-cases-shared-de-vohp7s May 16, 2021 18:32 Inactive
@jorgeorpinel
Copy link
Contributor Author

@casperdcl @iesahin please try to pre-review this if one of you has a chance soon. I'm wondering if the changes will make sense to you without much context (I did add a description above though). Thanks

@jorgeorpinel jorgeorpinel temporarily deployed to dvc-org-cases-shared-de-vohp7s May 16, 2021 22:56 Inactive

Tell DVC to use the directory we've set up above as the <abbr>cache</abbr> for
your <abbr>project</abbr>:
A <abbr>cache</abbr> outside the <abbr>workspace</abbr> is called an
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the <abbr>cache</abbr> contents (from https://github.com/iterative/dvc.org/blob/master/content/docs/user-guide/basic-concepts/dvc-cache.md) doesn't seem consistent with this para

Copy link
Contributor Author

@jorgeorpinel jorgeorpinel May 18, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How so? Can you be more specific please? 🙂

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

the tooltip doesn't mention anything about the cache being external, but this para is abot externally located caches. Not sure if it may cause further confusion for people who have no idea what's going on.

Copy link
Contributor Author

@jorgeorpinel jorgeorpinel May 19, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Gotcha. I think that the part "by default in .dvc/cache" suggests it can be located elsewhere. And nothing there limits it from being external.

I guess that's what @iesahin's #2475 (comment) referred to as well.

[external dependencies](/doc/user-guide/external-dependencies)) provide ways to
track and version data outside of the <abbr>project</abbr>.

## How external outputs work

External <abbr>outputs</abbr> are considered part of the (extended)
<abbr>workspace</abbr>: DVC will track them for
External <abbr>outputs</abbr> will be tracked by DVC for
Copy link
Contributor

@casperdcl casperdcl May 17, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think so. The tooltip says "by default" and it implies it can be configured to have different locations, e.g., externally in some NFS mount.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

But the page needs a concept explanation.

Copy link
Contributor Author

@jorgeorpinel jorgeorpinel May 18, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did you mean the output tooltip @casperdcl ? (I edited your comment). If so can you specify why or what part? Sorry, the connection is not obvious to me. The part about .dvc files?

But the page needs a concept explanation.

@iesahin you mean a standalone concepts page? Sure, but out of scope here 😬 (related to #2453 though)

Copy link
Contributor

@casperdcl casperdcl May 18, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

oops, yes wrong link. I'm complaining about a different <abbr> (output) here.

The point is

  1. the section title was changed from "external outputs" to "external data," and
  2. yet we continue using "external outputs" in this para here, while
  3. the tooltip is only about "outputs."

Copy link
Contributor Author

@jorgeorpinel jorgeorpinel May 19, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK.

  1. Context on why we don't want the page title to have the word "output": docs: clarifications around external outputs info. #2154 (review)
  2. True. But the section name didn't change and it is How external outputs work.
  3. Right, so similar to the "external cache" comment where you see possible confusion by adding an adjective to the tooltipped term? TBH I don't see the problem but maybe I'm biased.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have the approach where docs are not meant for us, they're meant for people who are confused are thus looking for docs. I understand, but I don't think they will. Using multiple different similar-sounding terms tends to create a sense of complexity and makes readers (consciously or subconsciously) feel lost.

Copy link
Contributor Author

@jorgeorpinel jorgeorpinel May 19, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah of course they're not meant for us 🙂

I just don't understand how this can be confusing. What are the similar sounding terms here? Or if you have a suggestion to improve this, maybe that would clarify your PoV.

@jorgeorpinel jorgeorpinel requested a review from shcheklein May 18, 2021 06:08
@jorgeorpinel jorgeorpinel temporarily deployed to dvc-org-cases-shared-de-vohp7s May 18, 2021 06:15 Inactive
@jorgeorpinel jorgeorpinel self-assigned this May 19, 2021
@@ -57,7 +57,7 @@ $ ls -a
.git code.py foo
```

## Example: External cache directory
## Example: External cache directory (is preserved)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

hmm, not sure I understand this?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

My intention was to clarify what the example is about. It shows that destroy doesn't affect external cache dirs. TBH I'm not sure this is the best place to show that, maybe just a brief note in the description?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

UPDATE: I see the note already exists. Changed to "Preserve an external cache directory" for now. I guess removing the example would be out of scope for this PR.

Copy link
Contributor Author

@jorgeorpinel jorgeorpinel May 25, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

UPDATE 2: Shoot, I went ahead and rephrased most of the example. It wasn't very clear before. Should I split into another PR @shcheklein ? See 11f5986.

jorgeorpinel added a commit that referenced this pull request May 25, 2021
jorgeorpinel added a commit that referenced this pull request May 25, 2021
@jorgeorpinel jorgeorpinel force-pushed the cases/shared-dev/external-cache branch 2 times, most recently from 62b4fed to 11f5986 Compare May 25, 2021 00:14
@jorgeorpinel jorgeorpinel merged commit 27e3143 into master May 26, 2021
@jorgeorpinel jorgeorpinel linked an issue May 26, 2021 that may be closed by this pull request
jorgeorpinel added a commit that referenced this pull request Jun 29, 2021
* cases&how: generalizing shared dev server case and
and extract guided info to new how-to

* cases: rewrite intro to Sharing Res and
draft a first section on storage

* cases: quick updates per #2482 feedback

* cases: connect cache/remote solutions to Sharing Res problem stmt
per #2482 (review)

* cases: move img down, reorg some Ps, remove WIP comment

* ref: update destroy external cache example
rel #2475 (review)

* cases: add figure to Shared Resources intro

* cases: wrap up Shared Resources story (1st version)

* cases: fix links to old case and
copy edits on the new one

* cases: address minor feedback
from #2482 (review) +

* cases: generalize info about basic caching data optimization
rel #2482 (review)

* cases: higher level solution intro
per #2482 (review)

* cases: remove md comment... oops

* cases: rewrite part about efficient processing
per #2482 (review)

* cases: roll back new use case

* ref: remove an md ref link

* guide: fix links that shouold go to now how-to

* guide: better intro for shared cache how-to
per #2482 (review)

* cases: remove unnecessary images
per static/img/resource-pool.png
@casperdcl casperdcl deleted the cases/shared-dev/external-cache branch July 14, 2021 08:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

guide: extract shared cache info from shared dev server case
4 participants