Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Basic Concepts + Introduction files #296

Closed
wants to merge 6 commits into from
Closed

Add Basic Concepts + Introduction files #296

wants to merge 6 commits into from

Conversation

tapaswenipathak
Copy link

Fix #53.

Hi folks,

I am a GSoD applicant, have read your ideas list and working on closing tickets present in the list.

I have participated in multiple SoC/WoC and have been involved w/ open source since years. I have worked with Jupyter and have worked w/ ML algorithms. I am fully familiar with these areas. DVC.org is great fit for me but as a product new for me.

On docs I like detailed docs having every step present so that the user can be spoon feeded, makes it easily usable.

I see a scope of writing introduction on a lot of commands in the doc but curious if you are interested as most of the commands have a writeup written separately. I found diagrams as well on these. Please let me know what you think would be best for your org. Thanks!

Copy link
Member

@shcheklein shcheklein left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi!

Is this PR done and ready for merge?

I see some unfinished sentences like The example workspace look as follows - - that make me think this is WIP. Please add [WIP] to the title in this case.

@tapaswenipathak
Copy link
Author

@shcheklein Done as per me, please let me know if you think I can make it more useful.


## Remote

In DVC you can have remote data storage or a remote git hosted code server.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remote git hosted code server - I'm not sure I understand this. Where does it come from and what does it mean?

Copy link
Author

@tapaswenipathak tapaswenipathak May 11, 2019

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Git can be hosted on any remote ip, if you put code + git on the remote ip, you could call the ip as remote git hosted code server (https://git-scm.com/book/en/v1/Git-on-the-Server, https://git-scm.com/book/en/v2/Git-on-the-Server-Setting-Up-the-Server).

## Remote

In DVC you can have remote data storage or a remote git hosted code server.
A remote can be specified by the remote type prefix and a path. You can find
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

remote type prefix - remote is usually specified by its URL

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think in dvc adding a remote should be specified by type + prefix? Remotes can be

  • local
  • s3
  • gs
  • azure
  • ssh
  • hdfs
  • http

You would run dvc remote add -d /typer/remote path/url.

Copy link
Member

@shcheklein shcheklein left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's still not 80 symbols. Please, check the README.md#Contributing.

A few questions here and there to address.

In general, the doc is a good starting point, but I'm not sure we can consider the ticker resolved. Right now it's not very informative. It does not explain how cache is related to working space, etc.

@tapaswenipathak
Copy link
Author

tapaswenipathak commented May 11, 2019

@shcheklein

Screen Shot 2019-05-12 at 3 20 19 AM

I used vim and have set colorcolumn=80. Can you let me know the line number w/ more than 80 chars. You would find lines w/ less than 80 chars as I think documentation shouldn't have character breakings.

I added introduction.md as well, please let me know if you think any more additions would close the ticket. ref #296 (comment).

@tapaswenipathak
Copy link
Author

I think writing just on few things and just on relations when you have the docs per type clearly written, you would have less text when describing in introdction.md. If you prefer repetitions, I will add.

@tapaswenipathak tapaswenipathak changed the title Add introduction Add Basic Concepts + Introduction files May 11, 2019
@shcheklein
Copy link
Member

I used vim and have set colorcolumn=80. Can you let me know the line number w/ more than 80 chars. You would find lines w/ less than 80 chars as I think documentation shouldn't have character breakings.

@tapaswenipathak I'm not sure I understand your question.

@tapaswenipathak
Copy link
Author

tapaswenipathak commented May 13, 2019 via email

@shcheklein
Copy link
Member

@tapaswenipathak on the screenshot you shared you can see a lot of lines that are mode than 80 symbols. Here is an example - https://github.com/iterative/dvc.org/pull/296/files#diff-2e4a72ea9621637441b257e4b8428e16R18

@tapaswenipathak
Copy link
Author

tapaswenipathak commented May 14, 2019 via email

@shcheklein
Copy link
Member

@tapaswenipathak sorry for the delay. Either I'm missing something or I don't get you questions sorry. There is a style guide for the documentation. Every .md file should wrap all lines (except a few very specific case, like long URLs) at 80 symbols. Please, read also this link: https://dvc.org/doc/user-guide/contributing-documentation#code-style-guidelines

@shcheklein
Copy link
Member

@tapaswenipathak sorry, may be it's unclear a little bit, but style guide dictates the way raw files should be formatted.

@tapaswenipathak
Copy link
Author

tapaswenipathak commented May 17, 2019

The .md file has

Screen Shot 2019-05-17 at 9 55 25 AM

The formatted file has
Screen Shot 2019-05-17 at 9 54 19 AM

Guidelines has
Screen Shot 2019-05-17 at 10 03 13 AM

As per this guidelines the formatting is correct?

Do you mean?

Screen Shot 2019-05-17 at 10 10 12 AM

The formatted file will have much lesser than 80. Please let me know if I should move the word in next line.

@tapaswenipathak
Copy link
Author

tapaswenipathak commented May 17, 2019 via email

@shcheklein
Copy link
Member

@tapaswenipathak you got the idea right! You need to wrap the lines in your raw files at 80 symbols. If you use vim I would recommend to install the pre-commit hook that will check and format the files for you.

@tapaswenipathak
Copy link
Author

tapaswenipathak commented May 17, 2019 via email

@shcheklein
Copy link
Member

Should I fix the guidelines as well? This line suggests formatting should be correct and it is.

Yes, let's clarify it. Thanks!

@tapaswenipathak
Copy link
Author

tapaswenipathak commented May 21, 2019

@shcheklein do you have any more comments? Gentle reminder for #293, #292 a quick review.

Copy link
Member

@shcheklein shcheklein left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. Does not fit code style. There are still lines that are > 80 symbols.
  2. Content is not informative and does not give that much value on top of what we already have.

My recommendation: focus on one task at a time, go through DVC tutorials and get started, write one single new section that gives some values and does not just repeat other sections.

@tapaswenipathak
Copy link
Author

tapaswenipathak commented May 21, 2019

oh @shcheklein should I close this pr then?

@shcheklein
Copy link
Member

@tapaswenipathak I'm not sure, to be honest. It mostly depends on what are you going to do next with all three PRs.

@tapaswenipathak
Copy link
Author

tapaswenipathak commented May 21, 2019

I don't think I understand the inputs, I added the suggestions/iterations on this pr per the suggestions. If you can write what is missing in this pr or if you think I should close this, please let me know. Other two def I can close as no work is present on them or any iterations, after creating the pr.

@shcheklein
Copy link
Member

@tapaswenipathak let's close all three then. Sorry, I just don't have capacity right now to collaborate with you on this on the level it requires to get this stuff done :(

@tapaswenipathak
Copy link
Author

ok. :(

@tapaswenipathak tapaswenipathak deleted the issue-53 branch May 21, 2019 17:57
@shcheklein shcheklein mentioned this pull request Jun 9, 2019
2 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

user-guide: cache/remote/workspace relationships
2 participants