Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

get-started: agenda copy edits #683

Merged
merged 4 commits into from
Oct 10, 2019
Merged
Changes from 2 commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
39 changes: 19 additions & 20 deletions static/docs/get-started/agenda.md
Original file line number Diff line number Diff line change
@@ -1,36 +1,35 @@
# Agenda

In the next few sections we will build a simple natural language processing
gagejustins marked this conversation as resolved.
Show resolved Hide resolved
(NLP) project from scratch. If you'd like to get the final result or have some
issues along the way, you can download the fully reproducible
[GitHub project](https://github.com/iterative/example-get-started) with:
In the next few sections we'll build a simple natural language processing (NLP)
project from scratch. If you'd like to get the final result or have any issues
along the way, you can download the fully reproducible
[GitHub project](https://github.com/iterative/example-get-started) by running:

```dvc
$ git clone https://github.com/iterative/example-get-started
```

Otherwise, bear with us and we will introduce the basic DVC concepts and get to
Otherwise, bear with us and we'll introduce some basic DVC concepts to get to
gagejustins marked this conversation as resolved.
Show resolved Hide resolved
the same result together!

The idea of the project is a simplified version of the
[Tutorial](/doc/tutorials/deep). It explores the NLP problem of predicting tags
for a given StackOverflow question. For example, we want a classifier that can
predict posts about the Python language by tagging them `python`.
The idea of the project is a simplified version of our
[text classification tutorial](/doc/tutorials/deep). It explores the NLP problem
gagejustins marked this conversation as resolved.
Show resolved Hide resolved
of predicting tags for a given StackOverflow question. For example, we might
want a classifier that can flag (or predict) posts about Python by tagging them
with `python`.
jorgeorpinel marked this conversation as resolved.
Show resolved Hide resolved

![](/static/img/example-flow-2x.png)

Do not let the NLP nature of the example discourage you from using DVC in other
Data Science areas. There was no strong reason behind picking the NLP area. On
contrary, DVC is designed to be agnostic of frameworks, languages, etc. If you
have data files or datasets and/or you produce other data files, models,
This is a natural language processing context, but NLP isn't the only area where
gagejustins marked this conversation as resolved.
Show resolved Hide resolved
DVC can help. DVC is designed to be agnostic of frameworks, languages, etc. If
you _have_ data files or datasets and/or you _produce_ data files, models, or
gagejustins marked this conversation as resolved.
Show resolved Hide resolved
datasets and you want to:

- Capture and save those <abbr>data artifacts</abbr> the same way we capture
- Capture and save those <abbr>data artifacts</abbr> the same way you capture
code
- Track and switch between different versions of the data easily
- Be able to answer the question of how data artifacts (e.g. ML models) were
built in the first place
- Be able to compare them
- Bring best practices to your team and get everyone on the same page
- Track and switch between different versions of data easily
- Understand how data artifacts (e.g. ML models) were built in the first place
- Be able to compare models to each other
- Bring software best practices to your team and get everyone on the same page

Then you are in a good place! Click the `Next` button below to start ↘
Then you're in the right place! Click the `Next` button below to start ↘