Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[super] Install a Data Package #30

Open
rufuspollock opened this issue Nov 11, 2016 · 1 comment
Open

[super] Install a Data Package #30

rufuspollock opened this issue Nov 11, 2016 · 1 comment
Milestone

Comments

@rufuspollock
Copy link
Contributor

rufuspollock commented Nov 11, 2016

[TODO]

This issue is about downloading (installing) datapackage to be used locally. Should also involve versioning - i.e. downloading particular version or updating the downloaded datapackage (separate issue?)

Some datapackages can be read directly into memory (see other usecases below). But sometimes datapackages are too large to fit into memory.
Web developers could use datapackages in their projects, and would like to install these locally (see original user stories below)

Other usecases not covered in this issue

  • For Nodejs users we can probably create native npm packages (TODO: new issue)
  • R users can read datapackage directly from bitstore into memory using datapkg
  • Python (pandas) users can read datapackage directly from bitstore into memory using jsontableschema-pandas

Original user stories:

As a Data Analyst I want to download a data package, so that I can study it and wrangle with it to infer new data or generate new insights.

As a Data Analyst, I want to update previously downloaded data package, so that I can work with the most recent data.

Low Priority

As a Web Developer of multiple projects, I want to be able to install multiple versions of the same datapackage separately so that all my projects could be developed independently and deployed locally. (virtualenv-like)

As a Developer I want to list all DataPackages requirements for my project in the file and pin the exact versions of any DataPackage that my project depends on so that the project can be deterministically deployed locally and won’t break because of the DataPackage schema changes. (requirements.txt-like)

@Fak3
Copy link
Contributor

Fak3 commented Jan 17, 2017

Copying here from US doc. How it could be implemented:
dpm install could download datapackages into the current dir. So that datapackages will live inside the web\analytics project tree (possibly gitignored if needed). This way analytics code will produce stable results, and web projects won't confilct with each other.

@zelima zelima added this to the Backlog milestone Apr 26, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants