jsonstat.py is a library for reading the JSON-stat data format maintained and promoted by Xavier Badosa. The JSON-stat format is a JSON format for publishing dataset. JSON-stat is used by several institutions to publish statistical data. An incomplete list is:
- Eurostat that provide statistical information about the European Union (EU)
- Central Statistics Office of Ireland
- United Nations Economic Commission for Europe (UNECE) statistical data are here
- Statistics Norway
- UK Office for national statistics see their blog post
- others...
jsonstat.py library tries to mimic as much is possible in python the json-stat Javascript Toolkit. One of the library objectives is to be helpful in exploring dataset using jupyter (ipython) notebooks.
For a fast overview of the feature you can start from this example notebook oecd-canada-jsonstat_v1.html You can also check out some of the jupyter example notebook from the example directory on github or from the documentation
You can find useful another python library pyjstat by Miguel Expósito Martín concerning json-stat format.
This library is in beta status. I am actively working on it and hope to improve this project. For every comment feel free to contact me [email protected]
You can find source at github , where you can open a ticket, if you wish.
You can find the generated documentation at readthedocs.
Pip will install all required dependencies. For installation:
pip install jsonstat.py
There is a simple command line interface, so you can experiment to parse jsonstat file without write code:
# parsing collection $ jsonstat info --cache_dir /tmp http://json-stat.org/samples/oecd-canada.json downloaded file(s) are stored into '/tmp' download 'http://json-stat.org/samples/oecd-canada.json' Jsonsta tCollection contains the following JsonStatDataSet: +-----+----------+ | pos | dataset | +-----+----------+ | 0 | 'oecd' | | 1 | 'canada' | +-----+----------+ # parsing dataset $ jsonstat info --cache_dir /tmp "http://ec.europa.eu/eurostat/wdds/rest/data/v2.1/json/en/tesem120?sex=T&precision=1&age=TOTAL&s_adj=NSA" downloaded file(s) are stored into '/tmp' download 'http://ec.europa.eu/eurostat/wdds/rest/data/v2.1/json/en/tesem120?sex=T&precision=1&age=TOTAL&s_adj=NSA' name: 'Unemployment rate' label: 'Unemployment rate' size: 467 +-----+-------+-------+------+------+ | pos | id | label | size | role | +-----+-------+-------+------+------+ | 0 | s_adj | s_adj | 1 | | | 1 | age | age | 1 | | | 2 | sex | sex | 1 | | | 3 | geo | geo | 39 | | | 4 | time | time | 12 | | +-----+-------+-------+------+------+
code example:
url = 'http://json-stat.org/samples/oecd-canada.json' collection = jsonstat.from_url(url) # print list of dataset contained into the collection print(collection) # select the first dataset of the collection and print a short description oecd = collection.dataset(0) print(oecd) # print description about each dimension of the dataset for d in oecd.dimensions(): print(d) # print a datapoint contained into the dataset print(oecd.value(area='IT', year='2012')) # convert a dataset in pandas dataframe df = oecd.to_data_frame('year')
For more python script examples see examples directory.
For jupyter (ipython) notebooks see examples-notebooks directory.
This is an open source project, maintained in my spare time. Maybe a particular features or functions that you would like are missing. But things don’t have to stay that way: you can contribute the project development yourself. Or notify me and ask to implement it.
Bug reports and feature requests should be submitted using the github issue tracker. Please provide a full traceback of any error you see and if possible a sample file. If you are unable to make a file publicly available then contact me at [email protected].
You can find support also on the google group.
Any help will be greatly appreciated, just follow those steps:
- Fork it. Start a new fork for each independent feature, don’t try to fix all problems at the same time, it’s easier for those who will review and merge your changes.
- Create your feature branch (
git checkout -b my-new-feature
) - Write your code. Add unit tests for your changes!
If you added a whole new feature, or just improved something, you can be proud of it,
so add yourself to the
AUTHORS
file :-) Update the docs! - Commit your changes (
git commit -am 'Added some feature'
) - Push to the branch (
git push origin my-new-feature
) - Create new Pull Request. Click on the large "pull request" button on your repository. Wait for your code to be reviewed, and, if you followed all theses steps, merged into the main repository.
jsonstat.py is provided under the LGPL license. See LICENSE file.