Raw data plausibility checking

About the OpenSAFELY framework

The OpenSAFELY framework is a secure analytics platform for electronic health records research in the NHS.

Instead of requesting access for slices of patient data and transporting them elsewhere for analysis, the framework supports developing analytics against dummy data, and then running against the real data within the same infrastructure that the data is stored. Read more at OpenSAFELY.org.

To enable this, some exploration of raw data is required in order to implement new data as easy-to-use and well-documented functions for end users.

This repo contains (will contain) a template for performing plausibility checking of datasets.

How to use the template

Add codelist to the codelists/codelists.txt file
Make changes to the analysis/config.py file
Make changes to the analysis/config_numeric_value_checks.py file
This code can then be run locally using the command opensafely run run_all
This generates a Jupyter notebook (.ipynb) file in the analysis subfolder (e.g., analysis/Notebook_numeric_values_<codelist_name>.ipynb)
Someone with L2/3 access can then clone the repository and run the notebook as per these instructions.

How to use the numeric values template

Add desired codelist(s) to codelists/codelists.txt
Download codelist(s) using opensafely codelists update
Specify one codelist and other required information in analysis/config_numeric_value_checks.py.
Generate the notebook (ipynb) file locally using the command opensafely run create_notebook_numeric. Alternatively, run the analysis/create_notebook_numeric_value_checks.py file itself directly.
Repeat steps 3-4 for each codelist.
Commit the new & modified files to the repo.
Someone with L2/3 access can then clone the repository and run the notebook as per these instructions.
Notebooks can be saved to html and made available for release.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
.github/workflows		.github/workflows
analysis		analysis
codelists		codelists
docs		docs
logs		logs
notebooks		notebooks
output		output
tests		tests
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
project.yaml		project.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Raw data plausibility checking

About the OpenSAFELY framework

How to use the template

How to use the numeric values template

About

Releases

Packages

Contributors 3

Languages

License

opensafely/raw-data-plausibility-checks

Folders and files

Latest commit

History

Repository files navigation

Raw data plausibility checking

About the OpenSAFELY framework

How to use the template

How to use the numeric values template

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages