Health HeatMap Backend

This software can handle multitudes of spreadsheets and upload into a multi-dimensional database which allows querying by various dimensions.

Overview

This software can be conceptually divided into two equally important halves. The first half is an Extract-Transform-Load (ETL) pipeline, and the second half is a web API to query data.

Framework

It uses Quarkus for wiring things together. Read guides for most of the extension.

Data

Data sourced from public datasets is curated, annotated, and organized. This is in a form which can be directly ingested by the software.

ETL pipeline

Extract: Data from CSV files can be extracted directly at the moment. Data in PDF has to be converted to CSV first. Each spreadsheet that needs to be included in the database should come with a metadata.json file that describes the layout of data within the spreadsheet.
Transform: Data extracted from the CSVs can optionally be put through a series of transformations which are specified through another set of CSV files.
Load: The data points after applying all the transformations get uploaded to elasticsearch.

Query API

There is a JAX-RS API configured with endpoints to query data previously uploaded.

Setup

Java 11+ is required.
Elasticsearch 7 is required.
Clone this repo. Run ./mvnw clean install.
Run java -jar cli/target/health-heatmap-cli-runner.jar for command line options.

Add data

Put data somewhere, say /home/metastring/healthheatmap-data
Run java -jar cli/target/health-heatmap-cli-runner.jar upload --path /home/metastring/healthheatmap-data -n ''

(Add -z flag to delete the pre-existing index in elasticsearch)

Run server

Run java -jar web/target/health-heatmap-web-runner.jar
Go to http://localhost:8080/api-playground/

Development

You can do ./mvnw -pl web quarkus:dev to run a development server with automatic reloads

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Health HeatMap Backend

Overview

Framework

Data

ETL pipeline

Query API

Setup

Add data

Run server

Development

Files

README.md

Latest commit

History

README.md

File metadata and controls

Health HeatMap Backend

Overview

Framework

Data

ETL pipeline

Query API

Setup

Add data

Run server

Development