The code for Dunith Dhanushka and Mark Needham's upcoming book on Real-Time Analytics.
Put the architecture diagram here
docker-compose \
-f docker-compose-base.yml \
-f docker-compose-pinot.yml \
-f docker-compose-dashboard-enriched.yml \
-f docker-compose-dashboard.yml up
docker-compose \
-f docker-compose-base.yml \
-f docker-compose-pinot-m1.yml \
-f docker-compose-dashboard.yml \
-f docker-compose-enrich-streams.yml \
-f docker-compose-dashboard-enriched.yml \
up
Once that's run, you can navigate to the following:
- Pinot UI - http://localhost:9000
- Streamlit Dashboard - http://localhost:8501
docker run -v $PWD/pinot/config:/config \
--network rta \
apachepinot/pinot:0.11.0 \
AddTable -schemaFile /config/orders_enriched/schema.json \
-tableConfigFile /config/orders_enriched/table.json \
-controllerHost pinot-controller -exec
Setup Python environment:
python -m venv venv
source venv/bin/activate
Install dependencies:
pip install -r scripts/requirements.txt
Downloading product pages:
python scripts/download_products.py
Scrape product data:
python scripts/scrape_products.py
The products will be written to simulator/data/products.json