Skip to content

Framework for the developing and deploying interactive visulization tools to be used by experts investigating the Quality-Control status of ALICE TPC operations.

Notifications You must be signed in to change notification settings

christianreckziegel/TPCQCVis

 
 

Repository files navigation

TPCQCVis

Index:

  1. Introduction
  2. Setting up
  3. User Guide
  4. Developers Guide

Introduction:

The monitoring of the ALICE TPC quality control data in RUN3 is planned to be done using the Jupyter environment, where libraries e.g. Bokeh plotter can be used which enables very user-friendly interactivity capabilities. We have PyRoot based notebooks for visualizing the Root Object outputs from central sync and async QC. And, we have expert dashboards made with RootInteractive using skimmed data.

The webinterface for generated reports is located at alice-tpc-qc.web.cern.ch

Setting up:

  1. Install O2 & Quality Control
  2. Set up GRID certificate (if not already done)
    1. Create and download certificate from ca.cern.ch/ca
    2. Convert to right format as described in alice-analysis-tutorial
    3. Test certificate as described in alice-analysis-tutorial or run alien.py with O2 loaded.
  3. Enter QC environment
    1. Install TPCQCVis
      1. Clone this repo.
        1. git clone https://github.com/bulukutlu/TPCQCVis.git
      2. Set the following environment variables in your .bashrc or .bash_profile etc.:
        • TPCQCVIS_DIR: Directory path where the code resides.
        • TPCQCVIS_DATA: Directory path where data will be downloaded.
        • TPCQCVIS_REPORT: Directory path where reports will be stored.
      3. Install the package: pip install -e $TPCQCVIS_DIR
      4. Check if installation is OK: python $TPCQCVIS_DIR/post_install_checks.py
    2. [Optional] Install RootInteractive

User guide:

Creating reports with templates

Tip

More detailed examples for custom report creation down below in Chapter: Full workflow example The worflow for the report creation is as follows:

  1. During offline production async QC tasks are run. A QC merger process gathers all files for a run in to on file. The produced QC output goes into the QCDB (which can be visualized via the QCG) but also to the alien with the name QC_fullrun.root
  2. We download the QC_fullrun.root objects from alien.

    Implemented in tools/downloadFromAlien.py

  3. We extract the TPC relevant objects and write them to a new root file. We also run postprocessing on this file.

    Implemented in tools/runPlotter.py

  4. We create reports using jupyter notebook templates with the QC files as input.

    Implemented in tools/generateReport.py

  5. The generated reports are uploaded to the eos project folder (located at /eos/project-a/alice-tpc-qc/www also accesible via CernBox). The WebServices handle the displaying of the reports for viewers.

    Implemented in tools/syncAndUpload.py

To avoid having to go through all of the steps above everytime, there are automation scripts implemented.

Daily Async Report

python $TPCQCVIS_DIR/TPCQCVis/tools/dailyAsyncFromEmail.py [--date DATE] [--dates DATES...] [--num_threads NUM_THREADS] [--schedule SCHEDULE] [--mattermost]

Every day at 10:00 CERN time the MonALISA interface sends an email to the alice-dpg-async-qc mailing group the list of async productions completed that day.

The tools/dailyAsyncFromEmail.py script allows the automatic generation of reports for the productions of a given day.

  • Before running the script, you need to authorize access to Gmail by following these steps:
    • Get the login information for the Gmail account used to fetch the reminder emails from @bulukutlu
    • Place the generated credentials.json file in the directory specified by TPCQCVIS_DIR.
    • When running the script initially, it will prompt you to log in to the Google account and authorize access.

Options:

  • --num_threads: how many threads to use when running in parallel (downloading still done sequentially due to LRZ limitations)
  • --date: specify date (example: --date 08.04.2024)
  • --dates: specify dates (example: --dates 06.04.2024 07.04.2024 08.04.2024)
  • --schedule: Schedule automatic running of the script everyday at specified time (example: --schedule 1015)
  • --mattermost: Send overview of generated reports to TPC-AsyncQC channel

Example:

python $TPCQCVIS_DIR/TPCQCVis/tools/dailyAsyncFromEmail.py --num_threads 10 --date 13.05.2024

QC Master Automation

python $TPCQCVIS_DIR/TPCQCVis/tools/qc_master.py [--path PATH] [--apass APASS] [-d] [-p] [-r] [-rr] [-t NUM_THREADS] [period_list...]

This script allow the calling of different parts of the production chain for multiple different periods automatically.

Options:

  • period_list: List of period strings.
  • -d or --download: Run the download command.
  • -p or --plot: Run the plotter command.
  • -r or --report: Run the report command.
  • -rr or --rerun: Rerun plotter for existing periods.
  • --path: Path string for the generateReport command.
  • --apass: Apass string for the generateReport command.
  • -t or --num_threads: Number of threads to be used - (default: 1).

Examples:

python $TPCQCVIS_DIR/TPCQCVis/tools/qc_master.py --path $TPCQCVIS_DATA/2023/ --apass apass3 --download --plot --report LHC23zzk

To upload generated reports to the webpage, afterwards do: python $TPCQCVIS_DIR/TPCQCVis/tools/syncAndUpload.py'

MonteCarlo

The QC output of the MonteCarlo productions is a bit differently structured, as such the downloading has to be handled with a different script.

python $TPCQCVIS_DIR/TPCQCVis/tools/downloadSim.py [alien paths ...]

or

python $TPCQCVIS_DIR/TPCQCVis/tools/downloadSim.py [alien dir]

Example:

python $TPCQCVIS_DIR/TPCQCVis/tools/downloadSim.py --dir /alice/sim/2024/LHC24b1b/0/

After downloading the files and cd'ing to their location, the QC plot files can again be generated using:

python $TPCQCVIS_DIR/TPCQCVis/tools/runPlotter.py -t 10 $PWD/

Comparing to Data:

For comparing MC against the data runs they were anchored to, a report template exist at: reports/TPC_AQC_Template_CompareRunToMC.ipynb To generate comparison reports use the tools/generateMCComparisonReports.py tool. For this set in the python script the info for the wanted comparison. e.g.:

### Part to set
path = f"{DATADIR}/sim/2024/"
period = "LHC24e2" 
passName = "" #keep empty ("") if MC
pathComparison = f"{DATADIR}/2023/"
periodListComparison =  ["LHC23zzf","LHC23zzg","LHC23zzh"]
passNameListComparison = ["apass3","apass3","apass3"]

Afterwards, run the script:

python $TPCQCVIS_DIR/TPCQCVis/tools/generatMCComparisonReports.py 

Which will put the reports in to the corresponding data folder (direcotry with .root files for MC). To upload you can use the syncAndUpload tool.

Tip

Normally, the MC runs are very stable and don't need to be checked on their own (unless requested). If you want, you can generate the normal QC reports using the tools/generateReport.py script.

To convert the saved notebook into html report see Exporting Notebooks

Developers guide:

This repository mainly contains the automation scripts described above and ROOT functions for making nicer plots and postprocessing.

  • The implementation is located at $TPCQCVIS_DIR/TPCQCVis/src.
  • For testing functionality there are notebooks located at $TPCQCVIS_DIR/TPCQCVis/tests.
  • Notebook demonstrating some core functions UserGuide.ipynb
  • Other demos can be found at TPCQCVis/tutorials/ (not up to date)

Exporting notebooks:

As report:

Command to run:

jupyter nbconvert myNotebook.ipynb --to html --template classic --no-input --execute

Remove the --no-input to have the code also in the report

As slides:

Command to run:

jupyter nbconvert myNotebook.ipynb --to slides --no-input --SlidesExporter.reveal_scroll=True

Then open the html file with text editor and change the initializer of reveal to this:

Reveal.initialize({
            controls: true,
            progress: true,
            history: false,
            transition: "slide",
            slideNumber: "true",
            viewDistance: 50,
            mobileViewDistance: 20,
            preloadIframes: true,
            autoPlayMedia:true,
            plugins: [RevealNotes]
        });

This makes it possible that all the plots in the slides will be loaded automatically when file is opened on browser. (works on Chrome, still didn't get the expected behavior in Firefox)

Interactive Dashboards

Note

It is planned to also provide interactive dashboards generated using RootInteractive for TPC QC. These will be based on skimmed AO2D and reconstructed data, as well as timeseries and aggregated run informations collected from CCDB and QCDB. Currently this project is [WIP] and no automatized dashboards are yet being created.

For the development, some example dashboard templates can be found at /TPCQCVis/dashboards/

Full workflow example

You were asked to make all necessary reports for the newly produced LHC23zzk apass3. And the experts are curious how it compares to the previous apass2. Since we only need apass2 for comparison. We will omit generating the standalone reports for that pass, but we will still create the root files needed for the comparison report.

Warning

There is a high chance by the time you are running this, the apass2 or apass3 files for LHC23zzk were outdate and got deleted from alien. In that case, follow these steps for more recent productions. Alternatively, you can also copy the files from our project backup at /eos/project-a/alice-tpc-qc/data/ accessed via lxplus or CERNBox.

Let's see how to generate necessary reports.

Approach 1: Step-by-Step

  1. Get all necessary files on your PC from alien. We will need the QC_fullrun.root files for both apass3 and apass2. They are located in directories /alice/data/2023/LHC23zzk/ in alien. To download them locally, we make use of the downloadFromAlien.py tool:
    python $TPCQCVIS_DIR/TPCQCVis/tools/downloadFromAlien.py $TPCQCVIS_DATA/2023/LHC23zzk/apass3/ /alice/data/2023/LHC23zzk/ apass3
    python $TPCQCVIS_DIR/TPCQCVis/tools/downloadFromAlien.py $TPCQCVIS_DATA/2023/LHC23zzk/apass2/ /alice/data/2023/LHC23zzk/ apass2
    
    You should end up with two folders containing .root files for different runs in $TPCQCVIS_DATA/2023/LHC23zzk/.
  2. Run plotter command for both folders:
    python $TPCQCVIS_DIR/TPCQCVis/tools/runPlotter.py -t 10 $TPCQCVIS_DATA/2023/LHC23zzk/apass3/
    python $TPCQCVIS_DIR/TPCQCVis/tools/runPlotter.py -t 10 $TPCQCVIS_DATA/2023/LHC23zzk/apass2/
    
    Now you should have accompanying _QC.root files in your directories.
  3. Generate reports:
    python $TPCQCVIS_DIR/TPCQCVis/tools/generateReport.py  $TPCQCVIS_DATA/2023/ LHC23zzk apass3
    
    You should end up with multiple {RunNumber}.html files, one LHC23zzk_apass3.html inside the apass3 directory and one LHC23zzk_apass3_comparison.hmtl report in the outer LHC23zzk directory.
  4. Upload to the webpage:
    python $TPCQCVIS_DIR/TPCQCVis/tools/syncAndUpload.py
    
    Now you should see the latest version of your reports on alice-tpc-qc.web.cern.ch/reports/2023/LHC23zzk/.

Approach 2: QC Master

  1. Run the QC master for apass2 without reports:
    python $TPCQCVIS_DIR/TPCQCVis/tools/qc_master.py --path $TPCQCVIS_DATA/2023/ --apass apass2 --download --plot LHC23zzk
    
  2. Run the QC master for apass3 with reports:
    python $TPCQCVIS_DIR/TPCQCVis/tools/qc_master.py --path $TPCQCVIS_DATA/2023/ --apass apass3 --download --plot --report LHC23zzk
    
  3. Upload to the webpage:
    python $TPCQCVIS_DIR/TPCQCVis/tools/syncAndUpload.py
    
    Now you should see the latest version of your reports on alice-tpc-qc.web.cern.ch/reports/2023/LHC23zzk/.

About

Framework for the developing and deploying interactive visulization tools to be used by experts investigating the Quality-Control status of ALICE TPC operations.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 99.1%
  • Other 0.9%