REANA example - CMS Top quark mass measurement from b-jet energy spectrum

About

This repository provides a simplified particle physics analysis example for the REANA reusable research data analysis plaftorm. The objective is to extract the top-quark mass by measuring the peak position of the energy of b-tagged jets in the laboratory frame as portrayed in the CMS Data Analysis School exercises. More information can be accessed in the CMSDAS example repository here.

Analysis structure

Making a research data analysis reproducible basically means to provide "runnable recipes" addressing (1) where is the input data, (2) what software was used to analyse the data, (3) which computing environments were used to run the software and (4) which computational workflow steps were taken to run the analysis. This will permit to instantiate the analysis on the computational cloud and run the analysis to obtain (5) output results.

1. Input data

The analysis takes the following inputs:

the list of sample CMS runs from 2015 and 2016 included in the inputs directory:
- samples_Run2015_25ns.json
- samples_Run2015_25ns_m169p5.json
- samples_Run2015_25ns_m175p5.json
- samples_Run2016_25ns.json
- samples_Run2016_25ns_m169p5.json
- samples_Run2016_25ns_m175p5.json
- samples_Run2016_m169p5.json
- samples_Run2016_m175p5.json

2. Analysis code

The analysis will consist of three stages. In the first stage, we shall process the original and simulated collision data (using analyzeNplot.py) to select top-pair events that decay in the eμ channel, compare the selection (control distributions and event yields), and propagate the sources of systematic uncertainties to the b jet energy peak;. In the second stage, we shall fit the b jet energy peak and calibrate the peak measured for the set of selection criteria previously defined to the expected b jet energy peak, from which the top-quark mass can be easily extracted. In the third stage, we shall compare the results with the standard top-quark mass measurements performed with 8 TeV data.

3. Compute environment

In order to be able to rerun the analysis even several years in the future, we need to "encapsulate the current compute environment", for example to freeze the software package versions our analysis is using. We shall achieve this by preparing a Docker container image for our analysis steps.

This analysis example runs within the CMSSW analysis framework that was packaged for Docker in clelange/cmssw.

4. Analysis workflow

The analysis workflow is simple and consists of the above-mentioned stages:

We shall use the CWL workflow specification to express the computational workflow:

workflow definition

and its individual steps:

Name		Name	Last commit message	Last commit date
Latest commit History 49 Commits
code		code
data		data
workflow		workflow
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

REANA example - CMS Top quark mass measurement from b-jet energy spectrum

About

Analysis structure

1. Input data

2. Analysis code

3. Compute environment

4. Analysis workflow

About

Releases

Packages

Languages

diyaselis/reana-demo-cms-topmass

Folders and files

Latest commit

History

Repository files navigation

REANA example - CMS Top quark mass measurement from b-jet energy spectrum

About

Analysis structure

1. Input data

2. Analysis code

3. Compute environment

4. Analysis workflow

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages