
Add script to run gain selection over a list of dates, using the "lst_select_gain" script #141

Merged (10 commits, Mar 30, 2022)

Conversation

@marialainez (Collaborator)

No description provided.

@morcuended (Member)

Thanks @marialainez. I have been trying to make it work with sbatch. All my attempts at using the sbatch --wrap option to export the PATH failed. The only way it works for me is to create job bash files that are executed with sbatch.

For example, using a function like this:

from textwrap import dedent
import subprocess as sp

PATH = "PATH=/fefs/aswg/software/gain_selection/bin:$PATH"

def get_sbatch_script(
        run_id,
        input_file,
        output_dir,
        log_dir,
        ref_time,
        ref_counter,
        module,
        ref_source
):
    return dedent(f"""\
    #!/bin/bash
    
    #SBATCH -D {log_dir}
    #SBATCH -o "gain_selection_{run_id:05d}_%j.log"
    #SBATCH --job-name "gain_selection_{run_id:05d}"
    #SBATCH --export {PATH} 
    
    lst_select_gain {input_file} {output_dir} {ref_time} {ref_counter} {module} {ref_source}
    """)


for file in input_files:
    run_info = run_info_from_filename(file)
    job_file = f"gain_selection_{run_info.run:05d}.{run_info.subrun:04d}.sh"
    with open(job_file, "w") as f:
        f.write(get_sbatch_script(
            run_info.run,  # run id parsed from the filename above
            file,
            output_dir,
            log_dir,
            ref_time,
            ref_counter,
            module,
            ref_source
        ))
    sp.run(["sbatch", job_file], check=True)
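
For reference, the kind of one-line --wrap submission that did not work is sketched below (a reconstruction for illustration; the exact flags tried are not shown in this thread):

# Attempted single-command submission: the job command goes in --wrap and the
# PATH in --export. sbatch accepts this, but the exported PATH did not reach
# lst_select_gain in my attempts, hence the job files above instead.
sp.run([
    "sbatch",
    "-D", log_dir,
    f"--export={PATH}",
    f"--wrap=lst_select_gain {file} {output_dir} {ref_time} {ref_counter} {module} {ref_source}",
], check=True)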


def apply_gain_selection(date: str):
    run_summary_file = "/fefs/aswg/data/real/monitoring/RunSummary/RunSummary_"+date+".ecsv"
    data = ascii.read(run_summary_file)
@morcuended (Member)

you can directly use an astropy Table -> data = Table.read(file)
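
A minimal sketch of that suggestion (Table.read infers the ECSV format from the .ecsv extension):

from astropy.table import Table

# Read the run summary directly into a Table; replaces the ascii.read call.
data = Table.read(run_summary_file)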

    run_summary_file = "/fefs/aswg/data/real/monitoring/RunSummary/RunSummary_"+date+".ecsv"
    data = ascii.read(run_summary_file)
    data.add_index("run_id")
    data = data[(data['run_type']=='DATA')]  # apply gain selection only to DATA runs
@morcuended (Member)

remove the redundant parentheses

    data.add_index("run_id")
    data = data[(data['run_type']=='DATA')]  # apply gain selection only to DATA runs
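
The filter without the redundant parentheses would read:

# Same selection, minus the extra parentheses around the condition.
data = data[data["run_type"] == "DATA"]  # apply gain selection only to DATA runs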

    output_dir = "/fefs/aswg/data/real/R0/gain_selected/"+date
@morcuended (Member)

you can use f-strings: f"/fefs/aswg/data/real/R0/gain_selected/{date}"

Comment on lines 26 to 31

    for run in data["run_id"]:

        ref_time = data.loc[run]["dragon_reference_time"]
        ref_counter = data.loc[run]["dragon_reference_counter"]
        module = data.loc[run]["dragon_reference_module_index"]
        ref_source = data.loc[run]["dragon_reference_source"]
@morcuended (Member)

you can directly loop over the table rows and extract the values afterwards:

for run in data:
    run_id = run["run_id"]
    ref_time = run["dragon_reference_time"]
    ref_counter = run["dragon_reference_counter"]
    module = run["dragon_reference_module_index"]
    ref_source = run["dragon_reference_source"].upper()

and then I think you would not need data.add_index("run_id") anymore, since you would no longer look rows up with data.loc.

@codecov bot commented Mar 14, 2022

Codecov Report

Merging #141 (4e7e39e) into main (ffc7e25) will increase coverage by 0.13%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##             main     #141      +/-   ##
==========================================
+ Coverage   81.74%   81.88%   +0.13%     
==========================================
  Files          51       51              
  Lines        4838     4995     +157     
==========================================
+ Hits         3955     4090     +135     
- Misses        883      905      +22     
Impacted Files                   Coverage Δ
osa/utils/tests/test_utils.py    98.38% <0.00%> (+1.61%) ⬆️
osa/utils/utils.py               81.93% <0.00%> (+1.93%) ⬆️
osa/scripts/copy_datacheck.py    32.91% <0.00%> (+2.00%) ⬆️
osa/scripts/autocloser.py        67.11% <0.00%> (+8.39%) ⬆️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update ffc7e25...4e7e39e.


log.info("Done! No more dates to process.")

check_failed_jobs(output_basedir)
@morcuended (Member) commented Mar 29, 2022

@marialainez, in the current way of checking failed jobs, you need to launch the script again, which will submit the jobs again, right? Either we implement a simulate option that only checks the job status, or the check is done in a separate script, independent of the job-launching script.
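
A possible shape for that simulate option (a sketch with a hypothetical --simulate flag; not the PR's actual interface):

import argparse

# Hypothetical CLI: with --simulate, only inspect job status; never submit.
parser = argparse.ArgumentParser()
parser.add_argument("--simulate", action="store_true",
                    help="only check the status of already submitted jobs")
args = parser.parse_args()

if not args.simulate:
    for date in dates:  # dates: the list of dates to process
        apply_gain_selection(date)

check_failed_jobs(output_basedir)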
