An Efficient Lorentz Equivariant Graph Neural Network for Jet Tagging

Shiqi Gong, Qi Meng, Jue Zhang, Huilin Qu, Congqiao Li, Sitian Qian, Weitao Du, Zhi-Ming Ma, Tie-Yan Liu

https://link.springer.com/article/10.1007/JHEP07(2022)030

Abstract: Deep learning methods have been increasingly adopted to study jets in particle physics. Since symmetry-preserving behavior has been shown to be an important factor for improving the performance of deep learning in many applications, Lorentz group equivariance - a fundamental spacetime symmetry for elementary particles - has recently been incorporated into a deep learning model for jet tagging. However, the design is computationally costly due to the analytic construction of high-order tensors. In this article, we introduce LorentzNet, a new symmetry-preserving deep learning model for jet tagging. The message passing of LorentzNet relies on an efficient Minkowski dot product attention. Experiments on two representative jet tagging benchmarks show that LorentzNet achieves the best tagging performance and improves significantly over existing state-of-the-art algorithms. The preservation of Lorentz symmetry also greatly improves the efficiency and generalization power of the model, allowing LorentzNet to reach highly competitive performance when trained on only a few thousand jets.

(left): The structure of the Lorentz Group Equivariant Block (LGEB). (right): The network architecture of the LorentzNet.

Requirements

Python>=3.7, PyTorch>=1.9, and CUDA toolkit>=10.2.
Use the following command to install required packages.
- pip install -r requirements.txt
We use 4 GPUs for training and evaluation. However, it is acceptable to use fewer or more GPUs to train or test the model, simply changing the arg --nproc_per_node=4 to the number of GPUs available.

Top Tagging

Data

Top tagging dataset is an open benchmark dataset firstly released in [1], which can be downloaded from zenodo. To extract 4-vectors and labels from the raw data, we follow the data converting process from LorentzGroupNetwork [2].

For convenience, the converted top tagging dataset can be downloaded from this link. The default data path is ./data/top/; therefore, it is recommended to save the converted dataset to this folder. Otherwise, the arg --datadir /path/to/data/ can be assigned to specify the data path.

Training

Parallelly training the LorentzNet on 4 GPUs:

python -m torch.distributed.launch --nproc_per_node=4 top_tagging.py \
        --batch_size=32 --epochs=35 --warmup_epochs=5 \
        --n_layers=6 --n_hidden=72 --lr=0.001 \
        --c_weight=0.005 --dropout=0.2 --weight_decay=0.01 \
        --exp_name=CustomName

One can assign --exp_name to identity different runs. Model with the best validation accuracy will be saved in log directory as best-val-model.pt.

Evaluation

Given --exp_name, use --test_mode to evalute the best model on the test dataset.

python -m torch.distributed.launch --nproc_per_node=4 top_tagging.py \
        --test_mode --exp_name=CustomName --batch_size=100 \
        --n_layers=6 --n_hidden=72 --c_weight=0.005

Pre-trained Model

Pre-trained model are offered in ./logs/top/pretrained/. To test the pre-trained model, use:

python -m torch.distributed.launch --nproc_per_node=4 top_tagging.py \
        --test_mode --exp_name=pretrained --batch_size=100 \
        --n_layers=6 --n_hidden=72 --c_weight=0.005

Quark-gluon Tagging

Data

Quark-gluon tagging dataset is originally used in [3], we can also download the dataset from zenodo. However, we do not need to download it manually. We use the package EnergyFlow that can automatically download the dataset when needed.

Therefore, data is automatically downloaded to ./data/qg/ by default when we firstly run qg_tagging.py. The arg --datadir /path/to/data/ can be assigned to specify the data path.

Training

Parallelly training the LorentzNet on 4 GPUs:

python -m torch.distributed.launch --nproc_per_node=4 qg_tagging.py \
        --batch_size=32 --epochs=35  --warmup_epochs=5 \
        --n_layers=6 --n_hidden=72 --lr=0.001 \
        --c_weight=0.001 --dropout=0.2 --weight_decay=0.01
        --exp_name=CustomName

One can assign --exp_name to identity different runs. Model with the best validation accuracy will be saved in log directory as best-val-model.pt.

Evaluation

Given a specific --exp_name, use --test_mode to evalute the best model on the test dataset.

python -m torch.distributed.launch --nproc_per_node=4 qg_tagging.py \
        --test_mode --exp_name=CustomName --batch_size=100 \
        --n_layers=6 --n_hidden=72 --c_weight=0.001

Pre-trained Model

Pre-trained model are offered in ./logs/qg/pretrained/. To test the pre-trained model, use:

python -m torch.distributed.launch --nproc_per_node=4 qg_tagging.py \
        --test_mode --exp_name=pretrained --batch_size=100 \
        --n_layers=6 --n_hidden=72 --c_weight=0.001

ROC Curves


ROC Curves for Top Tagging	ROC Curves for Quark-gluon Tagging

Scripts and tagging scores for different models are saved in ./scripts/TopTaggingROC/ and ./scripts/QGTaggingROC/.

Equivariance Test

Equivariance test under Lorentz boosts on top tagging dataset.

Scripts and tagging accuracies under Lorentz transformations with different $\beta$ are saved in ./scripts/EquivarianceTest.

Citation

If you find this work helpful, please cite our paper:

@article{gong2022efficient,
  author={Gong, Shiqi and Meng, Qi and Zhang, Jue and Qu, Huilin and Li, Congqiao and Qian, Sitian and Du, Weitao and Ma, Zhi-Ming and Liu, Tie-Yan},
  title={An efficient Lorentz equivariant graph neural network for jet tagging},
  journal={Journal of High Energy Physics},
  year={2022},
  month={Jul},
  day={05},
  volume={2022},
  number={7},
  pages={30},
  issn={1029-8479},
  doi={10.1007/JHEP07(2022)030},
  url={https://doi.org/10.1007/JHEP07(2022)030}
}

References

[1] Kasieczka, Gregor, et al. "The Machine Learning landscape of top taggers." SciPost Physics 7.1 (2019): 014.

[2] Bogatskiy, Alexander, et al. "Lorentz group equivariant neural network for particle physics." International Conference on Machine Learning. PMLR, 2020.

[3] Komiske, Patrick T., Eric M. Metodiev, and Jesse Thaler. "Energy flow networks: deep sets for particle jets." Journal of High Energy Physics 2019.1 (2019): 1-46.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

An Efficient Lorentz Equivariant Graph Neural Network for Jet Tagging

Table of Contents

Requirements

Top Tagging

Data

Training

Evaluation

Pre-trained Model

Quark-gluon Tagging

Data

Training

Evaluation

Pre-trained Model

ROC Curves

Equivariance Test

Citation

References

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
data		data
figures		figures
logs		logs
qg		qg
scripts		scripts
top		top
LICENSE		LICENSE
README.md		README.md
models.py		models.py
qg_tagging.py		qg_tagging.py
requirements.txt		requirements.txt
top_tagging.py		top_tagging.py
utils.py		utils.py

License

sdogsq/LorentzNet-release

Folders and files

Latest commit

History

Repository files navigation

An Efficient Lorentz Equivariant Graph Neural Network for Jet Tagging

Table of Contents

Requirements

Top Tagging

Data

Training

Evaluation

Pre-trained Model

Quark-gluon Tagging

Data

Training

Evaluation

Pre-trained Model

ROC Curves

Equivariance Test

Citation

References

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages