MoLFI (Multi-objective Log message Format Identification)

MoLFI is a tool implementing a search-based approach to solve the problem of log message format identification. More details on this approach is available in this paper:

Salma Messaoudi, Annibale Panichella, Domenico Bianculli, Lionel Briand, and Raimondas Sasnauskas. A Search-based Approach for Accurate Identification of Log Message Formats. In Proceedings of ICPC ’18: 26th IEEE/ACM International Conference on Program Comprehension (ICPC ’18). Available online at http://hdl.handle.net/10993/35286

A log message template is a two parts message; one fixed and one variable.

 For example: "File config.xml sent at 191.168.1.3"

 "File", "sent" and "at" are the fixed part because the represent an event of sending a file.

 "config.xml" and "191.168.1.3" are the variable part because they change with the occurrence of the sending event.

MoLFI uses an evolutionary approach based on NSGA-II to solve this problem.

MoLFI applies the following steps:

Pre-processing the log file (detect trivial variable parts using domain knowledge).
Run NSGA-II algorithm.
Post-processing: apply corrections to the resulting solutions.

MoLFI is implemented as a python project (v3.6+).

This package contains the source code of the tool with two executable scripts.

MoLFI.py: the script used to run MoLFI. This script can be used from the command line.
validation.py: this script is used to validate the generated templates. It will apply a comparison between the generated templates and the correct templates (the oracle files)

To re-run the experiments, the MoLFI and the datasets used for the evaluation ICPC-2018-Artifacts should be under the same directory

-> The ICPC-2018-Artifacts repository is accessible from the following link: https://github.com/SalmaMessaoudi/ICPC-2018-Artifacts.git

-> go under the folder ICPC-2018-Artifacts and run make.

Under the same folder (ICPC-2018-Artifacts), a new folder will be created (Experiments_Results) with a sub-folder called "Metrics" containing the validation scores for each dataset and another sub-folder "Validation" with the generated templates being compared with the oracle.

To run MoLFI on a single log file:

from the command line, run the MoLFI.py script and precise the following arguments:

-l : specify the log file
-f : specify the log format (e.g., "<timestamp> <level> <message>")
-p : specify where to save the generated templates.
-r : provide the regular expressions if any (one after the other, separated by a normal space)

Example:

python3 MoLFI.py -l ../MoLFI_experiments/Datasets/BGL/2K/BGL_2K_log_messages.txt -f "<message>" -p templates.pkl -r "core\.[0-9]*" "0x([a-zA-Z]|[0-9])+"

Licensing

Requirements

As listed in requirements.txt, MoLFI needs to install the following python libraries:

deap (licensed under LPGL)
numpy (licensed under BSD)
pandas (licensed under BSD)

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
main		main
test		test
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
Makefile		Makefile
MoLFI.py		MoLFI.py
Notice.md		Notice.md
README.md		README.md
definitions.py		definitions.py
requirements.txt		requirements.txt
setup.py		setup.py
validation.py		validation.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MoLFI (Multi-objective Log message Format Identification)

Licensing

Requirements

About

Releases

Packages

Languages

License

SNTSVV/MoLFI

Folders and files

Latest commit

History

Repository files navigation

MoLFI (Multi-objective Log message Format Identification)

Licensing

Requirements

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages