-
Notifications
You must be signed in to change notification settings - Fork 7
Launching MSFragger
Windows users:
Ensure that you have placed MSFragger.jar in your working directory and have modified the parameter file to reference your protein database.
Since 2.0, MSFragger supports reading both Thermo RAW
file and Bruker .d
folder directly. Download the zip
file from https://msfragger.arsci.com/upgrader/ and put the ext
folder in the same folder as the JAR file.
Determine the amount of system memory available that you would like to make available to MSFragger. This will be specified by the Java maximum heap size parameter -Xmx (e.g. -Xmx3700m for 3700 MB or -Xmx32g for 32 GB)
.
To perform a search either use a parameter file:
-
java -Xmx32g -jar MSFragger.jar <parameter file> <list of mzML/mzXML files>
. Can get three parameter file templates by runningjava -jar MSFragger.jar --config
.
Or specify options on the command line:
java -Xmx32g -jar MSFragger.jar <options> <list of mzML/mzXML files>
The -Xmx flag is very important to ensure that MSFragger has access to sufficient memory to perform the search efficiently. We recommend that you allocate a minimum of 8 GB for standard tryptic digestions.
-Xmx32g
specifies the maximum memory assigned to the Java virtual machine. In this example, the maximum value is 32 GB. This should be changed to suit your computer configuration.
Detailed command line options can be displayed with:
java -jar MSFragger.jar --help
Linux users:
A FragPipe-equivalent shell script can be found here.
Batch processing
MSFragger allows multiple MS/MS input files to be processed in a batch. Passing multiple files to MSFragger at once allows MSFragger to reuse the fragment index for subsequent MS/MS run. This is particularly important for narrow window searches which may only take fractions of a second.
On computers or clusters with many processor cores, we highly recommended setting MSFragger to process files sequentially with all available processor cores rather than running multiple instances of MSFragger in parallel (assigning a smaller number of cores to each). This reduces initialization times and allows the fragment index to be re-used, at the same time reducing overall memory requirements.