Skip to content
This repository has been archived by the owner on Feb 6, 2024. It is now read-only.

ERROR: File not found GTR #234

Open
liangminliu opened this issue Jul 9, 2022 · 1 comment
Open

ERROR: File not found GTR #234

liangminliu opened this issue Jul 9, 2022 · 1 comment

Comments

@liangminliu
Copy link

Hello all,
I am running iqtree v1.6.10 to calculate a snp dataset, but it keeps reporting errors.

command
$ /ds3200_1/proc/iqtree-1.6.10-Linux/bin/iqtree -s all.LDfilter.thinned.sort.sequences.phy -nt 10 -m GTR+ASC -bb 1000 -pre all.LDfilter. thinned -bnni -redo

This dataset was obtained from the original dataset via the thin command of vcftools.

Error log:
IQ-TREE multicore version 1.6.10 for Linux 64-bit built Feb 19 2019
Developed by Bui Quang Minh, Nguyen Lam Tung, Olga Chernomor,
Heiko Schmidt, Dominik Schrempf, Michael Woodhams.

Host: node01 (AVX2, FMA3, 6047 GB RAM)
Command: /ds3200_1/proc/iqtree-1.6.10-Linux/bin/iqtree -s all.LDfilter.thinned.sort.sequences.phy -nt 10 -m GTR+ASC -bb 1000 -pre all.LDfilter.thinned -bnni -redo
Seed: 495099 (Using SPRNG - Scalable Parallel Random Number Generator)
Time: Sat Jul 9 13:54:11 2022
Kernel: AVX+FMA - 10 threads (88 CPU cores detected)

Reading alignment file all.LDfilter.thinned.sort.sequences.phy ... Phylip format detected
Alignment most likely contains protein sequences
Alignment has 470 sequences with 160684 columns, 157215 distinct patterns
160684 parsimony-informative, 0 singleton sites, 0 constant sites
Gap/Ambiguity Composition p-value
1 10110 0.00% failed 0.00%
2 2020001 0.00% failed 0.00%
...........
..........
**** TOTAL 0.00% 470 sequences failed composition chi2 test (p-value<5%; df=10)
NOTE: minimal branch length is reduced to 0.000000622339 for long alignment
NOTE: State(s) D, Q, E, H, I, L, F, P, V not present in alignment and thus removed from Markov process to prevent numerical problems
NOTE: 9 states (see above) are not present and thus removed from Markov process to prevent numerical problems

Create initial parsimony tree by phylogenetic likelihood library (PLL)... 569.444 seconds
Generating 1000 samples for ultrafast bootstrap (seed: 495099)...
ERROR: File not found GTR

Also, I tried iqtree2 running the full dataset (above 3 million) and the same error was reported.

log:
IQ-TREE multicore version 2.1.2 COVID-edition for Linux 64-bit built Mar 30 2021
Developed by Bui Quang Minh, James Barbetti, Nguyen Lam Tung,
Olga Chernomor, Heiko Schmidt, Dominik Schrempf, Michael Woodhams.

Host: node01 (AVX2, FMA3, 6047 GB RAM)
Command: iqtree2 -s sequences.phy -T 10 -m GTR+ASC --alrt 1000 -B 1000 -mem 1600G -pre out4 -bnni -redo
Seed: 444924 (Using SPRNG - Scalable Parallel Random Number Generator)
Time: Thu Jul 7 23:25:42 2022
Kernel: AVX+FMA - 10 threads (88 CPU cores detected)

Reading alignment file sequences.phy ... Phylip format detected
Alignment most likely contains protein sequences
Alignment has 470 sequences with 3309536 columns, 3196027 distinct patterns
3309349 parsimony-informative, 187 singleton sites, 0 constant sites
Gap/Ambiguity Composition p-value
1 10110 0.00% failed 0.00%
2 2020001 0.00% failed 0.00%
..........
..........
470 ldz2020037 0.00% failed 0.00%
**** TOTAL 0.00% 470 sequences failed composition chi2 test (p-value<5%; df=10)
NOTE: minimal branch length is reduced to 0.000000030216 for long alignment
NOTE: State(s) D, Q, E, H, I, L, F, P, V not present in alignment and thus removed from Markov process to prevent numerical problems
NOTE: 9 states (see above) are not present and thus removed from Markov process to prevent numerical problems

Create initial parsimony tree by phylogenetic likelihood library (PLL)... 22022.449 seconds
Generating 1000 samples for ultrafast bootstrap (seed: 444924)...
ERROR: File not found GTR

So, can you tell me how to solve this problem?
thanks all

@sanderdebacker
Copy link

sanderdebacker commented Jan 25, 2023

Your dataset contains SNP data, which is DNA.
IQ-Tree states "Alignment most likely contains protein sequences".
The autodetect doesn't recognize your input as DNA and therefore gives an error about the GTR model.
Try adding the -st DNA argument to your command.
Hope this helps!

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants