-
Notifications
You must be signed in to change notification settings - Fork 35
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
when calculate descriptors from smiles, an error occured. #9
Comments
Hi @kexul, Sorry for the late reply! Could you provide additional information about the command you are executing? A code snippet will help me diagnose the issue further. |
I processed a big csv file which contains about 700000 smiles, you may try it here,
|
PaDEL-Descriptor sometimes times out when calculating descriptors for smaller compounds (I'm not entirely sure why, one would think their calculations would be very quick): descriptors = from_smiles('CCC') PaDELPy's "from_smiles" function tries three times to calculate the descriptors for a given compound, and if a RuntimeError is encountered all three times, the error you saw is thrown:
By default, if the generation process exceeds 12 seconds, this is seen as a failure. You can try increasing the timeout: # increase timeout to 30 seconds
descriptors = from_smiles('CCC', timeout=30) If this doesn't help, I recommend you catch the exception and perform an action to account for it: try:
descriptors = from_smiles('CCC')
except RuntimeError:
# Do something Let me know if increasing the timeout value helps! If it does, it may justify increasing the default value for the "from_smiles" and "from_mdl" functions. Best, |
I'm going to go ahead and close this issue due to inactivity. @kexul - keep me updated as to whether any of the methods I outlined above work for you! |
Hey, I have tried increasing the time out to 30 but still facing the same error
Edit:
After 5 molecule it throws the above timeout error. System specifications: I think I have found the answer, |
Hello, |
Each descriptor gives a numerical representation of some physical, chemical, or electromechanical aspect of a given compound. For example, "nN" is the number of nitrogen atoms present in the compound, "nC" is the number of carbon atoms present, etc. Some of the descriptors are somewhat ambiguous - the ATS descriptors are a measurement of autocorrelation between neighboring atoms with respect to a certain weighting, such as mass and charge. More detailed descriptions for each descriptor can be found in a spreadsheet at http://www.yapcwsoft.com/dd/padeldescriptor/ by clicking the "1875" link towards the top of the page. Best, |
Please can you specify till how much should one increase the timeout? I am getting same error for 172 smiles, and timeout used was 60. |
Hello, guys. I had the same problem and I was not able to fully solve it, but here are my two attempts. First, increasing the number of chunks and decreasing the length of the list of compounds to calculate the descriptors for. I used Second, I used the VERY BAD strategy of
That strategy did not work, even after got to One other suggestion: it would be very nice of the developers if they could share a link to a CSV with some descriptors. I would guess that they have probably tested the library for a bunch of compounds and that they have some files with a lot of descriptors. If this is not the case or sharing such a file will not be possible, forget about it. Last but not least, thank you very much for the library :) |
The error log is
PaDEL-Descriptor encountered an error: PaDEL-Descriptor timed out during subprocess call
By the way, thanks for your great job!
The text was updated successfully, but these errors were encountered: