-
Notifications
You must be signed in to change notification settings - Fork 282
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error iterating files in directories #286
Comments
Based on the error message it looks like a memory allocation error. Does this happen consistently for one file, or only after running for a while? |
The error occurs when i iterate over files in a directory for a few seconds (approx. 10-15 seconds) and appears to happen at the same file. I've also run my script over the same directory using two different computers, one with 16GB or RAM and the other with 32GB of RAM. If it's a RAM issue is there a way to clear the RAM of already scanned files so all the data is not stored in the RAM? |
Are you creating a new instance of magic.Magic() for each file, or creating one and re-using it? |
I'm using in a for loop, so the way I have it setup is that I created one instance of it and then re-using it by feeding a file path via a variable. |
Can you share the file that seems to trigger this? Which version of libmagic are you using? |
Here is the snippet of code that is causing the issue. I'm using version: 0.4.14; I'm using Python 3.9. def file_exists():
|
Are you able to share the input file that triggers it? What version of libmagic are you using? |
It appears that I did not have python lib-magic installed . . . I tried to install it but the install is a bit problematic. Is the a trick to it? |
If you were running into this error it looks like you do have libmagic installed, that's what produces the error. |
Also this issue comes up only when running the script via PowerShell/Cmd. I ran the same code in my WSL and it seems to work completely fine and does not throw the error. |
Same error as #276 which was merged into #293. These input files trigger the issue:
Repro in Windows 10 Pro Sandbox:
error details
See also:
|
Thanks for the repo; this is definitely due to the older version of libmagic shipped with python-magic-bin. Just another case where the binaries situation causes trouble. |
When I iterate over files to get their MIME type the magic library is able to provide the MIME type of some of the files and then it will through an error:
File "C:\Users\Chad\AppData\Local\Programs\Python\Python310\lib\site-packages\magic\magic.py", line 89, in from_file
return maybe_decode(magic_file(self.cookie, filename))
File "C:\Users\Chad\AppData\Local\Programs\Python\Python310\lib\site-packages\magic\magic.py", line 255, in magic_file
return magic_file(cookie, coerce_filename(filename))
File "C:\Users\Chad\AppData\Local\Programs\Python\Python310\lib\site-packages\magic\magic.py", line 196, in errorcheck_null
raise MagicException(err)
magic.magic.MagicException: b"line I64u: regex error 14 for `^[[:space:]]*class[[:space:]]+[[:digit:][:alpha:]:]+[[:space:]]\{(.[\n])\}(;)?$', (failed to get memory)"
PS C:\Users\Chad\Documents\GitHub\Python_practice_scripts>
The text was updated successfully, but these errors were encountered: