-
Notifications
You must be signed in to change notification settings - Fork 4.7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Rasa 1.10.14 Memory Leakage #7690
Comments
👋🏻 hey @OmarFarag95 thanks a lot for your bug report. Would it be a possibility for you to upgrade to:
That would help us diagnose the issue and make it easier to come up with a fix (especially if you upgrade to 2.2.4). LMK if it's not on the table 😄 |
Hi @m-vdb , Thank you for your reply. I have tried to upgrade to 2.2.4. However, the bug is still persisting. |
OK, (sadly) good to know. I'll circle back with the team and find out the best way forward. Sorry about the inconvenience 😓 |
Never mind! I will be looking forward to your response 😸 |
I just snooped around in the code for the crf entity extractor and had a quick look at the differences between v 1.3.5 and v 1.10.14. Although it is not a change, in terms of memory generally a thing that caught my eye is all_possible_transitions=True in the crf function call. The doc for this feature reads:
One possible idea is, that something minor changed in |
@OmarFarag95 would it be possible for you to provide a few more stats on the dataset you are using for context? I know you are using a sizable dataset of 120k sentences. How about (roughly)
|
Also, I am assuming that the system already crashes during the first iteration, is that right @OmarFarag95 ? |
@twerkmeister Thanks for your response.
|
Hey @OmarFarag95, much appreciated! This will make tracking down the problem easier 👍 |
Rasa version: 1.10.14
Rasa SDK version (if used & relevant):
Rasa X version (if used & relevant):
Python version: 3.6.9
Operating system (windows, osx, ...): linux
Issue:
I was training
CRF
model on about 120K sentences for entity recognition onRasa 1.3.5
and things were going just fine. However, upon upgrading toRasa 1.10.14
; I am always getting a memory leakage (whereasa RAM tends to increase significantly until the process is killed).I am using
Google Colab
with available RAM of 25GBOne important note, is that training
DIET classifier
on the same dataset size andRasa 1.10.14
works just fine. so the issue is within theCRF
itself.Error (including full traceback):
Command or request that led to error:
Content of configuration file (config.yml) (if relevant):
Content of domain file (domain.yml) (if relevant):
Definition of Done
main
state (consider using Vincent'sspacy-3.0
branch)The text was updated successfully, but these errors were encountered: