Capture memory usage #2330

jkyang92 · 2021-11-24T17:01:12Z

capture seems to have a slow memory leak. Running the following loop on my system increases M2's memory usage by ~0.5MB/s seemingly indefinitely. (I ran it till M2 was using 300MB).

while true do (scan(apply(100,i-> "x = 1;y = 2;"),capture);collectGarbage())

The following loop seems to leak much faster (1GB within 15min), but it's harder to figure out what's leaking:

needsPackage "SpecialFanoFourfolds"
while true do (scan(values tests SpecialFanoFourfolds,capture @@ code);collectGarbage())

In contrast the following loop does not cause an increase in memory usage:

while true do (scan(apply(100,i-> "x = 1;y = 2;"),v -> (x = 1;y = 2;));collectGarbage())

Valgrind does not seem to detect a leak.

The text was updated successfully, but these errors were encountered:

mahrud · 2021-11-24T21:28:16Z

Not sure if you saw #1689, but I think this is a great feature of capture:

use capture for memory leak detection and memory benchmarking!

In other words, I think the first example you have is evidence that there's a leak, but in M2's interpreter.

Also see #1728 (comment), where this was first noticed.

jkyang92 · 2021-11-27T16:42:56Z

Okay, after messing the code, it seems we are leaking DictionaryList objects (and consequently dictionaries and all the data they contain). It seems that we keep a list of all dictionaries in expr.d? (allDictionaries variable and the record function) Can a dictionary ever get collected?

DanGrayson · 2021-11-28T20:49:51Z

Yes, dictionaries don't go away -- that's so the dictionary can be recovered from the frame, which contains an integer called the frame ID. An alternative (and perhaps better) way to do it would be for each frame to contain a pointer to its dictionary.

mahrud · 2021-11-29T00:05:50Z

Oh, I had assumed that when we call collegeGarbage in capture, discarded dictionaries get collected! But in reality nearly nothing that is assigned to a symbol is ever collected?! That seems like an oversight that should be fixed.

It would be great to also document how frames work along the way.

DanGrayson · 2021-11-29T14:16:35Z

The dictionary caching has nothing to do with being assigned to a symbol -- they are cached so the dictionary can be recovered from the frame id, and that could be done away with as I sketched above. (The dictionaries cached contain just symbols, not "symbol closures". It's the symbol closures that point to frames containing symbol values.)

The number of dictionaries is limited -- once you have loaded all the code, no more dictionaries are made. Thus the non-collection of dictionaries is not a memory leak problem.

mahrud · 2021-11-29T14:42:58Z

The dictionary caching has nothing to do with being assigned to a symbol -- they are cached so the dictionary can be recovered from the frame id, and that could be done away with as I sketched above. (The dictionaries cached contain just symbols, not "symbol closures". It's the symbol closures that point to frames containing symbol values.)

Could you please explain or better yet the document this system somewhere on the wiki? Even starting with some common terminology would be good.

The number of dictionaries is limited -- once you have loaded all the code, no more dictionaries are made. Thus the non-collection of dictionaries is not a memory leak problem.

capture makes new dictionaries every time. If you run hundreds of tests, that's the equivalent of hundreds of new dictionaries.

jkyang92 · 2021-11-29T14:47:24Z

Incidentally value(String) also creates new dictionaries.

DanGrayson · 2021-11-29T22:29:52Z

Could you please explain or better yet the document this system somewhere on the wiki? Even starting with some common terminology would be good.

Recall from LISP the notion of function closure, or simply closure. When one returns a function as a value, that function closure remembers the values of the variables at the time it was created. Here is an example in M2:

i1 : f = x -> () -> x

o1 = f

o1 : FunctionClosure

i2 : g = f 33

o2 = g

o2 : FunctionClosure

i3 : g()

o3 = 33

i4 : h = f 44

o4 = h

o4 : FunctionClosure

i5 : h()

o5 = 44

The code () ->x is referred to as a function body. The lexical scope of the function body has associated with it a dictionary, whose single symbol is x, but in general it will contain all the new variables appearing in the body of the function. When the function closures g and h are created by calling f, the function body is paired with a "frame", which is an array of expressions providing the values of the variables in the lexical scope of the body of the function. So g has a frame containing 33 and h has a frame containing 44. The symbol x contains information that specifies where in the frame its value is stored, and when g and h are executed, they look into the frame at that spot to retrieve the value.

Similarly, M2 has "symbol closures". Here is an example:

i1 : f = x -> () -> symbol x

o1 = f

o1 : FunctionClosure

i2 : g = f 33

o2 = g

o2 : FunctionClosure

i3 : g()

o3 = x

o3 : Symbol

i4 : value oo

o4 = 33

i5 : h = f 44

o5 = h

o5 : FunctionClosure

i6 : h()

o6 = x

o6 : Symbol

i7 : value oo

o7 = 44

All the symbols appearing at top level are actually symbol closures in this sense. They contain a symbol and a pointer to the appropriate frame where the value is stored.

The number of dictionaries is limited -- once you have loaded all the code, no more dictionaries are made. Thus the non-collection of dictionaries is not a memory leak problem.

capture makes new dictionaries every time. If you run hundreds of tests, that's the equivalent of hundreds of new dictionaries.

Ah, thanks for the reminder. So maybe it is time to implement the fix I proposed above. (On the other hand, dictionaries aren't very large, so it may not be urgent.)

jkyang92 · 2021-12-01T01:46:47Z

Not related to the Dictionary issue, but the following loop also leaks, independent of the previous issue. (I made record in expr.d a no-op, most code doesn't need it to actually work).

testCode = ///
    R = QQ[x,y,z];
    x+1;
    assert ( 1 == 1 )
///
scan(1000, i-> (capture(testCode,UserMode=>false);collectGarbage()));

I can test it later but I'm fairly certain it's leaking the polynomial ring on every cycle, because if I replace the first line of the test code with nearly anything else, we don't leak anymore.

I think this causes most tests to leak, since most tests create polynomial rings.

mahrud · 2021-12-01T01:50:50Z

Just to make sure, you're disabling Usermode, right?

jkyang92 · 2021-12-01T01:55:16Z

Yep, it's right there in the UserMode=>false. Also, it actually doesn't seem to leak with UserMode=>true.

jkyang92 · 2021-12-01T02:18:05Z

I think I figured this out. The problem is how globalFrame in expr.d interacts with calls to new Dictionary. When you call new Dictionary in M2 you are actually creating a new DictionaryClosure in the D code. This closure uses globalFrame, and all values get stored there. The problem is after the dictionary for them goes out of scope, these values can no longer be removed and get collected. And of course the globalFrame itself will never be collected.

mahrud · 2023-03-13T18:54:48Z

Referencing a comment here:

I've ran some checks, including https://github.com/Macaulay2/M2/blob/048c777dc703365e44c3865b2c1faed698b82b3a/bugs/anton/MEMORY-LEAKS/NAG-leaks.m2
Everything seems to be fine.

BTW, if anyone has ideas on how to improve the memory leak testing (from the frontend) above, please let me know.

Originally posted by @antonleykin in #2770 (comment)

I wonder if after fixing the problem above, we could use capture to detect memory leaks from the frontend.

jkyang92 mentioned this issue Nov 28, 2021

check_254 "Core" failing during "M2 --check 2" on i386 #1834

Open

jkyang92 mentioned this issue Dec 2, 2021

Null out values in the dictionaries created by capture before returning #2344

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Capture memory usage #2330

Capture memory usage #2330

jkyang92 commented Nov 24, 2021

mahrud commented Nov 24, 2021

jkyang92 commented Nov 27, 2021

DanGrayson commented Nov 28, 2021

mahrud commented Nov 29, 2021 •

edited

Loading

DanGrayson commented Nov 29, 2021

mahrud commented Nov 29, 2021

jkyang92 commented Nov 29, 2021

DanGrayson commented Nov 29, 2021

jkyang92 commented Dec 1, 2021

mahrud commented Dec 1, 2021

jkyang92 commented Dec 1, 2021

jkyang92 commented Dec 1, 2021

mahrud commented Mar 13, 2023

Capture memory usage #2330

Capture memory usage #2330

Comments

jkyang92 commented Nov 24, 2021

mahrud commented Nov 24, 2021

jkyang92 commented Nov 27, 2021

DanGrayson commented Nov 28, 2021

mahrud commented Nov 29, 2021 • edited Loading

DanGrayson commented Nov 29, 2021

mahrud commented Nov 29, 2021

jkyang92 commented Nov 29, 2021

DanGrayson commented Nov 29, 2021

jkyang92 commented Dec 1, 2021

mahrud commented Dec 1, 2021

jkyang92 commented Dec 1, 2021

jkyang92 commented Dec 1, 2021

mahrud commented Mar 13, 2023

mahrud commented Nov 29, 2021 •

edited

Loading