
doc: document how to examine --trace_gc output #372

Closed

Conversation

HarshithaKP
Member

No description provided.

@sam-github

Slightly off topic, perhaps, but is all this information eventually going to end up in https://nodejs.org/en/docs/guides/ ? It'd be a bit more findable there.

@gireeshpunathil
Member

@sam-github - yes, according to #211 (comment) and the thread beneath it.


```
[PID: isolate] <time taken since GC started in ms>: <type/phase of GC> <heap used before GC call in MB> (<allocated heap before GC call in MB>) -> <heap used after GC in MB> (<allocated heap after GC in MB>) <time spent in GC in ms> [<reason for GC>]
```

Member

This is hard to read. A graphic where we point to the sections and say what they are might work better.

If we have examples of how to use the traces to identify problems, that would also be good. For example, in terms of running out of memory: if you first see that the old space size is continually increasing but it takes too long to hit the max, setting the max old space size to a smaller value can help.

Another one is that if you see the time spent in GC is continually increasing, or is a large portion of the overall run time, that can mean you are short on memory even if you don't OOM.
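
As an illustration of both points, here is a minimal sketch of an allocation pattern that keeps old space growing (the file name `leak.js` is hypothetical). Run with `--trace-gc` it emits lines in the format quoted above, and adding `--max-old-space-size` turns the slow growth into a quick, reproducible out-of-memory failure:

```js
// leak.js — hypothetical sketch of an unbounded allocation pattern.
// Run with:
//   node --trace-gc leak.js
// and, to make a genuine leak fail fast, cap the old generation:
//   node --trace-gc --max-old-space-size=50 leak.js
const retained = [];

setInterval(() => {
  // Holding the reference prevents collection, so the "heap used after GC"
  // figure in each trace line keeps climbing until the limit is hit.
  retained.push(new Array(100000).fill(Math.random()));
}, 100);
```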

Member Author

@mhdawson, Modified according to your suggestion. PTAL.

## Examples of diagnosing memory issues with trace option:

A. How to get context of bad allocations using --trace-gc
1. Suppose we observe that the old space is ocntinously increasing.

Member

Suggested change
- 1. Suppose we observe that the old space is ocntinously increasing.
+ 1. Suppose we observe that the old space is continously increasing.

Member Author

@mhdawson, thanks. Fixed it.

B. How to assert whether too many gc are happening or too many gc is causing an overhead

Member

@mhdawson Apr 2, 2020

Suggested change
- B. How to assert whether too many gc are happening or too many gc is causing an overhead
+ B. How to assert whether too many gcs are happening or too many gcs are causing an overhead

Member Author

@mhdawson, thanks. Fixed it.

1. Review the trace data, specifically around time between consecutive gcs.
2. Review the trace data, specifically around time spent in gc.
3. If the time between two gc is less than the time spent in gc, the application is sseverely starving.

Member

Suggested change
- 3. If the time between two gc is less than the time spent in gc, the application is sseverely starving.
+ 3. If the time between two gc is less than the time spent in gc, the application is severely starving.

Member Author

ditto.

4. If the time between two gc and the time spent in gc are very high, probably the application can use a smaller heap

Member

Suggested change
- 4. If the time between two gc and the time spent in gc are very high, probably the application can use a smaller heap
+ 4. If the time between two gcs and the time spent in gc are very high, probably the application can use a smaller heap

Member Author

ditto

5. If the time between two gc is much greater than the time spent in gc, application is relatively healthy

Member

Suggested change
- 5. If the time between two gc is much greater than the time spent in gc, application is relatively healthy
+ 5. If the time between two gcs is much greater than the time spent in gc, application is relatively healthy

Member Author

ditto

6. While the actual numbers for these metrics change from workload to workload, a reasonable gap between gcs is 20 minutes, and a reasonable gc time is < 100 ms.

Member

I had the same thought as @Flarna as well. 20 minutes seems too long. Do we have something to back up suggestions for particular numbers?

Member Author

No. I don’t have any data / proof points, so I simply removed it.
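
To make the gap-versus-pause comparison described in the steps above concrete, here is a rough sketch that post-processes captured `--trace-gc` output. It assumes lines roughly follow the format template quoted earlier; the exact layout varies between V8 versions, so the regular expression may need adjusting, and the file names are placeholders:

```js
// gc-overhead.js — hypothetical helper.
// Usage:
//   node --trace-gc app.js > gc.log 2>&1
//   node gc-overhead.js gc.log
'use strict';
const fs = require('fs');

const file = process.argv[2];
if (!file) {
  console.error('usage: node gc-overhead.js <gc.log>');
  process.exit(1);
}

// Capture: [1] timestamp of the GC (ms since start), [2] GC type/phase,
// [3] time spent in the GC itself (ms), loosely following the template above.
const re = /^\[[^\]]+\]\s+([\d.]+)\s*(?:ms)?\s*:\s*(\S+).*->\s*[\d.]+\s*\(\s*[\d.]+\s*\)\s*(?:MB,?\s*)?([\d.]+)/;

let prev = null;
let totalPause = 0;
let count = 0;
for (const line of fs.readFileSync(file, 'utf8').split('\n')) {
  const m = re.exec(line);
  if (!m) continue;
  const at = Number(m[1]);     // when this GC ran
  const pause = Number(m[3]);  // how long it took
  if (prev !== null && at - prev < pause) {
    // Gap between consecutive GCs is smaller than the pause itself:
    // the "severely starving" case described above.
    console.log(`starving near ${at} ms: gap ${(at - prev).toFixed(1)} ms < pause ${pause} ms (${m[2]})`);
  }
  prev = at;
  totalPause += pause;
  count += 1;
}
console.log(`${count} GCs, ${totalPause.toFixed(1)} ms total pause`);
```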

4. Reduce `--max-old-space-size` such that the total heap is closer to the limit.
5. Allow the program to run and hit the out-of-memory error.
6. The produced log shows the failing context.

Member

We might add another which is similar, except we reduce the old space size in order to ensure we actually have an OOM. If we see the heap is continually increasing but we don't OOM for a long time, reducing the old space size can help confirm it is an actual leak versus the heap just increasing because there is lots of space and no need to GC, or to GC aggressively enough.
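
For instance, a capped re-run makes the distinction visible (the file name and the 100 MB figure below are placeholders): a genuine leak keeps climbing in the trace and still ends in an out-of-memory crash, while a heap that was only growing for lack of GC pressure levels off below the new limit.

```
# default limits: the heap may grow simply because there is little pressure to collect
node --trace-gc app.js

# capped old space: forces the collector to work, so a real leak fails fast
node --trace-gc --max-old-space-size=100 app.js
```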

Member Author

@mhdawson, added another example. PTAL.

@gireeshpunathil
Member

ping @mhdawson, @Flarna - looks like your review comments are addressed; could you please have another look?

@mmarchini
Contributor

mmarchini commented Apr 9, 2020

Do we want to document a V8 flag that is not public API, given that we have no control over it being removed or changed?

Edit: nevermind, it's already documented

@HarshithaKP
Member Author

#node --v8-options | grep "trace-gc "
  --trace-gc (print one trace line following each garbage collection)
#

it is documented here.

gireeshpunathil pushed a commit that referenced this pull request Apr 10, 2020
PR-URL: #372
Reviewed-By: Gireesh Punathil <[email protected]>
Reviewed-By: Gerhard Stöbich <[email protected]>
@gireeshpunathil
Member

landed in 4e8b0bc, thanks for the contribution!
