Stats memory usage can probably be improved quite a bit #3585

jmarantz · 2018-06-11T13:59:41Z

#3508 is about not having to allocate stats memory as an NxM block. But really the much bigger prize here is to use a lot less stats memory by not storing gigabytes of repeated strings. Why we want the fully elaborated stat name in memory at all?

If we represented stats structurally with a format string with named variable substitutions, e.g.
prefix.$var1.keyword.$var2
and variable assignments:
var1="xxxx"
var2="yyyy"

The static strings there "prefix.$var1.keyword.$var2" and "var1" and "var2" would not be needed in dynamic memory at all but could live in the code text as a static const char[] or one of those lazy-static-initialized structs of strings. All we'd need to keep in dynamic memory are the substitutions xxxx and yyyy in that case. And I think typically many stats would have the same valued substitutions which could share the same memory. This would be a little complex as a shared-memory block, depending on whether we need to free and reallocate within the shm-block.

I was thinking about this because we are starting to count stats memory in the gigabytes (reference earlier bug (#3463) where uint32 wasn't enough for byte offsets for @ggreenway). Most of this memory is for strings, most of which are really variations on common patterns. It'd be nice to ultimately use that memory for data cache.

I think this would speed things up too -- I was kind of going in that direction in my earlier optimizations (I got a working prototype based on structured substitutions instead of regexes) but I found I was able to get enough of the startup speed improvements with hacks to skip regex lookups, but it'd be faster still if we just used a better rep in the first place. But the main goal here is scalability.

Of course RawStatData could have a name() method which could elaborate a string for printing for debug or whatever, but I think many structured stat sinks would benefit from having the structure be explicit in the representation. Of course we'd have to make maps of stats know about this structure to avoid hanging onto the elaborated string data.

A simpler variation on the above was suggested by @htuch is to store arrays of "." separated tokens, each of which could be part of a symbol table. This would be simpler to integrate but still require complex regex-based tag-token extraction.

A few questions remain about how to do this, especially in the context of hot restart, but opening this issue now to collect discussion.

htuch · 2018-07-13T21:00:57Z

@ambuc here's the potato, let me know if you want to hold it.

ambuc · 2018-07-23T15:15:20Z

There's a symbol table PR out here: #3927.

…olTable API without taking locks. (#5414) Adds an abstract interface for SymbolTable and alternate implementation FakeSymbolTableImpl, which doesn't take locks. Once all stat tokens are symbolized at construction time, this FakeSymbolTable implementation can be deleted, and real-symbol tables can be used, thereby reducing memory and improving stat construction time per #3585 and #4980 . Note that it is not necessary to pre-allocate all elaborated stat names because multiple StatNames can be joined together without taking locks, even in SymbolTableImpl. This implementation simply stores the characters directly in the uint8_t[] that backs each StatName, so there is no sharing or memory savings, but also no state associated with the SymbolTable, and thus no locks needed. Risk Level: low Testing: //test/common/stats/... Signed-off-by: Joshua Marantz <[email protected]>

…olTable API without taking locks. (envoyproxy#5414) Adds an abstract interface for SymbolTable and alternate implementation FakeSymbolTableImpl, which doesn't take locks. Once all stat tokens are symbolized at construction time, this FakeSymbolTable implementation can be deleted, and real-symbol tables can be used, thereby reducing memory and improving stat construction time per envoyproxy#3585 and envoyproxy#4980 . Note that it is not necessary to pre-allocate all elaborated stat names because multiple StatNames can be joined together without taking locks, even in SymbolTableImpl. This implementation simply stores the characters directly in the uint8_t[] that backs each StatName, so there is no sharing or memory savings, but also no state associated with the SymbolTable, and thus no locks needed. Risk Level: low Testing: //test/common/stats/... Signed-off-by: Joshua Marantz <[email protected]>

…olTable API without taking locks. (envoyproxy#5414) Adds an abstract interface for SymbolTable and alternate implementation FakeSymbolTableImpl, which doesn't take locks. Once all stat tokens are symbolized at construction time, this FakeSymbolTable implementation can be deleted, and real-symbol tables can be used, thereby reducing memory and improving stat construction time per envoyproxy#3585 and envoyproxy#4980 . Note that it is not necessary to pre-allocate all elaborated stat names because multiple StatNames can be joined together without taking locks, even in SymbolTableImpl. This implementation simply stores the characters directly in the uint8_t[] that backs each StatName, so there is no sharing or memory savings, but also no state associated with the SymbolTable, and thus no locks needed. Risk Level: low Testing: //test/common/stats/... Signed-off-by: Joshua Marantz <[email protected]> Signed-off-by: Fred Douglas <[email protected]>

mattklein123 added enhancement Feature requests. Not bugs or questions. help wanted Needs help! labels Jun 11, 2018

jmarantz mentioned this issue Jun 12, 2018

stats: refactor to enable non-hot-restart stats to have distinct representation from hot-restart stats. #3606

Merged

This was referenced Jun 22, 2018

stats: Sink CounterImpl and GaugeImpl into stats_impl.cc; no need to have them in the .h #3701

Merged

HeapStatData with a distinct allocation mechanism for RawStatData #3710

Merged

htuch assigned ambuc Jul 13, 2018

ambuc mentioned this issue Aug 16, 2018

stats: refactoring MetricImpl without strings #4190

Merged

PiotrSikora mentioned this issue Aug 18, 2018

Per listener and per cluster memory overhead is too high #4196

Closed

ambuc mentioned this issue Aug 28, 2018

stats: symbolize strings in HeapStatData and ThreadLocalStore #4281

Closed

mattklein123 added this to the 1.9.0 milestone Nov 16, 2018

mattklein123 modified the milestones: 1.9.0, 1.10.0 Dec 14, 2018

jmarantz mentioned this issue Dec 25, 2018

stats: Add fake symbol table as an intermediate state to move to SymbolTable API without taking locks. #5414

Merged

jmarantz mentioned this issue Mar 4, 2019

stats: Use SymbolTable API for creating and representing stat names. #6161

Merged

mattklein123 modified the milestones: 1.10.0, 1.11.0 Mar 11, 2019

jmarantz assigned jmarantz and unassigned ambuc Apr 29, 2019

mattklein123 modified the milestones: 1.11.0, 1.12.0 Jul 3, 2019

mattklein123 modified the milestones: 1.12.0, 1.13.0 Oct 10, 2019

mattklein123 removed this from the 1.13.0 milestone Dec 5, 2019

mattklein123 added this to the 1.14.0 milestone Dec 5, 2019

jmarantz mentioned this issue Jan 21, 2020

stats: integrate real symbol table into stats system #4980

Merged

mattklein123 modified the milestones: 1.14.0, 1.15.0 Mar 18, 2020

mattklein123 closed this as completed in #4980 Jun 28, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stats memory usage can probably be improved quite a bit #3585

Stats memory usage can probably be improved quite a bit #3585

jmarantz commented Jun 11, 2018

htuch commented Jul 13, 2018

ambuc commented Jul 23, 2018

Stats memory usage can probably be improved quite a bit #3585

Stats memory usage can probably be improved quite a bit #3585

Comments

jmarantz commented Jun 11, 2018

htuch commented Jul 13, 2018

ambuc commented Jul 23, 2018