Add interpreter emulator for ECS in inliner #7785

cathyzhyi · 2019-11-18T19:28:59Z

This change adds the infrastructure to emulate the interpreter
execution during estimate code size of target in inliner.

Also the loop looking for callsites in ECS is refactored and moved
to findAndCreateCallsitesFromBytecodes. ECS will be changed to call
this function in a following commit.

issue: #6204
Signed-off-by: Yi Zhang [email protected]

andrewcraik · 2019-11-19T16:18:56Z

FYI @efferifick since you are also working in this space

runtime/compiler/optimizer/InterpreterEmulator.hpp

andrewcraik · 2019-11-20T15:17:25Z

What is the lifetime of the CallSites that are allocated? to make the code easier to maintain, I'd almost suggest that the CallSites be allocated in a region held by the interpreter object so that if we wanted to generate this info and throw it away at some later point during the compile we would have that flexibility. Baking use of the compilation heap in for something as big as this seems a bit dirty - it would be better to control when the memory is freed.

andrewcraik · 2019-11-20T15:18:41Z

Why is all the operand tracking etc done using the stack region rather than a region specifically tied to the lifetime of interpretation so we can free it as soon as the interpreter is done running to keep the footprint lower?

runtime/compiler/optimizer/InterpreterEmulator.cpp

runtime/compiler/optimizer/EstimateCodeSize.hpp

andrewcraik

Just marking that I requested changes and further review will be needed once the PR is updated/commented on.

cathyzhyi · 2019-11-26T19:30:29Z

What is the lifetime of the CallSites that are allocated? to make the code easier to maintain, I'd almost suggest that the CallSites be allocated in a region held by the interpreter object so that if we wanted to generate this info and throw it away at some later point during the compile we would have that flexibility. Baking use of the compilation heap in for something as big as this seems a bit dirty - it would be better to control when the memory is freed.

TheCallsites would be needed for the actually inlining as well so they have been allocated on heap memory in estimiate code size as well as in other places in inliner. Allocating to memory that can be reclaimed when the inlining is finished would be a good idea but is probably out of the scope of this item.

cathyzhyi · 2019-11-26T21:42:00Z

Why is all the operand tracking etc done using the stack region rather than a region specifically tied to the lifetime of interpretation so we can free it as soon as the interpreter is done running to keep the footprint lower?

Thanks for the suggestion. Allocated new memory region at the beginning of interpretation, namely findAndCreateCallsitesFromBytecodes so that as soon as findAndCreateCallsitesFromBytecodes returns the memories for operand tracking are release.

andrewcraik · 2019-11-29T14:15:47Z

Allocating to memory that can be reclaimed when the inlining is finished would be a good idea but is probably out of the scope of this item.

Could you create an issue for this? It is definitely something we should look at when we need to shrink the footprint again...

runtime/compiler/optimizer/EstimateCodeSize.hpp

runtime/compiler/optimizer/InterpreterEmulator.hpp

runtime/compiler/optimizer/InterpreterEmulator.cpp

andrewcraik · 2019-12-02T15:38:52Z

Overall, this is getting close. Is the trace in the base bytecode iterator enough to know which cases etc the interpeter is processing? There is some debug trace in the methods you added, but it doesn't quite seem like enough to follow the whole interpretation - I'm guessing the 'missing' bits come from the base?

cathyzhyi · 2019-12-02T21:44:08Z

Overall, this is getting close. Is the trace in the base bytecode iterator enough to know which cases etc the interpeter is processing? There is some debug trace in the methods you added, but it doesn't quite seem like enough to follow the whole interpretation - I'm guessing the 'missing' bits come from the base?

There is InterpreterEmulator::dumpStack() that dumps the stack content and the current bytecode. findAndCreateCallsitesFromBytecodescalls this after executing each bytecode.

cathyzhyi · 2019-12-03T17:43:19Z

Allocating to memory that can be reclaimed when the inlining is finished would be a good idea but is probably out of the scope of this item.

Could you create an issue for this? It is definitely something we should look at when we need to shrink the footprint again...

Created an issue to track the problem. #7950.

This change adds the infrastructure to emulate the interpreter execution during estimate code size of target in inliner. Also the loop looking for callsites in ECS is refactored and moved to `findAndCreateCallsitesFromBytecodes`. ECS will be changed to call this function in a following commit. Signed-off-by: Yi Zhang <[email protected]>

andrewcraik · 2019-12-05T15:18:49Z

Jenkins test sanity all jdk11

andrewcraik · 2019-12-09T16:59:40Z

the failed job is just infra - the rest of the testing is sufficient to merge this

cathyzhyi force-pushed the emulator branch from 5fc3c2b to 1454cc0 Compare November 18, 2019 19:48

andrewcraik self-assigned this Nov 19, 2019

andrewcraik added comp:jit enhancement labels Nov 19, 2019

andrewcraik reviewed Nov 20, 2019

View reviewed changes

runtime/compiler/optimizer/InterpreterEmulator.hpp Outdated Show resolved Hide resolved

andrewcraik reviewed Nov 20, 2019

View reviewed changes

runtime/compiler/optimizer/InterpreterEmulator.hpp Outdated Show resolved Hide resolved

andrewcraik reviewed Nov 20, 2019

View reviewed changes

runtime/compiler/optimizer/InterpreterEmulator.hpp Outdated Show resolved Hide resolved

andrewcraik reviewed Nov 20, 2019

View reviewed changes

runtime/compiler/optimizer/InterpreterEmulator.cpp Outdated Show resolved Hide resolved

andrewcraik reviewed Nov 20, 2019

View reviewed changes

runtime/compiler/optimizer/EstimateCodeSize.hpp Outdated Show resolved Hide resolved

andrewcraik mentioned this pull request Nov 26, 2019

SharedClasses.SCM23.MultiCL_0 times out on Windows #7178

Closed

andrewcraik suggested changes Nov 26, 2019

View reviewed changes

cathyzhyi force-pushed the emulator branch from 1454cc0 to 7b5e7ad Compare November 27, 2019 02:53

cathyzhyi requested a review from andrewcraik November 27, 2019 02:55

cathyzhyi force-pushed the emulator branch 3 times, most recently from fd62cf4 to 00036be Compare November 27, 2019 19:18