Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve Java kernel crash detection and error reporting #66

Open
aion-kelvin opened this issue Jul 26, 2019 · 0 comments
Open

Improve Java kernel crash detection and error reporting #66

aion-kelvin opened this issue Jul 26, 2019 · 0 comments

Comments

@aion-kelvin
Copy link
Contributor

If the Java kernel crashes, node_test_harness doesn't really notice. The test cases just fail and vaguely say TimeoutException because the transactions it is expecting to complete haven't completed in the allotted time.

This problem is exacerbated when running in the Jenkins CI of aionnetwork/aion because it deletes workspaces right after execution, so the heap dump log file is deleted. Even if it weren't, it'd require logging into the CI host and knowing to look for a heapdump file (since the crash message from the kernel isn't visible to node_test_harness, you'd basically have to blindly check if a crash happened by looking for the heap dump log file).

Idea: when tailing the kernel log, look for the message that the process crashed. Save the location of the heapdump log that's in the message. Tell the user a crash happened and print out the log file (or save it to some conveniently accessible place).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant