-
Notifications
You must be signed in to change notification settings - Fork 270
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
C++: ObjectAllocator_Destructor: Assertion `allocator->nb_inuse == 0' failed #263
Comments
Oddly, there is no Take a look at the implementation of ``rmt_UnbindOpenGL` for an example: Line 9554 in e862ba4
GPU profilers have a bunch of query data that will be in transit between the various queues and the assert message is telling you the app is shutting down without freeing them. Adding an equivalent |
Interesting. Why is the app shutting down though? This is happening after I call |
Have you tried calling |
I haven't done it in this sample, however in my application I have multiple cuda streams that are created with cudaStreamCreate, but the same error occurs |
Right, but this app isn't a valid repro until the streams are created. As I said: I have no idea what CUDA will do internally if you try to use its API (like Remotery does) without creating the stream first. Already I can see code inside |
I have just tried calling Here is the code repro that I am using:
That's the output I get:
Is there anything else that I should be aware of to use the CUDA API? |
OK! That makes a lot more sense. The code here is failing: Line 7914 in e862ba4
Points:
So your first port of call is to find out why this code is failing: |
The constructor function calls So I adapted the code to initialize cuda with with the following output:
And here is the updated repro code that I use
|
Hello! I am trying to profile my cuda program, however it results in assertion errors.
I have created a minimal reproducing example below:
Building, linking and running the above script results in the following output:
The CPU sampling works perfectly. I would like to make CUDA sampling work as well, any help is appreciated.
I was able to successfully build Remotery after the changes suggested in #262
The text was updated successfully, but these errors were encountered: