more traces? #66
Comments
Additionally, it appears that the functionality to parse the text files Transformer_HybridParallel.txt and Transformer_HybridParallel_Fwd_In_Bckwd.txt is missing.
These text files are an artifact of ASTRA-sim 1.0, not Chakra. The best way to get these traces is to collect them by running a PyTorch model with the profiler enabled. Are you looking for instructions on how to collect them yourself?
Thank you for your response. I appreciate the instructions on the Wiki and find them clear. I'm just interested in whether I could obtain the measured traces from your end, particularly those involving many nodes, as they would be highly beneficial for my simulation.
Yes, I understand! What scale are you looking at? We are updating the comms group info in PyTorch and collecting a few traces. I will check whether we can share them externally. We do eventually want to set up a DB of traces, but hosting the DB and keeping the traces up to date are TBD.
Understood. I currently need some traces for transformers and LLaMA involving tens of nodes. Once again, I really appreciate your outstanding work!
Are there any traces available now?
If more multi-node traces could be made available, it would be very helpful to me. Thank you!
Hi,
Could you please share more execution traces (ETs), such as the LLaMA traces you mentioned in previous issues?
Currently, I only have the traces converted from ASTRA-sim 1.0 and the Megatron trace mentioned in issue #176.
It would be really helpful if you could share more traces.
Thanks!