Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About chakra_trace_link #173

Open
Yan824 opened this issue Dec 10, 2024 · 0 comments
Open

About chakra_trace_link #173

Yan824 opened this issue Dec 10, 2024 · 0 comments
Labels
question Further information is requested

Comments

@Yan824
Copy link

Yan824 commented Dec 10, 2024

Hi, I am following your tutorial in wiki for Chakra ET generation with PyTorch ET and Kineto wiki . But when i try to merge the two trace from matmul, it raises the following error:

root@x08j03287:/workspace/astra-sim/extern/graph_frontend/chakra/example# chakra_trace_link --chakra-host-trace pytorch_et.json --chakra-device-trace kineto_trace.json --rank 0 --output-file chakra_host_device_trace.et
[2024-12-10 06:05:56,631] trace.py:328 [INFO]: /workspace/astra-sim/extern/graph_frontend/chakra/example
[2024-12-10 06:05:56,632] trace_file.py:56 [ERROR]: File /workspace/astra-sim/extern/graph_frontend/chakra/example/kineto_trace.json and file /workspace/astra-sim/extern/graph_frontend/chakra/example/pytorch_et.json has the same rank. Will use /workspace/astra-sim/extern/graph_frontend/chakra/example/pytorch_et.json as the path to rank: 0.
[2024-12-10 06:05:56,632] trace_file.py:94 [INFO]: Rank to trace file map:
{0: '/workspace/astra-sim/extern/graph_frontend/chakra/example/pytorch_et.json'}
[2024-12-10 06:05:56,632] trace.py:474 [INFO]: ranks=[0]
[2024-12-10 06:05:56,632] trace_parser.py:107 [WARNING]: Parsed /workspace/astra-sim/extern/graph_frontend/chakra/example/pytorch_et.json time = 0.00 seconds
[2024-12-10 06:05:56,633] trace_parser.py:430 [WARNING]: Parsed /workspace/astra-sim/extern/graph_frontend/chakra/example/pytorch_et.json backend=json in 0.00 seconds; current PID:58601
Traceback (most recent call last):
File "/usr/local/bin/chakra_trace_link", line 8, in
sys.exit(main())
File "/usr/local/lib/python3.10/dist-packages/chakra/src/trace_link/trace_link.py", line 47, in main
linker.link(args.rank, args.chakra_host_trace, args.chakra_device_trace, args.output_file)
File "/usr/local/lib/python3.10/dist-packages/chakra/src/trace_link/trace_linker.py", line 70, in link
sync_deps = self.load_sync_dependencies(rank, chakra_device_trace)
File "/usr/local/lib/python3.10/dist-packages/chakra/src/trace_link/trace_linker.py", line 120, in load_sync_dependencies
trace_analysis = TraceAnalysis(trace_dir=trace_dir)
File "/workspace/astra-sim/extern/graph_frontend/chakra/HolisticTraceAnalysis/hta/trace_analysis.py", line 37, in init
self.t.load_traces(include_last_profiler_step)
File "/workspace/astra-sim/extern/graph_frontend/chakra/HolisticTraceAnalysis/hta/common/trace.py", line 352, in load_traces
self.parse_traces()
File "/workspace/astra-sim/extern/graph_frontend/chakra/HolisticTraceAnalysis/hta/common/trace.py", line 477, in parse_traces
self.parse_multiple_ranks(ranks, use_multiprocessing and len(ranks) > 1)
File "/workspace/astra-sim/extern/graph_frontend/chakra/HolisticTraceAnalysis/hta/common/trace.py", line 404, in parse_multiple_ranks
result = parse_trace_file(self.trace_files[rank])
File "/workspace/astra-sim/extern/graph_frontend/chakra/HolisticTraceAnalysis/hta/common/trace.py", line 228, in parse_trace_file
add_fwd_bwd_links(df)
File "/workspace/astra-sim/extern/graph_frontend/chakra/HolisticTraceAnalysis/hta/common/trace.py", line 244, in add_fwd_bwd_links
if df.cat.eq("fwdbwd").sum() == 0:
File "/usr/local/lib/python3.10/dist-packages/pandas/core/generic.py", line 6299, in getattr
return object.getattribute(self, name)
AttributeError: 'DataFrame' object has no attribute 'cat'. Did you mean: 'at'?

How to solve this problem?

@Yan824 Yan824 added the question Further information is requested label Dec 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

1 participant