Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing train_sim_traces & test_sim_traces data at Dropbox #134

Open
joybi531 opened this issue Jul 26, 2021 · 4 comments
Open

Missing train_sim_traces & test_sim_traces data at Dropbox #134

joybi531 opened this issue Jul 26, 2021 · 4 comments

Comments

@joybi531
Copy link

joybi531 commented Jul 26, 2021

Hi,
I was trying to run the simulation with the actual data used in the experiment. However, I couldn't find the train_sim_traces and test_sim_traces data at the given link to Dropbox. That might have been removed. Could anyone please provide me the data downloaded from the link that was mentioned in the repository?

@hongzimao
Copy link
Owner

Thanks for the question! I graduated and lost the access to MIT dropbox. Here are some traces I can dug out from my local machine. Not exactly sure if it's the same as the traces in the dropbox link.

If anyone has a copy of the traces, please let us know. We will upload the traces and use updated link accordingly. Thanks!

cooked_traces.zip

@joybi531
Copy link
Author

joybi531 commented Aug 7, 2021

@hongzimao
Thanks for the help. The traces data that you just shared works fine.

@NotSpecial
Copy link

Hi, I am trying to reconstruct the traces used to train Pensieve from the FCC data, but I feel like I am missing a step.

I downloaded the raw data and ran the traces/fcc/load_webget_data.py script.
However, this script returns a very different output than the cooked traces you shared above.

It only contains a single throughput line per row, and the throughputs do not match the data in the FCC traces.

For example, these are the first couple of lines for the trace trace_866_http---www.amazon.com:

This is what is returned by the script in this repository:

1358237
1440783
1387260
1383838
1369546
...

And this is what is part of the archive you sent above:

0.0	1.41832
5.0	1.39856
10.0	1.47412
15.0	1.50672
20.0	1.484064

...

Can you help me figure out what I am missing? How can I get from the output of the script in the traces directory to the training data you shared?

Thank you very much in advance!

@YuxinLiu2023
Copy link

Hi, I am trying to reconstruct the traces used to train Pensieve from the FCC data, but I feel like I am missing a step.

I downloaded the raw data and ran the traces/fcc/load_webget_data.py script. However, this script returns a very different output than the cooked traces you shared above.

It only contains a single throughput line per row, and the throughputs do not match the data in the FCC traces.

For example, these are the first couple of lines for the trace trace_866_http---www.amazon.com:

This is what is returned by the script in this repository:

1358237
1440783
1387260
1383838
1369546
...

And this is what is part of the archive you sent above:

0.0	1.41832
5.0	1.39856
10.0	1.47412
15.0	1.50672
20.0	1.484064

...

Can you help me figure out what I am missing? How can I get from the output of the script in the traces directory to the training data you shared?

Thank you very much in advance!

I think he used the functions in plot_cooked_bandwidth.py to calculate the time_all and bandwidth_all.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants