Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

the kdd dataset is different from other kdd datasets, and how was it processed? #13

Open
Irisansan opened this issue Apr 28, 2020 · 5 comments

Comments

@Irisansan
Copy link

Hi,
I have noticed that there is only 35 features in the kdd dataset given in this project,which is different from other kdd datasets including official ones. And there is no description for how it has been processed,so I am confused.
Could anyone give me a hand?I will be grateful!
Regards,
Iris

@GPla
Copy link

GPla commented Apr 30, 2020

The data comes from here, you can find the list of features under task description. As you mentioned, there are 35 features listed (label not listed on the page). If we compare them with other sources, such as datahub, we can see that they use 7 additional features with information about the destination host. So no preprocessing was done, there are just different version available.

@Irisansan
Copy link
Author

Irisansan commented May 9, 2020 via email

@Irisansan
Copy link
Author

The data comes from here, you can find the list of features under task description. As you mentioned, there are 35 features listed (label not listed on the page). If we compare them with other sources, such as datahub, we can see that they use 7 additional features with information about the destination host. So no preprocessing was done, there are just different version available.

Hey, Have you got the satisfied performance values on SWaT data?Can you tell me how to adjust the parameters settings?Thanks a lot.

@huaxinfeihua
Copy link

The data comes from here, you can find the list of features under task description. As you mentioned, there are 35 features listed (label not listed on the page). If we compare them with other sources, such as datahub, we can see that they use 7 additional features with information about the destination host. So no preprocessing was done, there are just different version available.

The dataset from http://kdd.ics.uci.edu/databases/kddcup99/kddcup99.html has 42 features and no labels, but the code it is have 35 and get labels,What do I have to do to get these labels, please?

@luckyStarForever
Copy link

Would you like to tell me , "What are the requirements for the code project environment ?"

You can also run commands in the project environment:

pip freeze > D:\requirements.txt

Please share the requirements.txt content.
Thank you very much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants