Survey of datasets for stream learning #32

Open
filipebraida opened this issue Sep 15, 2020 · 3 comments
Labels
documentation Improvements or additions to documentation

Comments

@filipebraida
Contributor

The idea is to create a module or package that bundles several well-established datasets from the stream learning area, making it easier to run classic experiments.

Collect the following information for each dataset:

  • Dataset name
  • Where it is hosted
  • License
  • Reference
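
The four fields above could be captured in a small metadata record. The following is only a minimal sketch of one possible layout, not the project's actual API: the names `DatasetInfo`, `REGISTRY`, `register`, and `missing_fields` are hypothetical, and the two example entries reuse names and URLs mentioned in this thread, with the license left empty because it has not been surveyed yet.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class DatasetInfo:
    """Metadata requested in this issue for each stream dataset (hypothetical layout)."""
    name: str
    hosted_at: Optional[str] = None   # where the data is hosted
    license: Optional[str] = None     # None means "not surveyed yet"
    reference: Optional[str] = None   # citation as given in the thread


# Hypothetical in-memory registry keyed by the lowercase dataset name.
REGISTRY: Dict[str, DatasetInfo] = {}


def register(info: DatasetInfo) -> None:
    """Add a dataset to the registry, refusing duplicates."""
    key = info.name.lower()
    if key in REGISTRY:
        raise ValueError(f"dataset already registered: {info.name}")
    REGISTRY[key] = info


def missing_fields(info: DatasetInfo) -> List[str]:
    """Return the metadata fields that still need to be surveyed."""
    return [f for f in ("hosted_at", "license", "reference")
            if getattr(info, f) is None]


# Example entries built only from information posted in this thread.
register(DatasetInfo(
    name="Electricity",
    hosted_at="https://sites.google.com/view/uspdsrepository",
    reference="Harries, 1999",
))
register(DatasetInfo(
    name="OpenLDBWS",
    hosted_at="http://realtime.nationalrail.co.uk/OpenLDBWSRegistration",
))
```

Calling `missing_fields(REGISTRY["electricity"])` then shows which items of the survey are still open for that dataset.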
@filipebraida added the documentation label (Improvements or additions to documentation) on Sep 15, 2020
@filipebraida
Contributor Author

Do you know of any classic datasets, @Conradox?

@nicolasmagalhaes
Member

nicolasmagalhaes commented Sep 16, 2020

I found a dataset from the United Kingdom's railway network. It is free to use, can be included in any project, and can be used to develop applications or research.

OpenLDBWS
http://realtime.nationalrail.co.uk/OpenLDBWSRegistration

I don't know how to tell whether a dataset is good or not, so I decided to post it here.

@raulsenaferreira

raulsenaferreira commented Sep 19, 2020

@Article{SouzaChallenges:2020,
title={Challenges in Benchmarking Stream Learning Algorithms with Real-world Data},
author={Souza, V. M. A. and Reis, D. M. and Maletzke, A. G. and Batista, G. E. A. P. A.},
journal={Data Mining and Knowledge Discovery},
pages={1-54},
year={2020},
doi={10.1007/s10618-020-00698-5}
}

https://sites.google.com/view/uspdsrepository

Airlines | Ikonomovska et al., 2011
Chess | Zliobaite, 2011
Electricity | Harries, 1999
Forest Covertype | Blackard and Dean, 1999
Gas Sensor Array | Vergara et al., 2012
INSECTS-Abrupt (balanced) | Souza et al., 2020
INSECTS-Abrupt (imbalanced) | Souza et al., 2020
INSECTS-Incremental (balanced) | Souza et al., 2020
INSECTS-Incremental (imbalanced) | Souza et al., 2020
INSECTS-Incremental-abrupt-reoccurring (balanced) | Souza et al., 2020
INSECTS-Incremental-abrupt-reoccurring (imbalanced) | Souza et al., 2020
INSECTS-Incremental-gradual (balanced) | Souza et al., 2020
INSECTS-Incremental-gradual (imbalanced) | Souza et al., 2020
INSECTS-Incremental-reoccurring (balanced) | Souza et al., 2020
INSECTS-Incremental-reoccurring (imbalanced) | Souza et al., 2020
INSECTS-Out-of-control | Souza et al., 2020
KDDCUP99 | Tavallaee et al., 2009
Keystroke | Souza et al., 2015
Luxembourg | Zliobaite, 2011
NOAA Weather | Ditzler and Polikar, 2013
Outdoor Objects | Losing et al., 2015
Ozone | Dheeru and Karra Taniskidou, 2017
Poker-hand | Dheeru and Karra Taniskidou, 2017
Powersupply | Zhu, 2010
Rialto Bridge Timelapse | Losing et al., 2016
Sensor Stream | Zhu, 2010
Spam Assassin | Katakis et al., 2009
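
As a rough illustration of how a `name | reference` list like the one above could be turned into records for the survey, here is a small sketch. The function names `parse_dataset_line` and `parse_dataset_list` are hypothetical, and `SAMPLE` contains only a few lines copied from the list; a line without a `|` separator is kept with an empty reference instead of being guessed at.

```python
from typing import List, Optional, Tuple


def parse_dataset_line(line: str) -> Tuple[str, Optional[str]]:
    """Split one 'name | reference' line; the reference is None if absent."""
    name, sep, ref = line.partition("|")
    ref = ref.strip()
    return name.strip(), (ref if sep and ref else None)


def parse_dataset_list(text: str) -> List[Tuple[str, Optional[str]]]:
    """Parse every non-blank line of a pipe-separated dataset list."""
    return [parse_dataset_line(line)
            for line in text.splitlines() if line.strip()]


# A few lines copied from the list in this comment.
SAMPLE = """\
Airlines | Ikonomovska et al., 2011
Electricity | Harries, 1999
INSECTS-Abrupt (balanced) | Souza et al., 2020
"""

for name, reference in parse_dataset_list(SAMPLE):
    print(f"{name}: {reference}")
```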
