Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[No Code] Discover new datasets #1

Open
alexandrebarachant opened this issue Jun 3, 2017 · 34 comments
Open

[No Code] Discover new datasets #1

alexandrebarachant opened this issue Jun 3, 2017 · 34 comments

Comments

@alexandrebarachant
Copy link
Member

alexandrebarachant commented Jun 3, 2017

We need people browsing the web to discover interesting datasets than could be added to the moabb.

You can comment on this issue.

But first, check your dataset is not already in the list

What kind of datasets

We are interested in any datasets of time neural timeseries (EEG, MEG, ECOG, and fNIRS) with a minimum of 5 subjects, where we can apply machine learning algorithms and available online. It does not need to be a BCI dataset, but it must contains different condition/task, labelled and tagged.

How do I search for a new dataset ?

Many of the datasets of the BNCI index have not been reported. you can start here.

Researcher are making more and more datasets available. some database exists and might contains interesting things :

Finally, google is your friend

How much time does it takes ?

Entering a new dataset should took you 2 minutes.

@alexandrebarachant alexandrebarachant changed the title [Easy] Discover new datasets [No Code] Discover new datasets Jun 3, 2017
@vinay-jayaram
Copy link
Collaborator

@alexandrebarachant
Copy link
Member Author

alexandrebarachant commented Apr 1, 2018

Browsing Plos one to find New motor imagery datasets:

@vinay-jayaram
Copy link
Collaborator

re: dataset 1, All the trials are pre-epoched :(

@alexandrebarachant
Copy link
Member Author

yeah. Actually i think the GigaDb dataset is already like this ...

@vinay-jayaram
Copy link
Collaborator

ah you're right, you just concatenated all the trials. In that case we can do the same here :) good good

@vinay-jayaram
Copy link
Collaborator

also regarding the second to last: Have you e-mailed Fabien?

@alexandrebarachant
Copy link
Member Author

I'm definitely not super happy about the concatenation of individual trials. in the case of the GigaDB, the dataset was too large to ignore. in those case, we can contact the authors to ask them about the raw data, but concatenating is a good starting point to see whether the dataset is really interesting or not.

Also, let's contact fabien and camille about the second last dataset. I will do it today.

@vinay-jayaram
Copy link
Collaborator

regarding concatenation though: Couldn't we just add a buffer of zeros before and after each trial to smooth out border effects? After de-meaning the trials to eliminate the issue of offset

@alexandrebarachant
Copy link
Member Author

Yep we could. I think the most problematic part is the non zero mean that create huge edge artifact.
We could also return Mne epochs in that case, but that still not ideal from a filtering point of view.

In any case we might want to put a warning ?

@vinay-jayaram
Copy link
Collaborator

warning is good, will add

@Seburath
Copy link

the list Is not synchronized whit the documentation, why? can I help there?

sylvchev pushed a commit that referenced this issue Jul 13, 2020
sylvchev pushed a commit that referenced this issue Aug 17, 2020
This was referenced Dec 10, 2020
m2-farzan added a commit to m2-farzan/moabb that referenced this issue Feb 10, 2021
I hadn't implemented the part from Paper NeuroTechX#1 Section III.A.6 corre-
ctly which caused wrong output size (8 instead of 32).
sylvchev pushed a commit that referenced this issue Feb 20, 2021
@sylvchev
Copy link
Member

We will use this issue and the associated wiki page to keep track of the dataset that we could add in MOABB. Please, comment this issue if you want to report about a new dataset.

@sylvchev
Copy link
Member

There is a nice dataset here for SSVEP and ERP using EEG and ear-EEG while standing or moving, the data are available here

@Div12345
Copy link
Collaborator

Div12345 commented Jan 1, 2022

@sylvchev
Copy link
Member

sylvchev commented Jun 1, 2022

@okbalefthanded
Copy link

This dataset is interesting for its population age and size, it is based on SSVEP for 100 participants with ages greater than 50 years old:
https://www.nature.com/articles/s41597-022-01372-9

@sylvchev
Copy link
Member

This dataset could be integrated in MOABB, MI with information about subjects: https://zenodo.org/record/7554429

@taziksh
Copy link

taziksh commented Jan 16, 2024

Ideas for EEG datasets: https://www.fieldtriptoolbox.org/faq/open_data/

@okbalefthanded
Copy link

@vmcru
Copy link

vmcru commented Feb 6, 2024

This dataset "Inner Speech Dataset" was published in nature and seems like a good fit to add support.
Paper: Thinking out loud, an open-access EEG-based BCI dataset for inner speech recognition
Data: OpenNeuro link

@HarlockOfficial
Copy link

These are 2 other interesting ones someone pointed out on the Slack channel -

  1. Continuous sensorimotor rhythm based brain computer interface learning in a large population - Data
  2. A large electroencephalographic motor imagery dataset for electroencephalographic brain computer interfaces - Data

Hi @Div12345, I was interested in the second dataset, but unfortunately, I did not find it in the MOABB documentation.
Are there any plans related to adding the second dataset in the near future, or is the dataset already part of the library under some specific section or with a specific name?

@bruAristimunha
Copy link
Collaborator

All the dataset inside this paper: https://arxiv.org/pdf/2402.08656.pdf

@bruAristimunha bruAristimunha pinned this issue Feb 28, 2024
@okbalefthanded
Copy link

okbalefthanded commented Feb 29, 2024

@bruAristimunha
Copy link
Collaborator

@machinelatto
Copy link

machinelatto commented Apr 25, 2024

Is someone working on BEnchmark database Towards BCI Application (https://www.frontiersin.org/journals/neuroscience/articles/10.3389/fnins.2020.00627/full)?

It is an SSVEP dataset with 70 subjects performing a 40-target cued-spelling task. I saw that it was referred on the Datasets to include section, but found no Issue related to it.

@bruAristimunha
Copy link
Collaborator

Hi @machinelatto!

It seems like no one focused on this task, or if someone started, didn't commit or create the PR. Would you be interested?

You would basically need to create two functions, as shown in this tutorial: https://neurotechx.github.io/moabb/auto_tutorials/4_adding_a_dataset.html#sphx-glr-auto-tutorials-4-adding-a-dataset-py

One is to download and one is to load the dataset using mne.

@machinelatto
Copy link

Hi @bruAristimunha !

I'm probably going to use this dataset on my research, so I could try to create those functions in the next weeks. If it goes well I'l open a PR.

@tahatt13 tahatt13 unpinned this issue Jul 8, 2024
@tahatt13 tahatt13 pinned this issue Jul 8, 2024
@okbalefthanded
Copy link

59 Subjects 40 Classes SSVEP dataset

@bruAristimunha
Copy link
Collaborator

bruAristimunha commented Sep 25, 2024

@bruAristimunha
Copy link
Collaborator

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests